Image and Video Generation MCP Repositories

135 repositories in this category.

Showing 30 of 135 repositories (Page 3 of 5)

dalle-mcp

Generate images from text prompts using OpenAI's DALL-E API. Edit existing images and create variations of them while ensuring API key validation for secure access.

Last Updated

No License

mcp-fetch

→

kazuph

Fetch web content and process images to facilitate efficient interaction with online resources. Supports integration with MCP clients like Claude Desktop for seamless content management.

Last Updated

MIT License

PromptShopMCP

→

Kira-Pgr

Transforms images based on natural language commands, enabling users to edit photos by describing desired changes such as adding accessories or modifying backgrounds.

Last Updated

MIT License

Al-StoryLab

→

aigc17

AI-StoryLab generates interactive stories with accompanying audio effects and provides illustration prompts. It leverages AI services for story creation, voice synthesis, sound effect generation, and suggests relevant audio placements.

Last Updated

No License

deep-research-mcp

→

Hajime-Y

Provides advanced web search capabilities, document analysis, and image processing. Extracts information from various sources including PDFs and YouTube transcripts efficiently.

Last Updated

Apache License 2.0

unsplash-mcp-server

→

hellokaton

Connects to Unsplash's image library to perform advanced searches and apply filters on keywords for rich, high-quality image retrieval.

Last Updated

174

MIT License

imagen-3.0-generate-google-mcp-server

→

falahgs

Generates high-quality images using Google's Imagen 3.0 model via the Gemini API, manages image files with intelligent naming, and creates HTML previews for local viewing. Integrates seamlessly with MCP-compatible hosts for enhanced AI capabilities.

Last Updated

No License

pollinations-mcp

→

bendusy

Connects AI models to Pollinations.ai's services for generating images and text via the MCP protocol. Facilitates seamless interaction with Pollinations.ai's API for image generation, downloading images, and text generation.

Last Updated

Apache License 2.0

hh-mcp-comfyui

→

zjf2671

Integrates with local ComfyUI instances via API calls to enable natural language-driven image generation. Supports dynamic parameter replacement in workflows and automatic loading of workflow files as resources.

Last Updated

MIT License

mcp-images

→

IA-Programming

Fetch and process images from URLs and local file paths, handling automatic compression and MIME type retrieval. Images are returned as base64-encoded strings to facilitate integration and support parallel processing with robust error handling.

Last Updated

MIT License

together-mcp-server

→

manascb1344

Generate high-quality images using the Flux.1 Schnell model by specifying customizable parameters such as width and height, while ensuring clear error handling for prompt validation and API interactions.

Last Updated

MIT License

openrouter-mcp-multimodal

→

stabgan

Combines text chat and image analysis capabilities to conduct multimodal conversations and handle custom queries seamlessly. Optimizes workflows with intelligent model selection and performance improvements.

Last Updated

No License

mcp-fetch

→

JeremyNixon

Fetches web content and processes images for integration with AI models, streamlining the retrieval and handling of online content in various applications.

Last Updated

MIT License

vidu-mcp-server

→

el-el-san

Generate videos from static images using advanced AI models, while monitoring the status of video generation tasks and uploading images for processing.

Last Updated

MIT License

MCP-image-gen

→

rmcendarfer2017

Generate stunning images using advanced AI models with a built-in storage system for managing and accessing creations. Users can customize image styles and utilize a prompt-based interface for generating images.

Last Updated

MIT License

mcp-video-gen

→

wheattoast11

Generate videos and images from text prompts or existing images using advanced AI models, with capabilities for audio addition, content upscaling, and prompt enhancement. Manage and refine AI-generated content through API interactions with RunwayML and Luma AI.

Last Updated

No License

mcp-asset-gen

→

jbrower95

Generate high-quality image assets for game or web development by providing descriptive prompts. Streamline asset creation workflows with automated image generation through AI.

Last Updated

MIT License

image-server

→

PawNzZi

Transform text prompts into images using advanced AI techniques, creating unique visuals tailored to user descriptions.

Last Updated

No License

image-tools-mcp

→

kshern

Retrieve image dimensions, compress images, and convert images to various formats using local files or URLs. Supports image processing with detailed output on dimensions, types, and compression information.

Last Updated

MIT License

MiniMax-MCP-JS

→

MiniMax-AI

Integrates with MiniMax's AI capabilities to facilitate interaction with multimedia generation tools, including image generation, video generation, text-to-speech, and voice cloning. Supports a flexible and configurable JavaScript/TypeScript framework for versatile deployment scenarios.

Last Updated

MIT License

video-editing-mcp

→

burningion

Upload, edit, search, and generate videos using large language models and Video Jungle's tools. The server enables interaction with videos through a custom URI scheme for managing individual videos and projects.

Last Updated

214

No License

ComfyUI_StoryDiffusion

→

396001000

ComfyUI_StoryDiffusion allows users to create visually enhanced stories by integrating advanced image generation features into the ComfyUI platform. It utilizes the StoryDiffusion and MS-Diffusion models for creative storytelling through visuals.

Last Updated

Unknown

mcp-3d-style-cartoon-gen-server

→

falahgs

Generates high-quality 3D-style cartoon images from text prompts using Google's Gemini AI, with child-friendly designs for engaging visuals. Offers secure file system operations for managing files, including reading and writing capabilities.

Last Updated

No License

mcp-ffmpeg

→

bitscorp-mcp

Manipulate video files by resizing them to various resolutions and extracting audio in multiple formats. Interact with video processing capabilities using natural language requests via API calls.

Last Updated

No License

flux-imagegen-mcp-server

→

falahgs

Generates and manipulates images using advanced AI models, offering functionalities such as image URL generation, direct image creation from text prompts, and management of multiple image generation models.

Last Updated

MIT License

shaka-packager-mcp-server

→

coderjun

Supports advanced video transcoding, packaging, and analysis using Shaka Packager. Facilitates format conversion, DRM application, and content preparation for streaming, featuring intelligent path handling and error management.

Last Updated

MIT License

mcp-screenshot

→

kazuph

Captures screenshots and performs OCR text recognition on macOS. Supports both Japanese and English text, offering multiple output formats.

Last Updated

MIT License

mcp-hfspace

→

evalstate

Connects to Hugging Face Spaces to access various AI models for tasks including image generation, text-to-speech, speech-to-text, and chat functionalities, requiring minimal setup.

Last Updated

360

MIT License

mcp_read_images

→

catalystneuro

Analyze images using OpenRouter vision models like Claude-3.5-sonnet and Claude-3-opus through a simple API interface.

Last Updated

MIT License

mcp-gemini

→

techkwon

Leverages Google's Gemini API to generate text, create and analyze images, perform video analysis on YouTube content, and conduct web searches. Provides a range of advanced AI functionalities for various applications.

Last Updated

No License

← Previous 1 2 3 4 5 Next →

Go to page:

← Back to MCP Directory 🙏 Credits & Acknowledgments