Image and Video Generation MCP Repositories
135 repositories in this category.
mcp-image-extractor
→
Extracts images from local files and URLs, processing them into base64 format for analysis by large language models (LLMs). Suitable for analyzing image-based data, such as screenshots from tests.
jigsawstack-mcp-server
→
Generate images from text using advanced AI models. The server facilitates the integration and management of image generation tools within an MCP framework.
Grok-MCP
→
MCP server for generating images using Grok's AI image generation capabilities, accepting text prompts and returning images as URLs or base64-encoded data. Supports multiple image generation requests and error handling, with configuration options for API keys.
flux-schnell-mcp
→
Generate images from text prompts using the Replicate API, enabling users to create customized visuals based on detailed descriptions. The server manages the communication with the API and handles errors effectively.
ideagram-mcp-server
→
Generate images based on prompts with customizable parameters like aspect ratio and style using the Ideogram API.
openai-gpt-image-mcp
→
Generate and edit images using the latest OpenAI GPT-4o and gpt-image-1 models with advanced prompt control. Outputs can be saved to disk or received in base64 format for integration with MCP-compatible clients.
piapi-mcp-server
→
Integrates with PiAPI's API to facilitate media content generation using various services like Midjourney, Flux, and more. It connects AI models with tools for seamless content creation directly from applications that support the Model Context Protocol.
replicate-flux-mcp
→
Generate images from text prompts using advanced AI models. Customize parameters for tailored outputs with secure and local processing.
image-generator-mcp-server
→
Connects to OpenAI's DALL-E 3 model to generate images based on user prompts, saving the results to a specified directory on the user's desktop.
together-ai-image-server
→
Generates images from text prompts using Together AI's image generation models via the MCP protocol. It supports optional parameters for fine-tuning the image generation process.
tinypng-mcp-server
→
Compress images using the TinyPNG API to reduce file size while maintaining quality. Integrate image optimization into various projects seamlessly.
ffmpeg-mcp
→
Enables local video search, trimming, stitching, and playback through conversational commands using ffmpeg. Provides tools for finding, clipping, concatenating, and playing video files on macOS platforms.
mcp-image-placeholder
→
Generates placeholder images from multiple providers, supporting both simple and real images as placeholders. Validates input parameters and returns image URLs for immediate use.
mermaid-mcp-server
→
Converts Mermaid diagram descriptions into high-quality PNG images using the Mermaid markdown syntax. Supports customizable themes and backgrounds for visual representations of data and processes.
mcp-replicate
→
Access Replicate models to run predictions through a tool-based interface, facilitating interactions with various AI models hosted on Replicate's platform.
Image-Generation-MCP-Server
→
Generate images from text prompts using the Replicate Flux model, enabling the creation of unique visuals tailored to specific specifications.
moondream-mcp
→
Advanced image analysis capabilities including captioning, object detection, and visual question answering for applications requiring sophisticated computer vision tasks.
LoganLxb
→
Logan provides tools and applications aimed at enhancing user interaction in mixed reality environments through augmented and virtual reality technologies. It focuses on facilitating the development of immersive digital experiences and applications.
4oimage-mcp
→
Generate and edit high-quality images using text prompts. Transform existing images or create new visuals and 3D characters with real-time updates and automatic viewing in the browser.
mcp-image-generator
→
Generate, edit, and create variations of images using OpenAI's DALL-E API, supporting multiple DALL-E models with customizable parameters. Validate OpenAI API keys for seamless operation.
mcp-webcam
→
Streams live images from a webcam to an MCP Client, supporting both capturing frames and taking screenshots.
jimeng-mcp
→
Integrates with the Jimeng AI service to generate images from text prompts. Supports customization of image parameters such as size, quality, and negative prompts without the need for third-party APIs.
unet
→
Train and deploy U-Net models for biomedical image segmentation using the Medical Decathlon dataset, with support for both 2D and 3D U-Net scripts. Visualize predictions and assess model performance through comprehensive demos and visual outputs.
image-mcp-server-gemini
→
Analyzes images and videos by providing URLs or local file paths, allowing for detailed insights and descriptions of the content. Uses the Gemini 2.0 Flash model for high-precision recognition and can evaluate relationships between multiple visual inputs.
unsplash-smart-mcp-server
→
Connects AI models to Unsplash for searching and delivering stock photos with context-aware selection and automatic attribution management.
luma-ai-mcp-server
→
Integrates with Luma AI's Dream Machine API to facilitate the generation and manipulation of AI-generated videos and images. Offers tools for text-to-video generation, image processing, and audio integration to enhance creative projects.
modal-mcp-toolbox
→
A collection of tools that provides a sandboxed environment for executing Python code and generating images using the FLUX model.
mcp-image-compression
→
Optimizes images by compressing various formats for faster loading and improved user experience, while offering features like offline usage and batch processing. Supports smart compression to balance file size and visual quality based on image content.
everart-forge-mcp
→
Generates and converts vector and raster images using advanced AI models with support for multiple formats. Provides flexible storage options and automatic formatting for efficient image processing.
MCP-Storybook-Image-Generator
→
Generates high-quality storybook images and matching children's stories using Google's Gemini AI, offering multiple art styles such as 3D cartoon, watercolor, and pixel art. It allows instant previewing of creations and saves them locally in an organized manner.
