Image and Video Generation MCP Repositories

135 repositories in this category.

Showing 30 of 135 repositories (Page 4 of 5)

mcp-image-extractor

Extracts images from local files and URLs, processing them into base64 format for analysis by large language models (LLMs). Suitable for analyzing image-based data, such as screenshots from tests.

Last Updated

MIT License

jigsawstack-mcp-server

→

JigsawStack

Generate images from text using advanced AI models. The server facilitates the integration and management of image generation tools within an MCP framework.

Last Updated

No License

Grok-MCP

→

8bitsats

MCP server for generating images using Grok's AI image generation capabilities, accepting text prompts and returning images as URLs or base64-encoded data. Supports multiple image generation requests and error handling, with configuration options for API keys.

Last Updated

No License

flux-schnell-mcp

→

ckz

Generate images from text prompts using the Replicate API, enabling users to create customized visuals based on detailed descriptions. The server manages the communication with the API and handles errors effectively.

Last Updated

MIT License

ideagram-mcp-server

→

Sunwood-ai-labs

Generate images based on prompts with customizable parameters like aspect ratio and style using the Ideogram API.

Last Updated

No License

openai-gpt-image-mcp

→

SureScaleAI

Generate and edit images using the latest OpenAI GPT-4o and gpt-image-1 models with advanced prompt control. Outputs can be saved to disk or received in base64 format for integration with MCP-compatible clients.

Last Updated

MIT License

piapi-mcp-server

→

apinetwork

Integrates with PiAPI's API to facilitate media content generation using various services like Midjourney, Flux, and more. It connects AI models with tools for seamless content creation directly from applications that support the Model Context Protocol.

Last Updated

MIT License

replicate-flux-mcp

→

awkoy

Generate images from text prompts using advanced AI models. Customize parameters for tailored outputs with secure and local processing.

Last Updated

MIT License

image-generator-mcp-server

→

sammyl720

Connects to OpenAI's DALL-E 3 model to generate images based on user prompts, saving the results to a specified directory on the user's desktop.

Last Updated

No License

together-ai-image-server

→

zym9863

Generates images from text prompts using Together AI's image generation models via the MCP protocol. It supports optional parameters for fine-tuning the image generation process.

Last Updated

MIT License

tinypng-mcp-server

→

aiyogg

Compress images using the TinyPNG API to reduce file size while maintaining quality. Integrate image optimization into various projects seamlessly.

Last Updated

Apache License 2.0

ffmpeg-mcp

→

video-creator

Enables local video search, trimming, stitching, and playback through conversational commands using ffmpeg. Provides tools for finding, clipping, concatenating, and playing video files on macOS platforms.

Last Updated

MIT License

mcp-image-placeholder

→

husniadil

Generates placeholder images from multiple providers, supporting both simple and real images as placeholders. Validates input parameters and returns image URLs for immediate use.

Last Updated

MIT License

mermaid-mcp-server

→

peng-shawn

Converts Mermaid diagram descriptions into high-quality PNG images using the Mermaid markdown syntax. Supports customizable themes and backgrounds for visual representations of data and processes.

Last Updated

185

MIT License

mcp-replicate

→

deepfates

Access Replicate models to run predictions through a tool-based interface, facilitating interactions with various AI models hosted on Replicate's platform.

Last Updated

MIT License

Image-Generation-MCP-Server

→

GongRzhe

Generate images from text prompts using the Replicate Flux model, enabling the creation of unique visuals tailored to specific specifications.

Last Updated

MIT License

moondream-mcp

→

NightTrek

Advanced image analysis capabilities including captioning, object detection, and visual question answering for applications requiring sophisticated computer vision tasks.

Last Updated

Apache License 2.0

LoganLxb

→

LoganLxb

Logan provides tools and applications aimed at enhancing user interaction in mixed reality environments through augmented and virtual reality technologies. It focuses on facilitating the development of immersive digital experiences and applications.

Last Updated

No License

4oimage-mcp

→

Antipas

Generate and edit high-quality images using text prompts. Transform existing images or create new visuals and 3D characters with real-time updates and automatic viewing in the browser.

Last Updated

MIT License

mcp-image-generator

→

joshmouch

Generate, edit, and create variations of images using OpenAI's DALL-E API, supporting multiple DALL-E models with customizable parameters. Validate OpenAI API keys for seamless operation.

Last Updated

Unknown

mcp-webcam

→

evalstate

Streams live images from a webcam to an MCP Client, supporting both capturing frames and taking screenshots.

Last Updated

MIT License

jimeng-mcp

→

c-rick

Integrates with the Jimeng AI service to generate images from text prompts. Supports customization of image parameters such as size, quality, and negative prompts without the need for third-party APIs.

Last Updated

No License

unet

→

vishwa684

Train and deploy U-Net models for biomedical image segmentation using the Medical Decathlon dataset, with support for both 2D and 3D U-Net scripts. Visualize predictions and assess model performance through comprehensive demos and visual outputs.

Last Updated

Apache License 2.0

image-mcp-server-gemini

→

murataskin

Analyzes images and videos by providing URLs or local file paths, allowing for detailed insights and descriptions of the content. Uses the Gemini 2.0 Flash model for high-precision recognition and can evaluate relationships between multiple visual inputs.

Last Updated

MIT License

unsplash-smart-mcp-server

→

drumnation

Connects AI models to Unsplash for searching and delivering stock photos with context-aware selection and automatic attribution management.

Last Updated

MIT License

luma-ai-mcp-server

→

bobtista

Integrates with Luma AI's Dream Machine API to facilitate the generation and manipulation of AI-generated videos and images. Offers tools for text-to-video generation, image processing, and audio integration to enhance creative projects.

Last Updated

No License

modal-mcp-toolbox

→

philipp-eisen

A collection of tools that provides a sandboxed environment for executing Python code and generating images using the FLUX model.

Last Updated

MIT License

mcp-image-compression

→

InhiblabCore

Optimizes images by compressing various formats for faster loading and improved user experience, while offering features like offline usage and batch processing. Supports smart compression to balance file size and visual quality based on image content.

Last Updated

MIT License

everart-forge-mcp

→

nickbaumann98

Generates and converts vector and raster images using advanced AI models with support for multiple formats. Provides flexible storage options and automatic formatting for efficient image processing.

Last Updated

No License

MCP-Storybook-Image-Generator

→

falahgs

Generates high-quality storybook images and matching children's stories using Google's Gemini AI, offering multiple art styles such as 3D cartoon, watercolor, and pixel art. It allows instant previewing of creations and saves them locally in an organized manner.

Last Updated

No License

← Previous 1 ... 1 2 3 4 5 Next →

Go to page:

← Back to MCP Directory 🙏 Credits & Acknowledgments