logo
Free, unlimited AI code reviews that run on commit
git-lrc git-lrc GitHub Install Now We'd appreciate a star git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt

Image and Video Generation MCP Repositories

135 repositories in this category.

Showing 30 of 135 repositories (Page 2 of 5)

MCP-LOGO-GEN

sshtunnelvision
MCP-LOGO-GEN logo

Logo generation using AI tools, including features for image creation, background removal, and automatic scaling for high-quality outputs in various sizes.

Last Updated
GitHub 171
NPM 0
1
GNU General Public License v3.0

ComfyUI

dangtanloc
ComfyUI logo

A visual graph-based interface for designing and executing advanced stable diffusion pipelines, enabling users to create complex workflows without coding. It features smart memory management and asynchronous processing, supporting both GPU and CPU usage for offline functionality.

Last Updated
GitHub 0
NPM 0
1
GNU General Public License v3.0

omniparser-autogui-mcp

NON906
omniparser-autogui-mcp logo

Analyzes the screen using OmniParser to automatically operate graphical user interfaces. It provides capabilities for interpreting visual content and executing GUI actions based on analysis.

Last Updated
GitHub 55
NPM 0
1
MIT License

CnOCR-TextExtractor

breezedeus
CnOCR-TextExtractor logo

A comprehensive Python toolkit engineered for robust Optical Character Recognition (OCR) across Chinese scripts, the Latin alphabet, and numerical sequences. It facilitates utilization of pre-trained recognition systems or supports user-defined model calibration, delivering advanced text extraction capabilities for diverse computational vision pipelines.

Last Updated
GitHub 3.7K
NPM 0
1
Apache License 2.0

flux-schnell-server

m-mcp
flux-schnell-server logo

Provides an MCP protocol-based API for generating images from text prompts with customizable dimensions and reproducible results using a specified random seed. Supports asynchronous streaming responses and integration with Hugging Face model services.

Last Updated
GitHub 1
NPM 0
1
No License

mcp-mavae

xenoailimited
mcp-mavae logo

A Model Context Protocol (MCP) server for interacting with image media tools, providing capabilities for image generation, editing, and management of collections and models.

Last Updated
GitHub 0
NPM 0
1
No License

mcp_media_generator

dvejsada
mcp_media_generator logo

Create images using the Amazon Nova Canvas model and videos using the Amazon Nova Reel model. Connects to existing tools for media generation and storage.

Last Updated
GitHub 3
NPM 0
1
No License

mcp-image-recognition

mario-andreschak
mcp-image-recognition logo

Leverages image recognition capabilities to analyze and describe images using advanced vision APIs. Supports multiple formats and allows for optional text extraction from images.

Last Updated
GitHub 26
NPM 0
1
MIT License

cos-mcp

Tencent
cos-mcp logo

Integrate large language models with Tencent Cloud Object Storage (COS) and Data Insight (CI), enabling file management, automated cloud data handling, and various image and video processing tasks. Supports natural language-based metadata search and efficient backup workflows.

Last Updated
GitHub 15
NPM 0
1
Other

ImageOnC

luojunhui1
ImageOnC logo

Implement vehicle license plate recognition using C/C++ on FPGA, utilizing OpenCV for image display and Eigen for optimized matrix operations. The project includes code for training neural networks and processing license plate images.

Last Updated
GitHub 1
NPM 0
1
No License

mcp-veo2

mario-andreschak
mcp-veo2 logo

Generates high-quality videos from text prompts or images using Google's Veo2 model and provides access to these generated videos through MCP resources.

Last Updated
GitHub 30
NPM 0
1
MIT License

gpt-image-1-mcp

CLOUDWERX-DEV
gpt-image-1-mcp logo

Enables AI assistants to generate and edit images from text prompts, supporting both creation and modification of images using specified masks. Integrates with various MCP clients and provides flexible workflows for image handling, including automatic file saving and comprehensive error reporting.

Last Updated
GitHub 16
NPM 0
1
MIT License

tinypng-mcp-server

beordle
tinypng-mcp-server logo

Compress images efficiently using the TinyPNG API. Supports both local and remote image compression with minimal setup required.

Last Updated
GitHub 0
NPM 0
1
Apache License 2.0

game-asset-mcp

MubarakHAlketbi
game-asset-mcp logo

Generates 2D and 3D game assets from text prompts using AI models. Integrates with Hugging Face Spaces for asset generation, facilitating rapid prototyping for game developers.

Last Updated
GitHub 85
NPM 0
1
MIT License

image-generator-mcp-server

luoshui-coder
image-generator-mcp-server logo

Generates images based on prompts using OpenAI's DALL-E model, saving them in a specified directory on the user's desktop.

Last Updated
GitHub 1
NPM 0
1
No License

imagen3-mcp

hamflx
imagen3-mcp logo

Generate high-quality images using Google's Imagen 3.0 model through an MCP interface, facilitating integration with tools like Cherry Studio or Cursor. Supports configurable deployment options using a Google Gemini API key.

Last Updated
GitHub 43
NPM 0
1
No License

mcp-templateio

Lucker631
mcp-templateio logo

Generates customized visuals by creating images based on templates using the Templated.io API. Supports dynamic graphics creation through user-provided text and image URLs.

Last Updated
GitHub 0
NPM 0
1
No License

mcp-server-gemini-image-generator

qhdrl12
mcp-server-gemini-image-generator logo

Generate high-quality images from text prompts using the Gemini AI model, manage local image storage, and facilitate creative modifications of existing images.

Last Updated
GitHub 23
NPM 0
1
MIT License

DiffuGen

CLOUDWERX-DEV
DiffuGen logo

Seamlessly generate AI images directly within development environments by leveraging local Stable Diffusion models and precise control over parameters. Integrate with MCP-compatible IDEs to facilitate creative development without disruption.

Last Updated
GitHub 15
NPM 0
1
MIT License

tupianyasuo

laosu888
tupianyasuo logo

A front-end image compression tool supporting various formats like PNG and JPG, enabling users to customize compression ratios and preview results in real-time. The application allows users to download optimized images with comparisons of file sizes before and after compression.

Last Updated
GitHub 0
NPM 0
1
No License

mcp-server-amazon-bedrock

zxkane
mcp-server-amazon-bedrock logo

Integrates with Amazon Bedrock's Nova Canvas model to generate high-quality images based on text descriptions. Provides advanced features for refining image composition through negative prompts and allows control over image dimensions and quality.

Last Updated
GitHub 21
NPM 0
1
MIT License

mcp-imagegen

GMKR
mcp-imagegen logo

Generate images from text prompts using advanced AI models. Supports both local and SSE endpoint configurations with specific provider requirements.

Last Updated
GitHub 4
NPM 0
1
No License

pixabay-mcp

zym9863
pixabay-mcp logo

Connect to the Pixabay API to search for images and retrieve formatted results that include image URLs and metadata. Handle errors seamlessly during API interactions for reliable performance.

Last Updated
GitHub 4
NPM 0
1
MIT License

StyleCLIP

attarmau
StyleCLIP logo

A CLIP-based fashion recommendation system that enables users to upload clothing images and receive similar clothing tag recommendations through an interactive web interface. It utilizes YOLO for clothing detection and integrates seamlessly with an MCP framework.

Last Updated
GitHub 0
NPM 0
1
Apache License 2.0

aws-nova-canvas-mcp

yunwoong7
aws-nova-canvas-mcp logo

Generate and edit images with advanced features such as text-to-image generation, image inpainting, and background removal, using the Nova Canvas model from Amazon Bedrock.

Last Updated
GitHub 4
NPM 0
1
MIT License

mcp-image-downloader

qpd-v
mcp-image-downloader logo

Provides tools for downloading images from URLs and performing basic image optimization tasks such as resizing, quality adjustment, and format conversion.

Last Updated
GitHub 11
NPM 0
1
Apache License 2.0

jina-ai-mcp-multimodal-search

Sheshiyer
jina-ai-mcp-multimodal-search logo

Seamless integration with Jina AI's neural search capabilities enables semantic, image, and cross-modal searches through a simple interface. Perform searches based on natural language queries, visual similarities, and text-to-image or image-to-text conversions.

Last Updated
GitHub 4
NPM 0
1
MIT License

GarbageSorting

nansasuke
GarbageSorting logo

Identify and classify waste using image and voice recognition techniques to streamline the recycling process and enhance environmental awareness.

Last Updated
GitHub 0
NPM 0
1
No License

MCPollinations

pinkpixel-dev
MCPollinations logo

Generates images, text, and audio from prompts using the Pollinations APIs. It supports returning images as base64-encoded data and allows listing available models for image and text generation.

Last Updated
GitHub 34
NPM 0
1
MIT License

mcp-flux-schnell

bytefer
mcp-flux-schnell logo

Generate images from text descriptions using the Flux Schnell model through an MCP interface. This server connects with Cloudflare's Flux Schnell worker API to deliver image generation capabilities.

Last Updated
GitHub 5
NPM 0
1
MIT License
Go to page: