Multimodal Input Processing MCP Repositories

4 repositories in this category.

Showing 4 of 4 repositories (Page 1 of 1)

MCPollinations

Generates images, text, and audio from prompts using the Pollinations APIs. It supports returning images as base64-encoded data and allows listing available models for image and text generation.

Last Updated

MIT License

openrouter-mcp-multimodal

→

stabgan

Combines text chat and image analysis capabilities to conduct multimodal conversations and handle custom queries seamlessly. Optimizes workflows with intelligent model selection and performance improvements.

Last Updated

No License

touchdesigner-mcp

→

8beeeaaat

The TouchDesigner MCP Server allows AI agents to interact with TouchDesigner projects by creating, modifying, and deleting project elements, as well as executing Python scripts to automate tasks.

Last Updated

MIT License

vedit-mcp

→

zakahan

Enables video editing through natural language commands for basic editing operations. Integrates with projects to automate video processing tasks using ffmpeg.

Last Updated

MIT License

← Previous 1 Next →

Go to page:

← Back to MCP Directory 🙏 Credits & Acknowledgments