logo
Free, unlimited AI code reviews that run on commit
git-lrc git-lrc GitHub Install Now We'd appreciate a star git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt

Speech Recognition and Synthesis MCP Repositories

31 repositories in this category.

Showing 30 of 31 repositories (Page 1 of 2)

MCPollinations

pinkpixel-dev
MCPollinations logo

Generates images, text, and audio from prompts using the Pollinations APIs. It supports returning images as base64-encoded data and allows listing available models for image and text generation.

Last Updated
GitHub 34
NPM 0
1
MIT License

speech-mcp

Kvadratni
speech-mcp logo

Provides a voice interface for real-time audio interaction, converting spoken words into text and generating spoken responses. Includes features like audio visualization and a modern user interface for an engaging conversational experience.

Last Updated
GitHub 71
NPM 0
1
MIT License

vapi-mcp

mrgeeko
vapi-mcp logo

Integrate voice AI capabilities into applications for managing voice assistants and conducting outbound calls. Provides advanced features for enhancing user interactions through voice conversations.

Last Updated
GitHub 0
NPM 0
1
No License

GarbageSorting

nansasuke
GarbageSorting logo

Identify and classify waste using image and voice recognition techniques to streamline the recycling process and enhance environmental awareness.

Last Updated
GitHub 0
NPM 0
1
No License

Zonos-TTS-MCP

PhialsBasement
Zonos-TTS-MCP logo

Facilitates text-to-speech capabilities using Claude, supporting various emotions and languages for speech generation.

Last Updated
GitHub 14
NPM 0
1
No License

VoiceMacroProject

ImOrenge
VoiceMacroProject logo

VoiceMacro enables executing keyboard shortcuts and macros through voice commands on Windows. It supports custom voice command configurations and manages presets for frequent macro operations while running in the background.

Last Updated
GitHub 0
NPM 0
1
No License

elevenlabs-mcp

elevenlabs
elevenlabs-mcp logo

This server provides APIs for generating speech, voice cloning, and audio transcription. It facilitates seamless interaction with text-to-speech and audio processing functionalities.

Last Updated
GitHub 998
NPM 0
1
MIT License

typecast-api-mcp-server-sample

neosapience
typecast-api-mcp-server-sample logo

Integrates with the Typecast API to manage voices, convert text to speech, and play audio. Provides a standardized MCP interface for seamless interaction with voice capabilities.

Last Updated
GitHub 2
NPM 0
1
No License

Votars-MCP

scarletlabs-ai
Votars-MCP logo

Integrate advanced AI functionalities for processing complex tasks through robust APIs. Supports voice recording, transcription, and intelligent AI processing for meetings.

Last Updated
GitHub 27
NPM 0
1
No License

chatgpt-on-wechat

rsagacom
chatgpt-on-wechat logo

A multi-platform intelligent dialogue service that supports text, voice, and image interactions. It can connect to various AI models and allows for custom enterprise AI applications through plugin extensions.

Last Updated
GitHub 0
NPM 0
1
MIT License

whisper.cpp

anilcosaran
whisper.cpp logo

Transcribes and translates audio files using a lightweight implementation of OpenAI's Whisper model, optimized for speed and low memory usage across various platforms.

Last Updated
GitHub 0
NPM 0
1
MIT License

jessica

georgi-io
jessica logo

Integrates ElevenLabs Text-to-Speech capabilities for seamless text conversion to speech, offering voice selection and management through a modern interface. Supports real-time communication with a FastAPI backend and a React frontend.

Last Updated
GitHub 1
NPM 0
1
No License

edge_tts_mcp_server

yuiseki
edge_tts_mcp_server logo

Provide natural text-to-speech conversion using Microsoft Edge's speech synthesis capabilities, enabling customizable voice output in multiple languages with adjustable speed and pitch.

Last Updated
GitHub 5
NPM 0
1
No License

MiniMax-MCP-JS

MiniMax-AI
MiniMax-MCP-JS logo

Integrates with MiniMax's AI capabilities to facilitate interaction with multimedia generation tools, including image generation, video generation, text-to-speech, and voice cloning. Supports a flexible and configurable JavaScript/TypeScript framework for versatile deployment scenarios.

Last Updated
GitHub 84
NPM 0
1
MIT License

mcp_servers

DefiBax
mcp_servers logo

Record audio and transcribe it using advanced AI models like OpenAI's Whisper. Supports integration with AI agents for enhanced interactivity and includes prompts for common recording scenarios.

Last Updated
GitHub 6
NPM 0
1
MIT License

rime-mcp

MatthewDailey
rime-mcp logo

Convert text to speech and play it through the system's audio with high-quality voice synthesis. Customize speech behavior using environment variables for tailored interactions.

Last Updated
GitHub 19
NPM 0
1
The Unlicense

fishaudio-mcp

CengSin
fishaudio-mcp logo

Converts text into natural human speech with customizable audio formats and bitrates, while integrating seamlessly with MCP-compatible applications.

Last Updated
GitHub 2
NPM 0
1
No License

elevenlabs-mcp-server

mamertofabian
elevenlabs-mcp-server logo

Integrates with ElevenLabs text-to-speech API to generate audio from text input, manage voice generation tasks, and store history using an SQLite database. Includes a sample SvelteKit client for performing text-to-speech conversions and managing script parts.

Last Updated
GitHub 112
NPM 0
1
MIT License

Al-StoryLab

aigc17
Al-StoryLab logo

AI-StoryLab generates interactive stories with accompanying audio effects and provides illustration prompts. It leverages AI services for story creation, voice synthesis, sound effect generation, and suggests relevant audio placements.

Last Updated
GitHub 0
NPM 0
1
No License

bouyomichan-mcp-nodejs

uraoz
bouyomichan-mcp-nodejs logo

Provides text-to-speech capabilities using BouyomiChan's Yukkuri voice, enabling voice output from text commands with customizable options for voice type, volume, speed, and pitch. Integrates seamlessly with Claude for Desktop for enhanced user interaction.

Last Updated
GitHub 2
NPM 0
1
MIT License

Audio-MCP-Server

GongRzhe
Audio-MCP-Server logo

Enables interaction with a computer's audio system by listing audio devices, recording audio from microphones, and playing back recordings or audio files. Facilitates audio management and integrates audio input and output control for AI assistants.

Last Updated
GitHub 4
NPM 0
1
MIT License

kokoro-tts-mcp

giannisanni
kokoro-tts-mcp logo

Integrates text-to-speech capabilities using the Kokoro TTS engine, enabling conversion of written content into spoken audio with customizable voices and adjustable speed. Supports saving audio files and cross-platform playback.

Last Updated
GitHub 10
NPM 0
1
No License

retellai-mcp-server

abhaybabbar
retellai-mcp-server logo

Manage and interact with RetellAI's voice services, facilitating call management, voice agent creation, phone number provisioning, and voice option access through a unified interface.

Last Updated
GitHub 18
NPM 0
1
No License

tts-mcp

nakamurau1
tts-mcp logo

Integrates high-quality text-to-speech capabilities into applications, converting text to audio with customizable voice options and output formats. Provides a command-line tool for quick conversions and supports various parameters for audio customization.

Last Updated
GitHub 1
NPM 0
1
MIT License

say-mcp-server

bmorphism
say-mcp-server logo

Provides text-to-speech functionality using macOS's built-in `say` command, allowing the generation of spoken output from text input.

Last Updated
GitHub 18
NPM 0
1
MIT License

koroko-speech-mcp

hammeiam
koroko-speech-mcp logo

Provides text-to-speech capabilities using the Kokoro TTS model, converting text into natural-sounding speech with customizable options and multiple voice choices.

Last Updated
GitHub 1
NPM 0
1
No License

audio-transcriber-mcp

Ichigo3766
audio-transcriber-mcp logo

Transcribes audio files using OpenAI's speech-to-text capabilities, enabling accurate audio transcriptions and the option to save them directly to files.

Last Updated
GitHub 7
NPM 0
1
MIT License

aivis-speech-mcp

kentaro
aivis-speech-mcp logo

Integrate with the AivisSpeech Engine to provide high-quality speech synthesis capabilities for applications, facilitating the conversion of text to natural-sounding speech. The server offers a type-safe API compliant with the Model Context Protocol, ensuring easy configuration and extensibility.

Last Updated
GitHub 0
NPM 0
1
No License

voicevox-mcp-server

Dosugamea
voicevox-mcp-server logo

Provides voice synthesis capabilities compatible with VOICEVOX and similar engines through the Model Context Protocol. Facilitates speech audio generation using AI agents compatible with MCP clients.

Last Updated
GitHub 10
NPM 0
1
MIT License

mcp_voice_identify

yangsenessa
mcp_voice_identify logo

Provides voice recognition and text extraction capabilities, supporting both file input and base64 encoded data processed in structured formats. Operates in stdio and MCP modes for flexible integration with various systems.

Last Updated
GitHub 0
NPM 0
1
MIT License
1 2
Go to page: