Document Processing MCP Repositories
326 repositories in this category.
Opentk Mcp
→
Provides access to Dutch parliamentary documents, debates, and member information through a standardized interface for natural language queries, facilitating research and analysis of legislative activities.
Lsp Tools Mcp
→
Enhances text analysis by providing regex matching capabilities to find positions of regex matches in files and managing directory access for secure interaction with system files.
Mcp Servers
→
Access and manage Microsoft OneNote content, enabling the reading and creation of notebooks, sections, and pages directly through AI assistants. Converts HTML content to text for improved retrieval-augmented generation (RAG) processing.
Deepwiki Mcp
→
Crawls Deepwiki.com documentation, converting it into Markdown format by removing unnecessary HTML elements and adjusting links for better readability. Supports fetching multiple pages and offers structured output formats for knowledge retrieval.
Gitbook Mcp
→
Access GitBook Organizations, Spaces, Collections, and Content through a standardized MCP interface, enabling programmatic operations for documentation workflows.
Google Drive Mcp
→
Integrate Google Drive functionalities with the Model Context Protocol (MCP) to facilitate file management, content retrieval, and permission handling. Access Google's Drive resources seamlessly from LLM applications through standardized tools.
Docgen Mcp
→
Automates the creation of standardized documentation by extracting information from various source files and applying templates. Integrates with Google Drive and GitHub to enhance documentation processes with AI-generated content and management features.
Youtube_transcriptor
→
Transcribes YouTube videos by extracting transcripts, including both manual and autogenerated captions, using the provided video URL. Supports integration with MCP clients for enhanced workflows involving video transcription.
Mcp Rtfm
→
Facilitates the creation of manuals from existing documentation through content analysis, generates metadata, and provides intelligent search capabilities to form a functional knowledge base.
Deepseek_chat_rag
→
Utilizes advanced retrieval-augmented generation models to answer queries based on indexed documents extracted from various file formats. Engages users by providing relevant answers from a Chroma database that stores extracted text from PDF, DOCX, TXT, and CSV files.
Mcp Data Extractor
→
Extracts embedded data such as i18n translations and configurations from TypeScript and JavaScript source code, converting them into structured JSON files while preserving the hierarchical structure and template variables.
Mcp Server Ragdocs
→
Retrieve and process documentation using vector search to enhance AI responses. Enable the creation of documentation-aware AI assistants and context-aware tooling for developers.
Sourcesyncai Mcp
→
Integrate and manage knowledge from various data sources using a standardized interface to retrieve and update documents. Perform semantic searches and manage connections to external services within a knowledge management platform.
Claudekeep
→
A server implementation that enables the saving and sharing of AI conversations from Claude Desktop, featuring both a private chat storage and a public chat display web app. This implementation utilizes the Model Context Protocol (MCP) to manage interactions with AI chat logs.
Convert Markdown Pdf Mcp
→
Converts Markdown files into styled PDF documents using VS Code's markdown formatting and Python's ReportLab. Offers note storage with custom URI access and provides functionality to summarize all stored notes.
Mcp File Preview
→
Enables previewing and analyzing local HTML files, including capturing full-page screenshots and examining their structural elements such as headings, paragraphs, images, and links.
Server Youtube Transcription
→
Transcribes audio from YouTube videos to text, providing accurate and fast text representations of video content. Integrates transcription capabilities seamlessly into applications using the MCP framework.
Mcp Units
→
Provides tools for converting cooking measurements between various volume, weight, and temperature units commonly used in cooking, such as milliliters to cups and grams to pounds.
Dicom Mcp
→
Manage and summarize DICOM images by adding notes and generating summaries to enhance workflows with medical imaging data.
Doc Lib Mcp
→
Manage and semantically search documentation by adding, ingesting, chunking, and querying notes across various document types. Create summaries tailored to specific detail levels and access individual notes through a custom URI scheme.
Youtube Transcript Download
→
Download subtitles from popular video platforms like YouTube, Bilibili, TED, and Coursera using the AITransDub MCP service. Supports multiple subtitle languages for easier access and processing.
Skrape Mcp
→
Convert web pages into clean, structured Markdown suitable for large language model (LLM) consumption, streamlining the process of feeding web content into AI applications.
Mcp Server Rememberizer
→
Interact with Rememberizer's document and knowledge management API to perform document search, retrieval, and management. Supports access to both documents and Slack discussions.
Ru Heritage
→
Downloads digitised books from e-heritage.ru and converts them into PDF format, facilitating easier access to digital heritage content.
Mcp Excel Reader Server
→
Extract data from Excel files in structured JSON format, allowing access to all sheets or specific sheets by name or index. Handles data type conversions and manages empty cells efficiently.
Paprika 3 Mcp
→
Exposes Paprika 3 recipes as LLM-readable resources for interaction with LLMs like Claude. Enables creation and updating of recipes within the Paprika app through specialized tools.
Obsidian_fetch
→
Retrieve and load notes efficiently from Obsidian vaults, enabling enhanced interactions with language models by cleaning link queries and displaying backlinks to opened files. Streamlined for local GPU setups to improve note retrieval speed and efficiency.
Cargo Doc Mcp
→
Manage and interact with Rust documentation, performing tasks such as checking, building, and searching through project documentation. Access crate documentation and symbol listings to enhance development workflows.
Markdown Sidecar Mcp
→
Access and serve markdown documentation for NPM packages, Go Modules, and PyPi packages from a local environment, enhancing the code generation process. Default support for Python help documentation is included for packages lacking markdown documentation.
Mcp Jinaai Reader
→
Integrates Jina.ai's Reader API for efficient web content extraction, enabling analysis and processing of documentation and web content.