Document Processing MCP Repositories
326 repositories in this category.
mcp-youtube-transcript
→
Retrieve transcripts from YouTube videos.
ms-365-mcp-server
→
Enables interaction with Microsoft 365 services through the Graph API, allowing management of Excel files, calendar events, emails, and OneDrive files securely using the Microsoft Authentication Library.
mcp-server-python
→
Integrates documentation with a powerful API for efficient management and access. Facilitates the development of intelligent applications that seamlessly interact with documentation.
markdown-sidecar-mcp
→
Access and serve markdown documentation for NPM packages, Go Modules, and PyPi packages from a local environment, enhancing the code generation process. Default support for Python help documentation is included for packages lacking markdown documentation.
mcp-pyzotero
→
Integrates a local Zotero library with Claude Desktop, enabling direct read access to bibliographic data through a local web API in Zotero 7.
mcp-zotero
→
Integrates with Zotero to enable interactions with a Zotero library, facilitating the management and retrieval of bibliographic data.
memory-bank-MCP
→
Create and manage structured project documentation with AI assistance, generating interconnected Markdown files that capture project knowledge from goals to progress. It supports context-aware querying for efficient searching and exporting of project information.
kaltura-mcp
→
Integrates Kaltura's media management for uploading, retrieving, and managing media with standardized API interactions. Supports operations like metadata retrieval, media search, category management, and user permissions management.
ops-mcp
→
Search and retrieve Confluence documents, accessing full page content and associated metadata for efficient document management. Supports full-text search and retrieval of document details including title, space, and version information.
mcp-ragdocs
→
Retrieve and process documentation through vector search, enabling AI models to integrate relevant context into their responses. Supports multiple sources and offers semantic search capabilities for enhanced information retrieval.
mcp-invoice
→
Advanced OCR capabilities for invoice and receipt management, enabling data extraction from various formats and document merging for efficient handling.
ppt
→
The PowerPoint Presentation Automation Server allows users to create and edit PowerPoint presentations automatically. It provides an easy way to generate slides, add content, and customize designs through simple API calls or natural language commands.
textClassifier
→
Multiple common text classification models based on CNN, RNN, and pre-trained NLP architectures for sentiment analysis and text classification. Supports data preprocessing, training word embeddings, and implementing advanced models like Bi-LSTM, Transformer, ELMo, and BERT for improved classification accuracy.
docs
→
Comprehensive documentation for DataEase, focusing on data management, system architecture, and user functionalities. Provides structured resources for installation and deployment, along with user manuals for various features.
markdown2pdf-mcp
→
This server converts Markdown documents into PDF files, allowing for customizable styles and syntax highlighting for code blocks. It provides an easy way to generate well-formatted PDFs from text-based Markdown content.
opentk-mcp
→
Provides access to Dutch parliamentary documents, debates, and member information through a standardized interface for natural language queries, facilitating research and analysis of legislative activities.
mcp-pdf-tools
→
Provides tools for manipulating PDF files, including merging multiple PDFs, extracting specific pages, and finding related PDFs based on text extraction and regex patterns.
qiniu-mcp-server
→
Connect to Qiniu Cloud Storage for accessing, managing, and processing multimedia files within AI large model clients. Perform operations such as listing buckets, uploading files, reading file contents, and utilizing intelligent multimedia features.
lsp-tools-mcp
→
Enhances text analysis by providing regex matching capabilities to find positions of regex matches in files and managing directory access for secure interaction with system files.
simple-files-vectorstore
→
Creates a vector store from local directories and files, enabling semantic search across document contents. Monitors specified directories for file changes and generates vector embeddings to facilitate search functionality.
apple-books-mcp
→
Manage and explore your Apple Books library, summarize highlights, and receive book recommendations by harnessing Claude's capabilities.
mcp-ARCknowledge
→
Manage and query a custom knowledge base by registering document sources, querying information, and aggregating results from multiple webhook endpoints. Simplifies knowledge management and enhances querying capabilities.
apple-mcp
→
Integrates with Apple applications to manage messages, notes, and emails. Enables automation of tasks across the Apple ecosystem using simple commands for improved productivity.
mcp-docs-provider
→
Enables AI models to access and query local markdown technical documentation, enhancing context-aware responses. Supports dynamic integration of documentation, allowing updates without server rebuilds.
mlcbakery
→
Provides access to MLC Bakery functionalities through an MCP-compatible interface, enabling clients to search datasets, retrieve previews, and validate metadata efficiently. Facilitates interaction with MLC Bakery API tools for data exploration and management.
paprika-3-mcp
→
Exposes Paprika 3 recipes as LLM-readable resources for interaction with LLMs like Claude. Enables creation and updating of recipes within the Paprika app through specialized tools.
mcp
→
Integrates multiple AI models and implements retrieval-augmented generation (RAG) alongside large language models (LLMs). Supports PDF and OCR processing for enhanced data handling while providing a simplified setup for backend and frontend deployment.
mcp-server-youtube-transcript
→
Retrieve transcripts and subtitles from YouTube videos, accessing video captions and detailed metadata through a simple interface.
fivem-docs
→
FiveM Documentation is a resource hub that provides guides and reference materials for developers working with FiveM, a multiplayer modification framework for Grand Theft Auto V. It aims to assist users in creating and managing their FiveM servers effectively.
mcp-jina-reader
→
Fetches the content of a remote URL and converts it into Markdown format using Jina Reader.
