Document Processing MCP Repositories
326 repositories in this category.
Entity-Resolution
→
Compares two sets of data to determine if they originate from the same entity using text normalization and semantic analysis. It evaluates both exact and semantic equality of values, ensuring accurate data validation.
kibela-mcp-server
→
Interact with Kibela's knowledge base to search, organize, and manage notes and folders while utilizing AI-assisted writing and content creation. Integrate seamlessly with MCP clients like Claude Desktop or VSCode for enhanced productivity.
CnOCR
→
Enables optical character recognition for Chinese, English, and numbers using pre-trained models or custom training. Provides powerful text recognition capabilities for a variety of applications.
FILE-CONVERTER-MCP
→
Convert documents between various formats using Pandoc, enabling seamless integration and automation in workflows. Supports a wide range of formats including Markdown, DOCX, HTML, PDF, and EPUB.
mcp-server-azure-ai-agents
→
Integrates Azure AI services with Claude Desktop for enhanced search capabilities, enabling intelligent searches across indexed documents and the web while providing source citations. Offers two implementations: one that utilizes the Azure AI Agent Service for document and web searches, and another for direct access to Azure AI Search.
Office-PowerPoint-MCP-Server
→
Create, edit, and manipulate PowerPoint presentations using various automation tools. Streamlines workflow by providing functionalities to enhance presentation tasks.
ru-heritage
→
Downloads digitised books from e-heritage.ru and converts them into PDF format, facilitating easier access to digital heritage content.
cos-mcp
→
Integrate large language models with Tencent Cloud Object Storage (COS) and Data Insight (CI), enabling file management, automated cloud data handling, and various image and video processing tasks. Supports natural language-based metadata search and efficient backup workflows.
docs
→
A starter kit designed for creating and managing project documentation with features like guide pages, navigation, and API references. It facilitates local previews and automatic deployment of documentation changes through a GitHub app integration.
mcp-webresearch-stealthified
→
Connects AI models to the web for real-time information retrieval, webpage content extraction, and research session tracking, along with the ability to capture screenshots.
excel-mcp-server
→
Manipulate Excel files programmatically without Microsoft Excel installed. Create, read, modify workbooks, apply formatting, generate charts, and manage data ranges.
mcp-google-docs
→
Manipulate Google Spreadsheets and Drive to create, copy, and manage files, allowing for efficient integration of document operations within applications.
markmap-mcp-server
→
Converts Markdown text into interactive mind maps, supporting export in various image formats including PNG, JPG, and SVG. Features include zooming, node expansion, automatic browser preview, and one-click Markdown copying.
sqlite-literature-management-fastmcp-mcp-server
→
Manages various types of sources such as papers, books, and webpages while integrating them with knowledge graphs. Tracks relationships between sources and entities, supports multiple identifiers, and maintains structured note-taking and status tracking.
markdownify-mcp
→
Converts various file types and web content into Markdown format, supporting multiple input types such as PDFs, images, and audio files.
knowledge-graph-mcp
→
Manage, analyze, and visualize knowledge graphs with support for various graph types, including ontologies and timelines. Integrate effectively with MCP-compatible AI assistants to query and manipulate knowledge graph data while tracking resource management and version status.
mcp-server-ragdocs
→
Retrieve and process documentation using vector search to enhance AI responses. Enable the creation of documentation-aware AI assistants and context-aware tooling for developers.
jianshu
→
Manage and organize writing projects with a user-friendly interface, providing features tailored specifically for writers to enhance their workflow.
mcp-google-suite
→
Enable interaction with Google Workspace services such as Drive, Docs, and Sheets for operations including file searching, folder creation, document management, and spreadsheet manipulation. Flexible integration is supported through multiple transport modes compatible with MCP clients.
quip-mcp-server
→
Interact directly with Quip documents to read, append, prepend, and replace content, enhancing document management capabilities for AI assistants.
excel-mcp-server
→
Read, write, and analyze Excel files while seamlessly managing data through various functionalities, including accessing multiple worksheets and exporting structure information.
mcp-server-neurolora-p
→
Collects and documents code from projects into markdown, providing tools for code analysis and documentation.
JSON-MCP-Server
→
Query and manipulate JSON data using standardized tools and JSONPath syntax with extended operations.
Swarms_MCPserver
→
Retrieve and interact with documentation databases using hybrid semantic and keyword search. Automatically index and reindex various file types while supporting live file watching and low-latency document querying through a FastMCP tools API.
doctair
→
Enables editing and deployment of applications through a web interface and local development environment, while synchronizing changes and facilitating project management with custom domain connections.
mcp-xpath
→
Execute XPath queries on XML and HTML content, fetching and querying data from URLs or local files. Return structured results to enhance applications with powerful XML data manipulation capabilities.
mcp-server-rememberizer
→
Interact with Rememberizer's document and knowledge management API to perform document search, retrieval, and management. Supports access to both documents and Slack discussions.
Yuque-MCP-Server
→
Integrates with the Yuque knowledge base platform for document management and user information retrieval, enabling operations such as creating, reading, updating, and deleting documents while utilizing AI capabilities for enhanced workflow and analytics.
mcp-Pdf2png
→
Convert PDF documents into high-quality PNG images seamlessly, transforming each page of a PDF into a PNG file using a simple MCP tool call. Enhance document processing with efficient image generation from PDFs.
excel-mcp
→
Manipulate Excel files without requiring Microsoft Excel. Create, modify, and format workbooks while utilizing advanced features like charts and pivot tables.
