Document Processing MCP Repositories
326 repositories in this category.
Docs Fetch Mcp
→
Fetch and explore web content autonomously by navigating through documentation and web pages to extract relevant information. It supports recursive exploration and filters navigation links for content-rich pages.
Mcp Jina Reader
→
Fetches the content of a remote URL and converts it into Markdown format using Jina Reader.
Mcp Server
→
Automates the collection of project information from GitHub for students, assisting in resume writing, interview question generation, and portfolio management. Provides tools for project-based self-introduction and interview practice to streamline career preparation.
Scrapbox Cosense Mcp
→
Access and interact with Scrapbox project pages, facilitating content retrieval, page listing, and full-text searching across project content.
Textwell Mcp
→
A specialized MCP server for writing text to the Textwell application on macOS, offering modes for replacing, inserting, or appending text. It facilitates text manipulation directly within the Textwell environment.
File Converter Mcp
→
Convert various document and image formats such as DOCX to PDF, PDF to DOCX, and multiple image formats (JPG, PNG, WebP, etc.). Provides reliable and flexible file handling to meet diverse conversion needs.
Finance_news_analysis
→
Scrapes financial data, performs NLP algorithm analysis, and facilitates quantitative strategy backtesting. Integrates various components for automated financial data processing and insights extraction.
Perplexity Mcp Zerver
→
Integrate AI research capabilities by performing web searches, retrieving documentation, and analyzing code through interaction with the Perplexity website. It provides persistent chat history and does not require an API key, relying on browser automation.
Ops Mcp
→
Search and retrieve Confluence documents, accessing full page content and associated metadata for efficient document management. Supports full-text search and retrieval of document details including title, space, and version information.
Mcp Pdf Extraction Server
→
Extracts text from PDF files using advanced reading and OCR capabilities. Supports content retrieval from specified pages or entire documents for seamless integration into applications.
Mcp Bookstack
→
Search and retrieve structured data from BookStack pages with customizable queries, pagination, and HTML-to-text conversion for enhanced reading. It includes robust error handling and validation for seamless content access.
Cosense Mcp Server
→
Access and interact with the Cosense knowledge sharing platform by retrieving, listing, and searching for pages, as well as inserting text into existing pages.
Mcp Excel Server
→
Manage and analyze Excel files, including reading, writing, and visualizing data. Perform statistical analysis and data quality assessments to enhance data manipulation and insights.
Mcp Docling
→
Convert documents to markdown, extract tables, and process multiple files efficiently for enhanced document processing capabilities.
Markdownify Mcp
→
Converts various file types and web content into Markdown format, supporting multiple input types such as PDFs, images, and audio files.
Front Code Sum
→
Summarize front-end learning materials and showcase small demo projects. Provides concise notes and practical examples for better understanding of front-end technologies.
Zntl Mcp Server
→
Provides AI-powered transcription and analysis functionalities via a standardized Model Context Protocol interface, enabling efficient data searching, summarizing, and retrieval. Integrates with the Transcripter project to facilitate interaction with transcription and analysis data.
Confluence Mcp
→
Integrate with the Confluence API to access and manipulate Confluence data. Execute CQL queries and retrieve page content seamlessly.
Local Rag
→
Access and query information from large PDF files using a powerful retrieval-augmented generation (RAG) system that integrates with Claude. Utilize advanced document processing and vector storage to enhance data retrieval capabilities.
Excel Mcp Server
→
Read, write, and analyze Excel files while seamlessly managing data through various functionalities, including accessing multiple worksheets and exporting structure information.
Pdfmathtranslate
→
Translate PDF scientific papers while maintaining the integrity of formulas, charts, and annotations. Supports multiple languages and various translation services through a command-line interface, interactive GUI, or Docker deployment.
Jianshu
→
Manage and organize writing projects with a user-friendly interface, providing features tailored specifically for writers to enhance their workflow.
Panw
→
Integrates Palo Alto Networks AI security capabilities into clients compatible with the Model Context Protocol, enabling real-time content risk analysis and interaction with large language models. Supports various input types for dynamic content detection and compliance during AI interactions.
Kaltura Mcp
→
Integrates Kaltura's media management for uploading, retrieving, and managing media with standardized API interactions. Supports operations like metadata retrieval, media search, category management, and user permissions management.
Sqlite Literature Management Fastmcp Mcp Server
→
Manages various types of sources such as papers, books, and webpages while integrating them with knowledge graphs. Tracks relationships between sources and entities, supports multiple identifiers, and maintains structured note-taking and status tracking.
Tuniao Server
→
Provides access to TuNiao UI components documentation and listings via the Model Context Protocol. Features include retrieving component information and detailed documentation for specific components.