Document Processing MCP Repositories

326 repositories in this category.

Showing 30 of 326 repositories (Page 10 of 11)

Docs

srvdneat
Docs logo

Provides a starter kit for creating and maintaining documentation, including guide pages, navigation, customizations, and API references. Supports local previews and automatic deployment of documentation updates via integration with a GitHub app.

Last Updated Invalid Date
GitHub0
NPM0
1
Unknown

Mcp Storage Server

storacha
Mcp Storage Server logo

Facilitates secure storage and retrieval of files using decentralized storage via IPFS and CIDs. Enables verifiable data exchange and integration with AI frameworks, offering free storage options to users.

Last Updated 08/24/2025
GitHub12
NPM0
1
Apache License 2.0

Zotero Mcp Server

swairshah
Zotero Mcp Server logo

Access and manage your Zotero library programmatically, enabling the search of papers, management of notes, and the ability to request summaries through MCP clients. Facilitates seamless integration into research workflows with existing tools.

Last Updated 08/23/2025
GitHub20
NPM0
1
Apache License 2.0

Pdf Reader Mcp

sylphxltd
Pdf Reader Mcp logo

Enables secure reading and extraction of text, metadata, and page counts from PDF files. Processes multiple PDFs from local paths or URLs with structured JSON output for easy parsing.

Last Updated 10/03/2025
GitHub262
NPM0
1
MIT License

Arxiv Latex Mcp

takashiishida
Arxiv Latex Mcp logo

Fetches and processes LaTeX sources of arXiv papers, enabling AI models to accurately interpret mathematical content and equations without the limitations of PDF files.

Last Updated 10/04/2025
GitHub66
NPM0
1
MIT License

Obsidian Mcp

takuya0206
Obsidian Mcp logo

Interact with an Obsidian vault to read, write, and manipulate notes using a standardized interface, facilitating enhanced productivity and organization.

Last Updated 04/09/2025
GitHub2
NPM0
1
ISC License

Mcp Server Diff Python

tatn
Mcp Server Diff Python logo

Obtain text differences between two strings using Python's `difflib`, providing output in Unified diff format suitable for text comparison and version control.

Last Updated 06/02/2025
GitHub7
NPM0
1
MIT License

Mcp Server Fetch Typescript

tatn
Mcp Server Fetch Typescript logo

Retrieves and converts web content using various formats and rendering methods, suitable for both data extraction and web scraping tasks. It allows access to text-based resources and provides raw text content from specified URLs without additional processing.

Last Updated 09/19/2025
GitHub3
NPM0
1
MIT License

Mcp Rss Md

taweili
Mcp Rss Md logo

Generates Markdown content from RSS feeds, transforming raw RSS data into well-structured Markdown documents for easy sharing and publishing.

Last Updated 03/20/2025
GitHub0
NPM0
1
Other

Parsemypdf

taxihabbel
Parsemypdf logo

Extract and analyze complex PDF documents using various tools to maintain document structure and efficiently extract tables, images, and mixed content. Specialized processors are available tailored to the complexity and content type of the PDFs.

Last Updated 08/14/2025
GitHub1
NPM0
1
MIT License

Mcp Xpath

thirdstrandstudio
Mcp Xpath logo

Execute XPath queries on XML and HTML content, fetching and querying data from URLs or local files. Return structured results to enhance applications with powerful XML data manipulation capabilities.

Last Updated 08/04/2025
GitHub0
NPM0
1
MIT License

Mcp Server Ietf

tizee
Mcp Server Ietf logo

Access and retrieve IETF RFC documents, enabling search by keywords and management of document pagination. Provides standardized access to essential specifications for Large Language Models.

Last Updated 09/27/2025
GitHub8
NPM0
1
MIT License

Mcp Unix Manual

tizee
Mcp Unix Manual logo

Retrieve Unix command documentation, including help pages and version information. List common commands and check command availability within conversations.

Last Updated 04/06/2025
GitHub1
NPM0
1
MIT License

Pdf Reader Mcp

trafflux
Pdf Reader Mcp logo

Extracts text from both local and online PDF files with robust error handling and standardized output. Supports various PDF formats and includes features for auto-detection of encoding and volume mounting.

Last Updated 10/02/2025
GitHub31
NPM0
1
No License

Mcp Pdf2png

truaxki
Mcp Pdf2png logo

Convert PDF documents into high-quality PNG images seamlessly, transforming each page of a PDF into a PNG file using a simple MCP tool call. Enhance document processing with efficient image generation from PDFs.

Last Updated 08/13/2025
GitHub6
NPM0
1
No License

Eagle Mcp Server

tuki0918
Eagle Mcp Server logo

Integrates with the Eagle app to manage and interact with digital assets through a standardized MCP interface, enabling operations such as folder and item management, metadata retrieval, and media handling.

Last Updated 09/22/2025
GitHub3
NPM0
1
MIT License

Mcp Text Editor

tumf
Mcp Text Editor logo

Provides line-oriented text file editing capabilities through a standardized API, optimized for efficient interaction with large language models, enabling partial file access to minimize token usage.

Last Updated 10/04/2025
GitHub161
NPM0
1
MIT License

Memory Bank Mcp

tuncer-byte
Memory Bank Mcp logo

Create and manage structured project documentation with AI assistance, generating interconnected Markdown files that capture project knowledge from goals to progress. It supports context-aware querying for efficient searching and exporting of project information.

Last Updated 09/27/2025
GitHub100
NPM0
1
No License

Autoguarantee

u3588064
Autoguarantee logo

自动提取保函文本中的要素和条款,提供法律和金融专业人士分析所需的信息。输出结果为 JSON 格式,支持提取担保人的 SWIFT 标识代码、开立日期和保函种类等要素。

Last Updated 02/28/2025
GitHub2
NPM0
1
MIT License

Entity Resolution

u3588064
Entity Resolution logo

Compares two sets of data to determine if they originate from the same entity using text normalization and semantic analysis. It evaluates both exact and semantic equality of values, ensuring accurate data validation.

Last Updated 05/03/2025
GitHub1
NPM0
1
MIT License

Prem Mcp Server

ucalyptus
Prem Mcp Server logo

Integrates with Prem AI's features for chat interactions and document management, supporting Retrieval-Augmented Generation with document repositories and real-time streaming responses.

Last Updated 03/26/2025
GitHub0
NPM0
1
No License

Markai

umuthopeyildirim
Markai logo

MarkAI is a platform that enables users to ask questions and receive answers derived from their documents, providing efficient data access. It supports various file formats and offers both public and private collaboration options.

Last Updated 03/27/2025
GitHub22
NPM0
1
Apache License 2.0

Context7

upstash
Context7 logo

Fetches up-to-date, version-specific documentation and code examples directly from source libraries to enhance prompts. Integrates real-time documentation into AI coding workflows for improved code accuracy and productivity.

Last Updated 10/04/2025
GitHub32.5k
NPM0
1
MIT License

Apple Books Mcp

vgnshiyer
Apple Books Mcp logo

Manage and explore your Apple Books library, summarize highlights, and receive book recommendations by harnessing Claude's capabilities.

Last Updated 09/26/2025
GitHub32
NPM0
1
Apache License 2.0

Docmcp

visheshd
Docmcp logo

Index and query technical documentation using AI-powered semantic search. It crawls, processes, and embeds documentation for efficient retrieval through AI IDEs with built-in MCP tools for seamless integration.

Last Updated 08/24/2025
GitHub6
NPM0
1
No License

Mcp Pandoc

vivekVells
Mcp Pandoc logo

Facilitates document format conversion using pandoc, enabling transformation between various document types while maintaining formatting and structure.

Last Updated 10/03/2025
GitHub420
NPM0
1
MIT License

Mcp Framework

w-jeon
Mcp Framework logo

This framework enables the creation of custom tools for interaction with large language models, facilitating web content retrieval and various file handling capabilities. It automates the processing of PDF, Word, and Excel documents for enhanced productivity.

Last Updated 06/12/2025
GitHub3
NPM0
1
MIT License

Dify

weloyun
Dify logo

Dify allows users to build and test AI workflows on a visual canvas, facilitating the integration of tools and data sources for enhanced AI interactions. It supports both cloud hosting and self-hosting options for flexible usage.

Last Updated 03/01/2025
GitHub0
NPM0
1
Other

Macos Ocr Mcp

whiteking64
Macos Ocr Mcp logo

Perform Optical Character Recognition (OCR) on images with the help of macOS's Vision framework, extracting recognized text segments, confidence scores, and bounding box coordinates. Suitable for applications that require text extraction from image files.

Last Updated 06/07/2025
GitHub1
NPM0
1
No License

Airylark Mcp Server

wizd
Airylark Mcp Server logo

Provides high-accuracy translation services through a structured three-stage workflow, ensuring consistency and quality across multiple languages. Supports various professional fields such as technical documentation, academia, law, medicine, and finance.

Last Updated 09/25/2025
GitHub22
NPM0
1
Other
Page 10 of 11 • 326 total items