Document Processing MCP Repositories

326 repositories in this category.

Showing 30 of 326 repositories (Page 2 of 11)

Deep Research Mcp

Hajime-Y
Deep Research Mcp logo

Provides advanced web search capabilities, document analysis, and image processing. Extracts information from various sources including PDFs and YouTube transcripts efficiently.

Last Updated 09/07/2025
GitHub12
NPM0
1
Apache License 2.0

Handwriting Ocr Mcp Server

Handwriting-OCR
Handwriting Ocr Mcp Server logo

Integrate applications with the Handwriting OCR service to process images and PDF documents for text extraction. Upload documents, check processing status, and retrieve OCR results in Markdown format.

Last Updated 10/01/2025
GitHub11
NPM0
1
No License

Yuque Mcp Server

HenryHaoson
Yuque Mcp Server logo

Integrate with the Yuque API for managing documents and user information. Supports creating, reading, updating, and deleting documents while providing access to analytics and statistics for knowledge bases.

Last Updated 09/23/2025
GitHub25
NPM0
1
No License

Mcp Server Novacv

HireTechUpUp
Mcp Server Novacv logo

Connect to the NovaCV API for generating professional resumes, analyzing resume content, and converting resume text into structured formats like JSON. It provides features for creating tailored resumes in PDF format and accessing available template options.

Last Updated 06/11/2025
GitHub3
NPM0
1
No License

Markdownify Mcp Utf8

JDJR2024
Markdownify Mcp Utf8 logo

Converts various file types to Markdown format, with robust support for UTF-8 encoding and optimized for multilingual content handling. Ensures accurate transformation of documents and web pages while addressing encoding issues, especially on Windows systems.

Last Updated 09/08/2025
GitHub10
NPM0
1
MIT License

Semanticscholar Mcp Server

JackKuo666
Semanticscholar Mcp Server logo

Search for academic papers, retrieve detailed information about specific papers and authors, and access citations and references through the Semantic Scholar API.

Last Updated 09/27/2025
GitHub29
NPM0
1
No License

Figma Mcp

JayZeeDesign
Figma Mcp logo

Facilitates access to Figma files and prototypes, enabling integration of design assets directly into AI coding environments. Streamlines design workflows by connecting AI agents with Figma's design resources.

Last Updated 08/18/2025
GitHub61
NPM0
1
MIT License

Mcp Jina Ai

JoeBuildsStuff
Mcp Jina Ai logo

Access Jina AI's web services for web page reading, web search, and fact checking. Extract and format content from web pages for use with LLMs.

Last Updated 10/03/2025
GitHub30
NPM0
1
MIT License

Mcp Accessibility Scanner

JustasMonkev
Mcp Accessibility Scanner logo

Automated web accessibility scanning using Playwright and Axe-core, enabling WCAG compliance checks and annotated screenshot capture. Generates detailed accessibility reports and interacts with web pages through browser automation.

Last Updated 10/03/2025
GitHub18
NPM0
1
MIT License

Klavis

Klavis-AI
Klavis logo

Generates visually appealing web reports based on simple search queries, integrating live web search results and storing reports in a database for easy access. Utilizes AI to synthesize information into interactive HTML formats.

Last Updated 10/04/2025
GitHub4.5k
NPM0
1
Apache License 2.0

Markitdown_mcp_server

KorigamiK
Markitdown_mcp_server logo

Converts various file formats to Markdown, utilizing the MarkItDown utility to handle documents, images, and audio files.

Last Updated 09/30/2025
GitHub56
NPM0
1
MIT License

Mcp Bibliotheque_nationale_de_france

Kryzo
Mcp Bibliotheque_nationale_de_france logo

Access the Gallica digital library to search for documents, images, maps, and other resources, and generate structured research reports that include organized bibliographies and relevant visual content.

Last Updated 10/01/2025
GitHub5
NPM0
1
No License

Kv Extractor Mcp Server

KunihiroS
Kv Extractor Mcp Server logo

Extracts key-value pairs from noisy or unstructured text in multiple languages, ensuring type-safe outputs in JSON, YAML, or TOML formats. Utilizes advanced LLMs and pydantic for data structuring and validation, supporting languages like Japanese, English, and Chinese.

Last Updated 07/16/2025
GitHub1
NPM0
1
GNU General Public License v3.0

Mcp Webdav Server

LaubPlusCo
Mcp Webdav Server logo

Enable natural language interaction with WebDAV file systems to perform CRUD operations on files and directories through a secure and configurable MCP server. Supports connections with optional authentication and efficient management of file operations via multiple transport methods.

Last Updated 09/22/2025
GitHub9
NPM0
1
MIT License

Eigenlayer Mcp Server

Layr-Labs
Eigenlayer Mcp Server logo

Provides detailed EigenLayer documentation to AI assistants through a dedicated server interface, enabling seamless integration and querying of EigenLayer concepts and mechanisms.

Last Updated 07/22/2025
GitHub1
NPM0
1
MIT License

Ntealan Apis Mcp Server

Levis0045
Ntealan Apis Mcp Server logo

Manage dictionary data, articles, and user contributions through a modular and extensible interface. Supports asynchronous operations for efficient integration with NTeALan REST APIs.

Last Updated 05/01/2025
GitHub0
NPM0
1
MIT License

Cosa Sai

M-Gonzalo
Cosa Sai logo

Access documentation for a variety of technologies through the Gemini API, leveraging a curated knowledge base to provide accurate responses to complex queries. This server is designed to handle large context windows for improved comprehension of technical materials.

Last Updated 05/23/2025
GitHub13
NPM0
1
No License

Unstructured Mcp

MKhalusova
Unstructured Mcp logo

Enable extraction and utilization of content from various unstructured document formats, supporting seamless storage and retrieval via AWS S3. Process documents directly in applications to enhance data extraction capabilities for LLMs.

Last Updated 05/22/2025
GitHub6
NPM0
1
No License

File Converter Mcp

MaitreyaM
File Converter Mcp logo

Convert documents between various formats using Pandoc, enabling seamless integration and automation in workflows. Supports a wide range of formats including Markdown, DOCX, HTML, PDF, and EPUB.

Last Updated 08/09/2025
GitHub5
NPM0
1
No License

Meeting Mcp

Meeting-BaaS
Meeting Mcp logo

Manage meeting data including transcripts, recordings, and calendar events while providing search functionality for easy organization and retrieval.

Last Updated 09/13/2025
GitHub20
NPM0
1
MIT License

Docs2prompt Mcp

Melbourneandrew
Docs2prompt Mcp logo

Transforms documentation from GitHub repositories or dedicated websites into LLM-friendly prompts for enhanced context and understanding in AI applications.

Last Updated 03/21/2025
GitHub0
NPM0
1
Apache License 2.0

Mcp Doc

MeterLong
Mcp Doc logo

Create, edit, and manage Word documents using natural language commands, facilitating document operations and formatting. Support for table processing, image insertion, and layout control is also included.

Last Updated 10/04/2025
GitHub132
NPM0
1
No License

Mcp Japanesetextanalyzer

Mistizz
Mcp Japanesetextanalyzer logo

Analyzes Japanese and English texts by counting characters and words and evaluating linguistic features such as average sentence length and lexical diversity. Supports input via file paths or direct text input, accommodating both absolute and relative paths.

Last Updated 08/25/2025
GitHub2
NPM0
1
MIT License

Mcp Server Firecrawl

Msparihar
Mcp Server Firecrawl logo

Provides capabilities for web scraping, intelligent content searching, and site crawling using the Firecrawl API, facilitating customizable data extraction and structured output.

Last Updated 02/20/2025
GitHub2
NPM0
1
MIT License

Transcriptiontools Mcp

MushroomFleet
Transcriptiontools Mcp logo

Enhances transcription workflows by automatically repairing errors, formatting transcripts naturally, and generating concise summaries. Utilizes advanced language models for intelligent processing of audio transcripts.

Last Updated 09/29/2025
GitHub17
NPM0
1
MIT License

Textclassifier

MymInsomnia
Textclassifier logo

Multiple common text classification models based on CNN, RNN, and pre-trained NLP architectures for sentiment analysis and text classification. Supports data preprocessing, training word embeddings, and implementing advanced models like Bi-LSTM, Transformer, ELMo, and BERT for improved classification accuracy.

Last Updated 06/14/2019
GitHub1
NPM0
1
No License

Mcp Sefaria Server

OpenTorah-ai
Mcp Sefaria Server logo

Access and reference Jewish texts and commentaries through a standardized interface.

Last Updated 02/25/2025
GitHub0
NPM0
1
MIT License

Autodocument

PARS-DOE
Autodocument logo

Generates comprehensive documentation, test plans, and code reviews by analyzing code repositories and directory structures. Utilizes AI to enhance development workflows with detailed insights into security and best practices.

Last Updated 05/26/2025
GitHub5
NPM0
1
Creative Commons Zero v1.0 Universal

Gemforge Mcp

PV-Bhat
Gemforge Mcp logo

Provides tools for interacting with Google's Gemini AI models, enabling intelligent model selection and advanced file handling. Facilitates AI tasks such as search, reasoning, code analysis, and file operations through a standardized MCP server interface.

Last Updated 08/10/2025
GitHub3
NPM0
1
MIT License

Mcp Webresearch Stealthified

PhialsBasement
Mcp Webresearch Stealthified logo

Connects AI models to the web for real-time information retrieval, webpage content extraction, and research session tracking, along with the ability to capture screenshots.

Last Updated 09/24/2025
GitHub7
NPM0
1
MIT License
Page 2 of 11 • 326 total items