Document Processing MCP Repositories

326 repositories in this category.

docs

Provides a starter kit for creating and maintaining documentation, including guide pages, navigation, customizations, and API references. Supports local previews and automatic deployment of documentation updates via integration with a GitHub app.

Last Updated: Unknown

AverbePorto-MCP

GHSix

Integrates with AverbePorto to manage authentication and document submission for cargo insurance endorsements. Provides a secure API for automated document handling and protocol consultations.

MIT License

unstructured-mcp

MKhalusova

Extracts content from a variety of unstructured document formats and stores and retrieves it via AWS S3. Processes documents directly in applications so that the extracted data can be used by LLMs.

No License

pptx-xlsx-mcp

jenstangen1

Interact with Microsoft Office applications like PowerPoint and Excel to create, modify, and analyze presentations and spreadsheets using natural language commands. Automate complex tasks and data manipulations efficiently within the Office environment.

No License

mcp-outline

Vortiago

Enables interaction with Outline's document management services through natural language commands for searching, creating, and managing documents. Facilitates tasks like reading document content and managing comments within a structured collection.

MIT License

whiskerrag_toolkit

petercat-ai

Provides retrieval-augmented generation capabilities for applications, allowing integration of various data sources with advanced processing methods. Features a toolkit with type definitions and methods for effective RAG implementation.

MIT License

MCP-llms-txt

SecretiveShell

Integrate documentation directly into conversations by utilizing MCP resources for chat applications. This server enhances interactions by providing relevant documentation content as part of the dialogue.

MIT License

ClaudeHopper

Arborist-ai

Interact with construction documents, drawings, and specifications. Analyze technical details and retrieve specific information through advanced retrieval-augmented generation and hybrid search.

MIT License

puremd-mcp

puremd

Access web content as Markdown by prefixing URLs with `pure.md/`, which retrieves web pages while avoiding bot detection. Converts formats such as HTML and PDF into Markdown and caches responses globally for efficiency.

No License
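
The `pure.md/` prefixing described for puremd-mcp can be sketched as follows. This is only a minimal illustration, not the project's own code: the helper name `to_pure_md` is hypothetical, and placing an `https://` scheme in front of the prefix is an assumption that the listing does not state.

```python
def to_pure_md(url: str) -> str:
    """Return the given address with the `pure.md/` prefix applied.

    Hypothetical helper for illustration only.
    """
    # Assumption: the service is reached over HTTPS and the original
    # address is appended unchanged after the `pure.md/` prefix.
    return "https://pure.md/" + url.strip()


if __name__ == "__main__":
    print(to_pure_md("example.com/page"))
```

The helper only builds the prefixed address; fetching it and handling the returned Markdown are left to whatever HTTP client the application already uses.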

sui-mcp-server

ProbonoBonobo

Enables AI agents to retrieve documents from a vector database using Retrieval-Augmented Generation (RAG) techniques. Integrates with GitHub to process Move files and incorporates a language model for generating responses based on retrieved information.

No License

handwriting-ocr-mcp-server

Handwriting-OCR

Integrate applications with the Handwriting OCR service to process images and PDF documents for text extraction. Upload documents, check processing status, and retrieve OCR results in Markdown format.

No License

MCP-server-readability-python

jmh108

Extracts and transforms webpage content into clean, LLM-optimized Markdown, removing ads and non-essential elements for improved readability and processing by language models.

MIT License

word-mcp-server

cuongpham2107

Facilitates the creation and editing of Microsoft Word documents via a straightforward API. Supports adding formatted text, images, and tables, enabling document generation and modification through natural language commands with LLM integration.

No License

mcp-server-docy

oborchers

Provides real-time access to technical documentation from various sources, enabling accurate coding assistance. Supports dynamic updates to documentation sources and employs caching to reduce latency while ensuring fresh content.

MIT License

seo-inspector-mcp

mgsrevolver

Analyzes HTML files and web pages to identify SEO issues and validate structured data schemas. Provides actionable recommendations for improving SEO quality directly through integrated tools without the need for a browser extension.

No License

open-docs-mcp

askme765cs

Crawl, index, and manage documentation while enabling full-text search across various document formats for efficient information retrieval. Integrates with AI to enhance document access and management capabilities.

No License

klavis

Klavis-AI

Generates visually appealing web reports based on simple search queries, integrating live web search results and storing reports in a database for easy access. Utilizes AI to synthesize information into interactive HTML formats.

4.5K

Apache License 2.0

textin-mcp

intsig-textin

Extract text from images, PDFs, and Word documents while performing OCR and document conversion tasks. Convert documents to Markdown format, and retrieve key information from files intelligently.

MIT License

laas-rag-mcp

bettehub

Upload documents in PDF or CSV format and query them in natural language to retrieve relevant information. Segments documents and stores their embeddings in a Chroma vector store for efficient retrieval.

No License

context7

antonioevans

Fetches up-to-date, version-specific code documentation and examples from source libraries to enhance prompts, reducing reliance on outdated code and inaccurate APIs. Integrates real-time library documentation into LLM context to improve coding accuracy and productivity.

MIT License

obsidian_fetch

soukouki

Retrieve and load notes efficiently from Obsidian vaults, enabling enhanced interactions with language models by cleaning link queries and displaying backlinks to opened files. Streamlined for local GPU setups to improve note retrieval speed and efficiency.

MIT License

servers

modelcontextprotocol

Integrates with Google Drive to provide functionality for listing, reading, and searching files. It supports various file formats and exports Google Workspace files to applicable formats for easier access.

69.5K

MIT License

notion-readonly-mcp-server

Taewoong1378

Provides read-only access to Notion content, enabling retrieval of pages, blocks, databases, comments, and properties with optimized performance. Focuses on minimizing API calls and supports parallel processing for efficient data acquisition.

MIT License

wiki_mcp_server

albertshao

Manage Confluence wiki pages by creating, updating, deleting, and searching them through a unified interface. Automatically selects the relevant knowledge base based on user queries to enhance content management efficiency.

No License

mindmap-mcp-server

YuChenSSR

Converts Markdown content into interactive mindmaps, generating HTML mindmaps or saving them as files for easy access and sharing. Enhances project planning and brainstorming through visual representations of ideas.

197

MIT License

MRConfluenceLinker-mcp-server

CodeByWaqas

Fetch and analyze GitLab merge requests, and store the analysis results in Confluence documentation to enhance documentation workflows.

MIT License

mcp-data-extractor

sammcj

Extracts embedded data such as i18n translations and configurations from TypeScript and JavaScript source code, converting them into structured JSON files while preserving the hierarchical structure and template variables.

MIT License

mcp-webdav-server

LaubPlusCo

Enable natural language interaction with WebDAV file systems to perform CRUD operations on files and directories through a secure and configurable MCP server. Supports connections with optional authentication and efficient management of file operations via multiple transport methods.
