🌐 Unified AI Gateway (MCP Orchestrator)

[![Python 3.13+](https://img.shields.io/badge/python-3.13+-blue.svg)](https://www.python.org/downloads/) [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT) [![MCP Protocol Compliant](https://img.shields.io/badge/Protocol-MCP-purple.svg)](https://github.com/modelcontextprotocol) ### Decoupling AI Agent Logic from Infrastructure via Standardized Tool Calling

🤖 System Overview: The Cognitive Dispatcher

This platform serves as the Model Context Protocol (MCP) Gateway, acting as a single, unified endpoint for advanced AI agents (like Claude, GPT variants) to interact with a vast, heterogeneous ecosystem of digital capabilities. It moves beyond simple LLM chaining by providing Intelligent Orchestration and Cost-Aware Delegation.

While it incorporates sophisticated task routing to balance cost and performance (e.g., routing simple classification to a low-cost model while reserving high-end models for nuanced reasoning), its primary function is exposing a comprehensive Tool Set for autonomous agents.

Key capabilities exposed include: persistent knowledge management, web automation (via Playwright), structured data manipulation (Excel, SQL), document processing (OCR, RAG), and secure local system interaction (CLI utilities).

🎯 Core Architectural Shift: Tool-First Design

The gateway fundamentally transforms an agent's operational scope by presenting complex actions as standardized, schema-defined MCP tools. This abstraction shields the agent from provider-specific details (API keys, endpoint differences) and infrastructure complexity (file paths, database connections).

Key Architectural Pillars:

MCP Native: 100% adherence to the Model Context Protocol for robust, standardized agent integration.
Tool Registry: Dynamic discovery and management of dozens of integrated capabilities.
Intelligent Routing Layer: Analyzes task requirements (cost sensitivity, quality needs, speed constraints) to select the optimal LLM provider/model or specialized non-LLM tool for execution.
Ecosystem Integration: Unified access layer for providers (OpenAI, Anthropic, Google, etc.) and external systems (Databases, File Systems, Web Browsers).

⚙️ Key Functional Domains (Tool Categories)

The Gateway aggregates capabilities across several critical domains:

1. Generative & Cost Optimization

Multi-Provider Completion: Unified interface for all major LLMs. Implements sophisticated cost/performance balancing logic.
Model Tournaments: Run head-to-head evaluations for code or text generation tasks to determine the best performer for a specific query.
Advanced Caching: Semantic and task-aware caching layers to drastically reduce redundant API calls and associated costs.

2. Data & Document Intelligence

Structured Extraction: Reliable extraction of JSON, tabular data, and key-value pairs from unstructured text, guided by enforced schemas.
Document Processing: Smart chunking (semantic boundary detection), summarization, and format conversion (HTML to Markdown).
Optical Character Recognition (OCR): Processing images and PDFs, enhanced by LLMs for correction and structured output.
Vector & RAG Operations: Seamless interfacing with vector stores for contextual retrieval-augmented generation.

3. System Interaction & Automation

Browser Automation (Playwright): Full headless control over web navigation, scraping, interaction, and screenshot capture.
Filesystem Security: Secure, validated read/write/manipulation of files and directories, enforced by strict permission boundaries.
Database Connectivity: Direct interaction with SQL databases via ORM abstraction, supporting query execution and schema analysis.
Local CLI Wrapper: Securely exposes powerful local utilities (ripgrep, jq, awk) as callable tools, enabling offline text manipulation.

4. Knowledge & Reflection

Cognitive Memory System: Hierarchical storage and retrieval of working, episodic, and semantic memories, crucial for persistent agent state and contextual awareness across multi-step tasks.
Entity Graph Construction: Extract entities and relationships from text to build and query structured knowledge graphs.
Workflow Orchestration: Execute defined, multi-stage pipelines where outputs from one tool dynamically feed inputs to subsequent tools, with automated dependency resolution.

5. Meta-Tools & Self-Improvement

Tool Discovery: Agents can introspect the server to list available tools, schemas, and descriptions dynamically.
Autonomous Documentation Refiner: A unique system that uses LLM ensembles to systematically test, analyze failures, and iteratively refine the schemas and docstrings of other tools to improve agent reliability.
Dynamic API Registration: Register external REST services on the fly by pointing the server to their OpenAPI specifications, instantly exposing them as new MCP tools.

🚀 Deployment & Integration

Communication is primarily handled via modern, high-throughput HTTP transport supporting bidirectional streaming (recommended for real-time feedback), adhering strictly to the MCP specification.

Getting Started: Configuration relies on environment variables (e.g., API keys, port settings) typically managed via a .env file. Server startup automatically detects and registers all available tools.

Security: Emphasizes least-privilege principle, path sandboxing for filesystem tools, and strict input validation. Production deployments should utilize a reverse proxy (Nginx/Caddy) for SSL termination, authentication, and rate limiting.