Browser Automation MCP Repositories
146 repositories in this category.
Chromium-Automation-Service-Linux
→
Provides web page interaction through Puppeteer, tailored to run in Linux display environments (including X11 and Wayland). Supports web scripting, visual capture, and JavaScript execution.
one-search-mcp
→
Enables web search, scraping, and content extraction from various websites using multiple search engines and scrapers. It supports local browser searches and integrates with tools like SearXNG, DuckDuckGo, and Bing for enhanced data retrieval.
chromium-mcp-controller
→
Programmatically manipulate a Google Chrome or Chromium browser instance, enabling navigation, element interaction, and data extraction to enhance automated workflows.
302AI_WebInteraction_Module
→
A Model Context Protocol (MCP) server that lets users direct web browser operations with natural language instructions. It streamlines web tasks and data retrieval without requiring extensive scripts.
mcp-server-browserbase
→
Automates web browsing and data extraction using cloud browser capabilities. Interacts with web pages, captures screenshots, and executes JavaScript in a secure environment.
mcp-configurable-puppeteer
→
Automate browser interactions to navigate web pages, capture screenshots, and execute JavaScript in a real browser environment. Customize browser automation options using Puppeteer with environment variables for flexibility.
mcp-servers
→
Control a headless browser for automated navigation, screenshot capture, and interaction with web page elements. Facilitates the creation of automation projects using the Model Context Protocol (MCP).
omnichat-fingerprint-toolkit
→
Aggregates messaging channels, including WhatsApp and Line, into a single view. Also supports browser evasion through User-Agent manipulation and fingerprint-spoofing verification, with a dedicated dashboard for real-time status.
CodeGuardian Nexus
→
An AI-integrated security enforcement agent that intercepts code generation from LLMs, performing static/dynamic analysis and orchestrating AI-driven patches, while also offering natural language-to-browser-test script generation and execution via Playwright.
mcp-browser-use
→
Connects MCP clients with web browsers seamlessly, utilizing existing language model setups without needing additional API keys. Enhances browser interactions through integrated language model capabilities.
Skyvern Automator
→
Connects AI applications to web browsers to carry out complex tasks such as entering data into forms, securely retrieving files, and gathering and summarizing online information. It can run locally with a preferred large language model (LLM) or through a cloud-based API service.
DesktopAgentControlModule
→
Enables AI agents to control the desktop operating system's graphical interface through simulated physical input, automating tasks in native applications.
WebInteractionOrchestrator
→
Automates web interfaces by reusing existing browser contexts, which preserves active login states and avoids launching separate browser instances. All browser operations run locally, prioritizing user privacy and efficiency.
Xiaohongshu Content Interaction Automation via MCP
→
A backend service for programmatic interaction with the Xiaohongshu (Redbook) platform. It uses Playwright to automate user authentication, keyword-based content search, retrieval of specific notes, and AI-driven comment posting. It runs as an MCP server for external clients and generates contextually relevant, natural-language comments from analyzed post data.
mcp-surface-interaction-controller
→
Control system interfaces through simulated input (cursor movement, keystrokes) and screen capture. Pair these functions with reasoning engines for context-aware operation.
web-agent-interface-mcp
→
Provides AI agents with a standardized service endpoint for web navigation and data extraction, giving large language models access to current web context.
mcp-ai-vision-debug-ui-automation
→
Capture and analyze website screenshots with AI-powered visual analysis. Generate comprehensive UI/UX reports and maintain context across multiple analysis steps for enhanced debugging. Streamline your visual testing process with precise file operations and automated insights.
mcp-sap-gui
→
Automate interactions with SAP GUI to control transactions programmatically, allowing for seamless integration and execution of various SAP tasks.
computer-control-mcp
→
Control computer functions programmatically using mouse and keyboard interactions, screen capture, and OCR capabilities. Integrate automation for clicking, typing, window management, and text extraction from screenshots to enhance workflows.
ai-e2e-driver
→
Provides AI-guided end-to-end testing for browser automation workflows. It uses Playwright for lightweight, predictable test execution and lets users write test logic in natural language.
virtual-desktop-orchestrator-mcp
→
Provides a control layer for managing virtual Ubuntu environments, allowing remote code execution, web navigation, and other interactions via the Model Context Protocol. Supports live feedback for client integration.
stealth-browser-mcp
→
Navigate websites and capture screenshots while bypassing bot detection systems using advanced stealth techniques. Modifies browser fingerprints to disguise web interactions as regular user traffic.
web-interaction-suite-http
→
Makes HTTP requests with full browser simulation to access online resources and get past bot defenses. Converts rendered HTML and PDF files into Markdown for easier ingestion by large language models.
browser-automation-toolkit-mcp
→
Observes and manipulates web page data, capturing console output, network traffic, and screenshots to support AI model development. It provides a secure, isolated environment that prioritizes data sovereignty while giving AI systems direct browser telemetry.
mcp-browser-use
→
Automate web browsing tasks using natural language commands to control browsers, conduct research, and generate reports. This server facilitates seamless integration of AI-driven browser automation into various workflows.
mcp-playwright
→
Automate web browsers using Playwright: interact with web pages, take screenshots, generate test code, scrape content, and execute JavaScript in a real browser environment.
interactive-web-diagnostics-engine
→
An MCP service for in-depth web application inspection. It runs user-defined workflows, captures network activity and console diagnostics, and surfaces the findings in the developer's coding environment to speed up the resolution of interface defects.
ui-interaction-agent-python-toolkit
→
Supports automation scripting on Windows by manipulating user interface components, capturing visual state, and controlling web browser instances. The library integrates with Python and adds AI-driven routines to improve productivity.
mcp-web-surfer
→
Provides headless control of web browser environments with remote procedure call (RPC) integration. It supports dynamic navigation, manipulation of the Document Object Model (DOM), and arbitrary JavaScript execution within a page, with stateful session management and request/response logging for complex web tasks.
mcp-server-browserbase
→
Automates interaction with web pages in a cloud browser environment, enabling tasks such as data extraction, screenshot capture, and JavaScript execution.
