Browser Automation MCP Repositories
146 repositories in this category.
WebInterface-for-LLMs-Self-Hosted
→
A robust, entirely local web interface designed for superior interaction with large language models. It integrates critical features such as voice communication and rich Markdown rendering, while maintaining compatibility across diverse LLM execution environments.
fetcher-mcp
→
Retrieve web page content using a Playwright headless browser to navigate and extract information efficiently. Designed for easy setup and configuration, it leverages AI to streamline web scraping tasks.
LightGoControl
→A Go implementation of an MCP facilitator designed to orchestrate operations with the Lightpanda ultra-responsive, non-visual web navigation utility.
mcp
→
Provides web scraping, structured data extraction, and web crawling capabilities. Integrates browser automation agents for tasks related to data retrieval from webpages.
puppeteer-control
→
A powerful JavaScript toolkit for programmatically directing Chromium or Firefox instances via an elevated JS interface, enabling complex web data extraction, functional validation, and user interaction simulation in environments that are either visually hidden (headless) or fully rendered.
Cloudflare-Worker-Powered Web Automation Engine
→
A high-performance utility for programmatic browser manipulation and quality assurance procedures, leveraging Playwright capabilities. It orchestrates web navigation, element manipulation, and artifact capture (visuals/snapshots). Deployed via Cloudflare Workers, this tool significantly bolsters Large Language Model-driven task execution workflows with robust, real-time browser control.
playwright-plus-python-mcp
→
Provides browser automation capabilities for web navigation, interaction, content retrieval, and screenshot capture using Playwright. Includes a note storage system for summarizing notes with customizable detail levels.
mcp-aoai-web-browsing
→
An MCP server for web browsing automation that integrates Azure OpenAI to facilitate interactions with web applications through an automated interface. It utilizes Playwright for end-to-end testing and customizes responses to fit the OpenAI function calling format.
mcp-playwright-stealth
→
Automates browser interactions, enabling LLMs to navigate web pages, capture screenshots, generate test scripts, and execute JavaScript in real-time.
playwright-consolelogs-mcp
→
Open a browser to monitor console logs and track network requests for improved debugging and analysis. Retrieve structured log data and network activity while maintaining a clean environment post-session.
browser-mcp
→
Interact with the browser to execute commands, modify styles, and access browsing history. Enables retrieval of content from the current page in markdown format and customization of page styles directly through an MCP interface.
youtube-operator-suite
→A comprehensive toolkit, implemented as an MCP endpoint and command-line interface, designed for the programmatic management and automation of all facets of the YouTube platform.
BrowserAgent-Interface
→
Facilitate smooth engagement with sophisticated AI entities through an intuitive graphical front-end, supporting a diverse array of Large Language Models while preserving active browser contexts across sequential operations for enhanced productivity. The backend system incorporates capabilities for premium-quality visual session capture and bespoke browser modifications without necessitating repetitive sign-in procedures.
llm-driven-web-agent
→
Facilitate the programmatic control of web environments using natural language prompts, enabling tasks such as site navigation, data form submission, and element interaction via an integrated language model interface.
browser-control-mcp
→
Control and manage local browser tabs and history, perform web content reading and searching, and integrate browsing capabilities with AI agents. Offers features such as tab management and text highlighting within web pages.
mcp-server-browser
→Browser automation capabilities using Puppeteer, both support local and remote browser connection.
mcp-server-browserbase
→
The Browserbase MCP Server automates web browsing tasks in the cloud, allowing AI applications to navigate websites, capture screenshots, and run JavaScript code. This makes it easier to gather data from the web and integrate it into various AI solutions.
appium-mcp-driver
→
Facilitates native mobile application control via Appium endpoints, adhering to the Model Context Protocol (MCP). Enables sophisticated element manipulation, application lifecycle management, device state modification, and execution of complex touch sequences through standardized remote procedure calls. Accelerates continuous integration cycles across a broad spectrum of mobile hardware.
bilibili-mcp
→A FastMCP-based tool that fetches Bilibili's trending videos and exposes them via a standard MCP interface.
ashra-mcp
→Extract structured data from any website. Just prompt and get JSON.
mcp-browser-tabs
→
Manage and retrieve information about currently open Chrome browser tabs, enabling interactions with the browser's tab content and control functions.
ai-driven-web-interface-verification-suite-mcp
→
Provides automated, AI-powered visual inspection, quality assurance, and performance auditing for web applications utilizing the Playwright automation framework for deep browser interaction.
mobile-mcp-toolkit
→
A platform-agnostic Model Context Protocol (MCP) server designed to orchestrate interactions with native mobile applications across both iOS and Android environments. It leverages structured accessibility metadata or image-derived coordinates for precise control, enabling sophisticated, LLM-driven automation of complex user flows, testing routines, and data manipulation tasks on simulators, emulators, and physical handsets.
playwright-mcp-server
→
Control browsers for web interaction and automate tasks, including content retrieval and simulation of mouse operations on web pages.
uiautomator2-mcp
→
Provides automation capabilities for Android devices, including command execution, app management, and UI interactions such as clicks and text inputs. Supports seamless control of Android devices or emulators with integrated OCR and device management features.
chromium-devtools-interface-server
→
Interface for controlling Google Chrome instances via the DevTools Protocol (CDP). Facilitates tab management, script injection, visual data extraction (screenshots), and network payload interception.
UniServe-Automation-Core
→
A centralized server for managing sophisticated web and desktop interactions, featuring resilient self-correction logic and deep large language model (LLM) synergy, designed to ensure continuous test viability and streamlined case orchestration.
rpg
→
A browser-based multiplayer RPG that facilitates empire expansion, strategic battles, and resource management among players. It supports the creation and sharing of platformer mini-games, offering rewards and robust community management features.
auto-spectrum-toolkit-server
→
A robust MCP endpoint designed to orchestrate complex digital interactions, encompassing rigorous quality assurance procedures such as visual regression analysis and accessibility auditing across web interfaces. It facilitates sophisticated programmatic interaction testing for REST/GraphQL endpoints and offers deep integration pathways with advanced cognitive architectures.
browser-use-mcp
→
Automate browser tasks using the Browser Use API. Manage and monitor automation tasks, including the ability to run, pause, and resume operations while retrieving task information.
