Browser Automation MCP Repositories

146 repositories in this category.

Showing 30 of 146 repositories (Page 2 of 5)

Chromium-Automation-Service-Linux

Offers robust web page interaction via the Puppeteer framework, specifically tailored for operational execution within diverse Linux display environments (including X11 and Wayland setups), facilitating web scripting, visual capture, and JavaScript runtime execution.

GitHub

NPM

one-search-mcp

Enables web search, scraping, and content extraction from various websites using multiple search engines and scrapers. It supports local browser searches and integrates with tools like SearXNG, DuckDuckGo, and Bing for enhanced data retrieval.

GitHub

NPM

chromium-mcp-controller

Programmatically manipulate a Google Chrome or Chromium browser instance, enabling navigation, element interaction, and data extraction to enhance automated workflows.

GitHub

NPM

302AI_WebInteraction_Module

This specialized Model Context Protocol (MCP) server, designated 302AI_WebInteraction_Module, empowers users to direct web browser operations via intuitive, natural language instructions. It streamlines web task execution and data retrieval workflows, circumventing the need for writing extensive programming scripts.

GitHub

NPM

mcp-server-browserbase

Automates web browsing and data extraction using cloud browser capabilities. Interacts with web pages, captures screenshots, and executes JavaScript in a secure environment.

GitHub

NPM

mcp-configurable-puppeteer

Automate browser interactions to navigate web pages, capture screenshots, and execute JavaScript in a real browser environment. Customize browser automation options using Puppeteer with environment variables for flexibility.

GitHub

NPM

mcp-servers

Control a headless browser for automated navigation, screenshot capturing, and interaction with web page elements. Facilitates the creation of automation projects using the Multi-Context Protocol framework.

GitHub

NPM

Apache License 2.0

omnichat-fingerprint-toolkit

A unified communication conduit aggregating disparate messaging channels, including WhatsApp and Line, into a singular operational view. Simultaneously enhances browser evasion capabilities via advanced UserAgent manipulation techniques and robust fingerprint spoofing verification routines, featuring a dedicated dashboard for real-time status telemetry.

GitHub

NPM

CodeGuardian Nexus

An AI-integrated security enforcement agent that intercepts code generation from LLMs, performing static/dynamic analysis and orchestrating AI-driven patches, while also offering natural language-to-browser-test script generation and execution via Playwright.

GitHub

NPM

Apache License 2.0

mcp-browser-use

Connects MCP clients with web browsers seamlessly, utilizing existing language model setups without needing additional API keys. Enhances browser interactions through integrated language model capabilities.

GitHub

NPM

Apache License 2.0

Skyvern Automator

Facilitates the connection of sophisticated AI applications to web browsers, enabling the execution of complex digital tasks such as data entry into forms, secure retrieval of files, and comprehensive online information synthesis. It offers flexibility via a locally deployable setup leveraging a preferred Large Language Model (LLM) or through a robust cloud-based API service.

GitHub

NPM

GNU Affero General Public License v3.0

DesktopAgentControlModule

Enables AI agents to programmatically interface with the desktop operating system's graphical elements via simulated physical input, facilitating automated execution of tasks within native applications.

GitHub

NPM

WebInteractionOrchestrator

Facilitates automated manipulation of web interfaces by reusing established browser contexts, thereby preserving active login states and streamlining navigation across digital pages without instantiating separate browser environments. This dedicated backend ensures that all browser operations are executed locally, prioritizing user confidentiality and operational efficiency.

GitHub

NPM

Apache License 2.0

Xiaohongshu Content Interaction Automation via MCP

A robust backend service designed for programmatic engagement with Xiaohongshu (Redbook) platform functionalities. This tool leverages Playwright to manage automated workflows including secure user authentication, targeted keyword-based content discovery, detailed retrieval of specific note assets, and advanced, AI-driven comment dispatch. It is architected to integrate seamlessly as an MCP Server for external clients, enabling contextually relevant and natural language comment generation based on analyzed post data.

GitHub

NPM

mcp-surface-interaction-controller

Orchestrate physical system interfaces via simulated input methods (cursor manipulation, keystroke generation) and visual data acquisition. Augment these functions with cognitive reasoning engines for sophisticated, context-aware operation.

GitHub

NPM

web-agent-interface-mcp

Provides AI agents with a standardized service endpoint for executing web navigation and data extraction operations. This component injects contemporary web context capabilities into large language models.

GitHub

NPM

mcp-ai-vision-debug-ui-automation

Capture and analyze website screenshots with AI-powered visual analysis. Generate comprehensive UI/UX reports and maintain context across multiple analysis steps for enhanced debugging. Streamline your visual testing process with precise file operations and automated insights.

GitHub

NPM

mcp-sap-gui

mario-andreschak

Automate interactions with SAP GUI to control transactions programmatically, allowing for seamless integration and execution of various SAP tasks.

GitHub

NPM

computer-control-mcp

Control computer functions programmatically using mouse and keyboard interactions, screen capture, and OCR capabilities. Integrate automation for clicking, typing, window management, and text extraction from screenshots to enhance workflows.

GitHub

NPM

ai-e2e-driver

Facilitates comprehensive, AI-guided validation within browser automation workflows, specializing in end-to-end scenario coverage. It employs Playwright for lightweight, predictable test execution, enabling users to articulate test logic using natural language directives for efficient deployment.

GitHub

NPM

Apache License 2.0

virtual-desktop-orchestrator-mcp

Provides a control layer for managing simulated Ubuntu environments, allowing remote code execution, web navigation, and sophisticated interaction capabilities via the Model Context Protocol. Supports live operational feedback for enhanced client integration.

GitHub

NPM

stealth-browser-mcp

Navigate websites and capture screenshots while bypassing bot detection systems using advanced stealth techniques. Modifies browser fingerprints to disguise web interactions as regular user traffic.

GitHub

NPM

web-interaction-suite-http

Facilitates sophisticated Hypertext Transfer Protocol requests incorporating full browser simulation capabilities to engage with online resources and circumvent bot defenses. Converts rendered HTML and Portable Document Format files into Markdown format for superior ingestion by large language models.

GitHub

NPM

browser-automation-toolkit-mcp

An apparatus for observing and manipulating web page data, encompassing the capture of console output, network transactions, and visual snapshots to bolster AI model development. It delivers a secure, isolated environment prioritizing data sovereignty while enriching artificial intelligence systems with direct browser telemetry.

GitHub

NPM

mcp-browser-use

Automate web browsing tasks using natural language commands to control browsers, conduct research, and generate reports. This server facilitates seamless integration of AI-driven browser automation into various workflows.

GitHub

NPM

mcp-playwright

Automate web browsers using Playwright, allowing interaction with web pages, the ability to take screenshots, generate test code, scrape content, and execute JavaScript in a real browser environment.

GitHub

NPM

interactive-web-diagnostics-engine

A specialized MCP service engineered for deep web application inspection. It automatically executes user-defined workflows, captures intricate operational telemetry (network choreography and console diagnostics), and surfaces these findings directly within the developer's integrated coding environment to accelerate the resolution of interface defects.

GitHub

NPM

Apache License 2.0

ui-interaction-agent-python-toolkit

fstandhartinger

Facilitates sophisticated operational scripting on Windows environments by programmatically manipulating user interface components, capturing visual state data, and directing web browser instances. This library integrates within Python ecosystems to augment productivity via advanced artificial intelligence routines.

GitHub

NPM

mcp-web-surfer

Enables sophisticated, non-graphical control over web browser environments and subsequent remote procedure call integration. This tool allows for dynamic navigation, manipulation of the Document Object Model (DOM), and arbitrary JavaScript execution within a web page context, featuring stateful session management and comprehensive request/response logging capabilities for intricate web task orchestration.

GitHub

NPM

Mozilla Public License 2.0

mcp-server-browserbase

Automates interaction with web pages in a cloud browser environment, enabling tasks such as data extraction, screenshot capture, and JavaScript execution.

GitHub

NPM

← Previous 1 2 3 4 5 ... 5 Next →

Go to page:

← Back to MCP Directory 🙏 Credits & Acknowledgments