google-web-retriever
A Model Context Protocol (MCP) service facilitating zero-cost web information retrieval leveraging public Google search results, circumventing the need for proprietary API credentials.
Author
pskill9
Quick Info
Actions
Tags
Google Web Information Retrieval Service
This Model Context Protocol (MCP) utility provides unrestricted access to information indexed by Google search, crucially requiring no external API subscriptions or authorization tokens.
Core Capabilities
- Access data derived directly from Google's public search index.
- Operates entirely without requiring API key management or authentication setup.
- Delivers results in a structured format, including document titles, source URIs, and brief abstracts.
- Features adjustable limits on the quantity of retrieved items per execution.
Deployment Instructions
- Obtain or clone the source code repository.
-
Install necessary project dependencies: bash npm install
-
Compile the server assets: bash npm run build
-
Integrate this service into your MCP ecosystem configuration:
For VSCode (Utilizing the Claude Developer Extension):
{ "mcpServers": { "google-web-retriever": { "command": "node", "args": ["/path/to/repository/build/index.js"] } } }
For Claude Desktop Environment:
{ "mcpServers": { "google-web-retriever": { "command": "node", "args": ["/path/to/repository/build/index.js"] } } }
Operational Guidelines
The service exposes one primary functional interface named fetch_results, which accepts the following parameters:
typescript { "query": string, // The specific information request string "limit": number // Optional: The maximum count of results desired (default: 5, maximum allowed: 10) }
Invocation Example: typescript use_mcp_tool({ server_name: "google-web-retriever", tool_name: "fetch_results", arguments: { query: "latest developments in quantum computing", limit: 5 // Optional specification } })
Expected Output Schema:
[ { "title": "Result Title Snippet", "url": "https://source-document.net/page", "description": "A concise summary extracted from the source page content..." } ]
Caveats and Constraints
As this utility directly scrapes public Google Search Engine Result Pages (SERPs), users must adhere to the following operational constraints:
- Request Throttling: Excessive querying within brief intervals may trigger temporary service restriction measures imposed by Google. To mitigate this:
- Maintain a measured frequency of information requests.
- Exercise prudence when setting the
limitparameter. -
Consider integrating programmatic pauses between subsequent calls.
-
Data Fidelity: The tool's reliability depends on the consistency of Google's underlying HTML structure, which is subject to modification.
- Metadata such as abstracts may occasionally be absent or incomplete.
-
Advanced search directives might yield unpredictable indexing.
-
Compliance Note: This instrument is designed strictly for private, non-commercial informational gathering.
- Adherence to Google's published terms of service is mandatory.
- Implementing internal mechanisms to manage request volume is strongly recommended.
Contributions
Feedback, bug reports, and feature suggestions are warmly welcomed via repository issue tracking!
