WebMCP

This skill should be used when browsing or automating web pages that expose tools via the WebMCP API (window.navigator.modelContext). It teaches agents how to discover, inspect, and invoke WebMCP tools on websites instead of relying on DOM scraping or UI actuation.

MIT-0 · Free to use, modify, and redistribute. No attribution required.
2 · 1.3k · 8 current installs · 10 all-time installs
byBruno Pérez@brunobuddy
MIT-0
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
Name/description describe discovering and invoking WebMCP (window.navigator.modelContext) tools. The skill is instruction-only and requests no binaries, environment variables, installs, or config access — all consistent with a browser-integration helper.
Instruction Scope
SKILL.md focuses on detecting modelContext, listing registered tools, validating input schemas, invoking execute callbacks, and handling user confirmation. It does not instruct reading unrelated files, accessing unrelated env vars, or sending data to external endpoints. Note: the instructions explicitly invoke page-provided execute callbacks (i.e., run code that the website registers), which by design will run arbitrary page-side logic — agents should ensure user consent and trust the site when performing sensitive actions.
Install Mechanism
No install spec and no code files — instruction-only. This minimizes disk footprint and attack surface; nothing is downloaded or installed by the skill itself.
Credentials
The skill declares no required environment variables, credentials, or config paths. The SKILL.md does not reference any hidden env vars or credentials beyond expected browser/agent interfaces.
Persistence & Privilege
always is false and the skill is user-invocable; it does not request permanent presence or modify other skills or system-wide settings. Autonomous invocation is allowed (platform default) but not elevated by the skill itself.
Assessment
This skill is coherent and does what it says: it tells an agent how to find and call WebMCP tools exposed by websites. The main risk is executing site-provided callbacks — those callbacks run page-side and can perform arbitrary actions within the page. Only use the skill when interacting with websites you trust for sensitive tasks, require or confirm user prompts before purchases or destructive actions, and prefer manual confirmation for anything that touches payments, personal data, or account changes.

Like a lobster shell, security has layers — review code before you run it.

Current versionv1.0.0
Download zip
latestvk976daakx65vsx0axcggh5h11n80xbpr

License

MIT-0
Free to use, modify, and redistribute. No attribution required.

SKILL.md

WebMCP — Discover and Use Website Tools

What is WebMCP

WebMCP is a browser API that lets websites expose JavaScript functions as structured tools for AI agents. Pages register tools via window.navigator.modelContext, each with a name, description, JSON Schema input, and an execute callback. Think of it as an MCP server running inside the web page itself.

Spec: https://github.com/webmachinelearning/webmcp

Detecting WebMCP Support

Before interacting with tools, check whether the current page supports WebMCP:

const supported = "modelContext" in window.navigator;

If false, the page does not expose WebMCP tools — fall back to DOM interaction or actuation.

Discovering Available Tools

Tools are registered by the page via provideContext() or registerTool(). The browser mediates access. To list available tools from an agent's perspective, evaluate:

// Browser-specific — the exact discovery API depends on the agent runtime.
// Typically the browser exposes registered tools to connected agents automatically.
// From page-script perspective, tools are registered like this:
window.navigator.modelContext.provideContext({
  tools: [
    {
      name: "tool-name",
      description: "What this tool does",
      inputSchema: { type: "object", properties: { /* ... */ }, required: [] },
      execute: (params, agent) => { /* ... */ }
    }
  ]
});

Key points:

  • Each tool has name, description, inputSchema (JSON Schema), and execute.
  • provideContext() replaces all previously registered tools (useful for SPA state changes).
  • registerTool() / unregisterTool() add/remove individual tools without resetting.
  • Tools may change as the user navigates or as SPA state updates — re-check after page transitions.

Tool Schema Format

Tool input schemas follow JSON Schema (aligned with MCP SDK and Prompt API tool use):

{
  name: "add-stamp",
  description: "Add a new stamp to the collection",
  inputSchema: {
    type: "object",
    properties: {
      name: { type: "string", description: "The name of the stamp" },
      year: { type: "number", description: "Year the stamp was issued" },
      imageUrl: { type: "string", description: "Optional image URL" }
    },
    required: ["name", "year"]
  },
  execute({ name, year, imageUrl }, agent) {
    // Implementation — updates UI and app state
    return {
      content: [{ type: "text", text: `Stamp "${name}" added.` }]
    };
  }
}

Invoking Tools

When connected as an agent, send a tool call by name with parameters matching inputSchema. The execute callback runs on the page's main thread, can update the UI, and returns a structured response:

// Response format from execute():
{
  content: [
    { type: "text", text: "Result description" }
  ]
}
  • Tools run sequentially on the main thread (one at a time).
  • execute may be async (returns a Promise).
  • The second parameter agent provides agent.requestUserInteraction() for user confirmation flows.

User Interaction During Tool Execution

Tools can request user confirmation before sensitive actions:

async function buyProduct({ product_id }, agent) {
  const confirmed = await agent.requestUserInteraction(async () => {
    return confirm(`Buy product ${product_id}?`);
  });
  if (!confirmed) throw new Error("Cancelled by user.");
  executePurchase(product_id);
  return { content: [{ type: "text", text: `Product ${product_id} purchased.` }] };
}

Always respect user denials — do not retry cancelled tool calls.

Agent Workflow

  1. Navigate to the target website.
  2. Check "modelContext" in window.navigator to confirm WebMCP support.
  3. Discover registered tools (names, descriptions, schemas).
  4. Select the appropriate tool based on the user's goal and the tool description.
  5. Invoke with correct parameters matching inputSchema.
  6. Read the structured response and relay results to the user.
  7. After SPA navigation or state changes, re-discover tools — the set may have changed.
  8. If no WebMCP tool fits the task, fall back to DOM-based interaction.

Important Constraints

  • Browser context required — tools only exist in a live browsing context (tab/webview), not headlessly.
  • Sequential execution — tool calls run one at a time on the main thread.
  • No cross-origin tool sharing — tools are scoped to the page that registered them.
  • Permission-gated — the browser may prompt the user before allowing tool access.
  • Tools are dynamic — SPAs may register/unregister tools based on UI state.

Files

1 total
Select a file
Select a file to preview.

Comments

Loading comments…