Agent-Browser-Bridge-AI

Agent-Browser-Bridge-AI — Anti-detection browser control for AI agents. DOM-first, human-like interactions (Bezier), lead gen, extraction, MCP.

Install

openclaw skills install @alexandre-leng/agentbridge

AgentBridge 🌉 — Anti-Detection Browser Control for AI Agents

The browser that doesn't look like a bot. Full anti-detection browser bridge for AI agents. DOM-first interactions, human-like mouse movements (Bezier curves), typing jitter, stealth anti-fingerprinting, and MCP-native integration (15 tools + 1 raw tool).

📋 Complete Feature Reference

1. 🕹️ Browser Control

Available via CLI command or MCP tool.

CLI command	MCP tool	Action
`navigate <url>`	`navigate`	Go to any URL with polite waiting
`annotate`	`annotate_page`	Screenshot + numbered DOM element tree with refs
`click <ref>`	`click_ref`	Click element by numbered ref (found in annotate)
`type <ref> <text>`	`type_ref`	Type text into element (clears field first)
`press <key>`	via raw	Press keyboard key (Enter/Tab/Escape/...)
`scroll [dir] [amount]`	via raw	Scroll viewport (down/up, default 300px)
`discover [steps] [px]`	via raw	Scroll step by step and capture each page state
`screenshot [--full-page]`	via raw	Capture page screenshot (viewport or full-page)
`back` / `forward`	via raw	Browser history with human-like pauses
`summary`	via raw	Page metadata: URL, title, interactive element count

2. 📄 Structured Extraction

CLI command	Extract types available	Details
`extract article`	`article`	Full article: title + paragraphs + headings
`extract table`	`table`	HTML `<table>` + ARIA `role="grid"`
`extract form`	`form`	All form fields with labels, types, placeholders
`extract listings`	`listings`	Directory/listings: names, ratings, reviews, addresses, phones
`extract marketplace`	`marketplace`	E-commerce: titles, prices, images, delivery, sponsored flags
Additional types via JSON-RPC	`search-results`, `google-maps`, `custom` (CSS/XPath), `schema`	Accessible via MCP or WebSocket

Common options: --limit=N | --format=json|csv | --out=file.csv
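The `--limit` and `--format=csv` options suggest that extracted records are flat enough to be written as rows. The sketch below shows one way such records could be serialized; `Row`, `csvCell`, `toCsv`, and the sample fields are hypothetical names for illustration, not AgentBridge's actual serializer.

```typescript
// Hypothetical record shape: one flat object per extracted item.
type Row = Record<string, string | number | boolean | null>;

// Quote a cell only when it contains a comma, a double quote, or a line break.
function csvCell(value: string | number | boolean | null): string {
  const text = value === null ? "" : String(value);
  return /[",\r\n]/.test(text) ? '"' + text.replace(/"/g, '""') + '"' : text;
}

// Serialize up to `limit` rows; the header is the union of keys in first-seen order.
function toCsv(rows: Row[], limit: number = rows.length): string {
  const selected = rows.slice(0, limit);
  const columns: string[] = [];
  for (const row of selected) {
    for (const key of Object.keys(row)) {
      if (columns.indexOf(key) === -1) columns.push(key);
    }
  }
  const lines = [columns.map(csvCell).join(",")];
  for (const row of selected) {
    // Missing fields become empty cells so every line has the same column count.
    lines.push(columns.map((column) => csvCell(row[column] ?? null)).join(","));
  }
  return lines.join("\n");
}

const sampleRows: Row[] = [
  { title: 'Desk lamp, "Nova"', price: 29.9 },
  { title: "USB hub", price: 12, sponsored: true },
];
console.log(toCsv(sampleRows));
```

Taking the union of keys for the header keeps the output rectangular even when some items lack a field, which is common for listing data.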

3. 📧 Lead Generation Pipeline

CLI command	Action
`scrape-emails <query>`	🔥 Full pipeline: search engine → visit each result page → extract all emails → save to CSV
Options	`--limit=N` (default 20)
`extract-emails <url>`	Navigate to URL and extract all email addresses found in HTML + visible text
`extract-phones <url>`	Navigate to URL and extract phone numbers (uses libphonenumber-js, supports French format +33)
`scrape`	Extract marketplace/listing results from current page (alias for `extract marketplace`)
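The extraction step of `extract-emails` (scan both the HTML and the visible text, then deduplicate) can be illustrated with a small sketch. The pattern and the `extractEmails` helper below are assumptions for illustration; the tool's real matching rules are not documented here.

```typescript
// Simplified pattern: local part, "@", then a domain ending in an alphabetic TLD.
const EMAIL_PATTERN = /[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)*\.[A-Za-z]{2,}/g;

// Collect addresses from both sources, lowercase them, and return a sorted unique list.
function extractEmails(html: string, visibleText: string): string[] {
  const seen: Record<string, true> = {};
  for (const source of [html, visibleText]) {
    for (const match of source.match(EMAIL_PATTERN) ?? []) {
      seen[match.toLowerCase()] = true;
    }
  }
  return Object.keys(seen).sort();
}

const sampleHtml = '<a href="mailto:Sales@Example.com">Email us</a>';
const sampleText = "Contact sales@example.com or support@example.co.uk.";
console.log(extractEmails(sampleHtml, sampleText));
```

Scanning the raw HTML as well as the visible text catches `mailto:` links whose label is not an address. A pattern this simple also matches asset names such as `logo@2x.png`, so a production pipeline would need extra filtering.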

4. 🔍 Visible Text

CLI command	Options
`visibleText`	`--limit=N`

Cherry-pick visible DOM text by filter, or extract only emails/phones from visible content.

5. 🌐 Web Search

CLI command	MCP tool	Action
`webSearch <query>`	`web_search`	Search Google/Bing, auto-paginate until limit reached, deduplicate results
Options	`--limit=N` (default 10)	`--engine=google`
`siteSearch <query>`	`site_search`	Detect and use the current page's search form automatically
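"Auto-paginate until limit reached, deduplicate results" describes a simple accumulation loop. The sketch below shows that loop over result pages that have already been fetched; `SearchResult`, `normalizeUrl`, and `collectResults` are hypothetical names, and the normalization rule (ignore fragments and trailing slashes) is an assumption rather than the tool's documented behavior.

```typescript
interface SearchResult {
  title: string;
  url: string;
}

// Assumption: fragments and trailing slashes do not distinguish two results.
function normalizeUrl(raw: string): string {
  const url = new URL(raw);
  const path = url.pathname.replace(/\/+$/, "");
  return url.protocol + "//" + url.host + path + url.search;
}

// Walk result pages in order, drop duplicates, and stop once `limit` is reached.
function collectResults(pages: SearchResult[][], limit: number): SearchResult[] {
  const seen: Record<string, true> = {};
  const results: SearchResult[] = [];
  if (limit <= 0) return results;
  for (const page of pages) {
    for (const item of page) {
      const key = normalizeUrl(item.url);
      if (seen[key]) continue;
      seen[key] = true;
      results.push(item);
      if (results.length >= limit) return results;
    }
  }
  return results;
}

const resultPages: SearchResult[][] = [
  [{ title: "A", url: "https://example.com/a" }, { title: "B", url: "https://example.com/b/" }],
  [{ title: "B again", url: "https://example.com/b#top" }, { title: "C", url: "https://example.com/c?x=1" }],
  [{ title: "D", url: "https://example.com/d" }],
];
console.log(collectResults(resultPages, 3).map((r) => r.title));
```

A live implementation would fetch the next page only when the limit has not yet been met; the pages are supplied up front here to keep the example self-contained.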

6. 🧍 Human Behavior Emulation

Every interaction goes through the human behavior layer. These commands add extra human-like activity:

CLI command	What it does
`scan`	Read visible text → scroll → pause → repeat (like a human researcher)
`findText <text>`	Search for visible text on page, auto-scrolling until found or max scrolls reached
`clickText <text>`	Find visible text AND click it — coordinates first, falls back to agent.click(ref)
`idle [ms]`	Pause with random cursor movements — looks human even while "doing nothing"
`jitter [radius] [moves]`	Small cursor hesitation movements (default radius 18px, 4 moves)
`skim [steps] [px]`	Scroll through content with natural reading pauses
`backtrack`	Small upward scroll + pause (emulates re-reading something)
`focusCycle [n]`	Press Tab through focusable controls with natural pauses
`wait [ms]`	Wait N milliseconds (default: 2000)
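To make the `jitter [radius] [moves]` defaults concrete, here is a minimal sketch that generates hesitation targets around the current cursor position. The uniform-disc distribution, the `jitterPoints` name, and the injectable random source are assumptions; only the defaults (18px radius, 4 moves) come from the table above.

```typescript
interface Point {
  x: number;
  y: number;
}

// Generate `moves` hesitation targets within `radius` pixels of the current position.
function jitterPoints(
  current: Point,
  radius: number = 18,
  moves: number = 4,
  rng: () => number = Math.random,
): Point[] {
  const points: Point[] = [];
  for (let i = 0; i < moves; i++) {
    const angle = rng() * 2 * Math.PI;
    // The square root spreads points evenly over the disc instead of clustering them at the centre.
    const distance = Math.sqrt(rng()) * radius;
    points.push({
      x: current.x + Math.cos(angle) * distance,
      y: current.y + Math.sin(angle) * distance,
    });
  }
  return points;
}

console.log(jitterPoints({ x: 400, y: 300 }));
```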

7. ⏱️ Timing & Anti-Spam

CLI command	What it does
`timing get`	Show current human timing profile: consultSpeed, WPM ranges, pause ranges
`timing set key=value ...`	Adjust any timing parameter live (`consultSpeed`, `focusedWpmMin/Max`, etc.)
`timing reset`	Restore default timing profile
`antispam`	Check current page for anti-bot / anti-spam blocking text — non-throwing

The timing system has 10 adjustable parameters controlling reading speed, pause duration, scroll behavior, and feedback intervals.
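A sketch of how `timing set key=value ...` arguments could be validated against the ranges listed for the timing profile in the Anti-Detection System section. Whether AgentBridge clamps out-of-range values or rejects them is not stated, so the clamping below is an assumption, and `RANGES` and `applyTimingArgs` are hypothetical names. Parameters without a documented range (`minSkimMs`, `maxSkimMs`, `feedbackIntervalMs`) are left out.

```typescript
// Ranges copied from the timing profile row; the remaining three parameters have no documented range.
const RANGES: Record<string, [number, number]> = {
  consultSpeed: [0.25, 8],
  focusedWpmMin: [80, 500],
  focusedWpmMax: [80, 650],
  skimWpmMin: [100, 700],
  skimWpmMax: [100, 850],
  minFocusedMs: [0, 120000],
  maxFocusedMs: [500, 180000],
};

// Apply `key=value` arguments to a copy of the profile, clamping each value to its range.
// Unknown keys and non-numeric values are ignored (assumed behavior).
function applyTimingArgs(profile: Record<string, number>, args: string[]): Record<string, number> {
  const next: Record<string, number> = { ...profile };
  for (const arg of args) {
    const [key, rawValue] = arg.split("=");
    if (key === undefined) continue;
    const range = RANGES[key];
    const value = Number(rawValue);
    if (range === undefined || !isFinite(value)) continue;
    next[key] = Math.min(range[1], Math.max(range[0], value));
  }
  return next;
}

const baseProfile = { consultSpeed: 1, focusedWpmMin: 180 };
console.log(applyTimingArgs(baseProfile, ["consultSpeed=20", "focusedWpmMin=10"]));
```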

8. ⚡ Automation

CLI command	What it does
`run <cmd1> <args1> ...`	Chain multiple commands in sequence, preserving browser state between them
`batch <recipe.json>`	Execute multiple commands from a JSON recipe file (lightweight, no variable interpolation)
`repl`	Interactive REPL — type commands live, see results instantly
`start`	Launch the bridge server

9. 🖥️ Live Viewer

Web GUI → http://localhost:8080/viewer

Watch the browser in real time. Debug interactions, inspect the page, take over manually when needed.

🧩 MCP Integration (16 tools)

AgentBridge exposes an MCP server with the following tools:

Tool	Input	Description
`browser_status`	—	Check browser connection health, return current URL and page title
`navigate`	`{ url, autoAnnotate? }`	Navigate to any http(s) URL
`annotate_page`	`{ noImage? }`	Return interactive page elements with stable numeric refs + screenshot URL
`click_ref`	`{ ref: number | string }`	Click an element by its ref (as returned by `annotate_page`)
`type_ref`	`{ ref, text, clearFirst? }`	Type text into an element by ref
`inspect_forms`	—	Map all visible forms: fields, labels, types, options, selectors
`fill_form`	`{ values: {} | fields: [] }`	Fill form fields from a values object or a fields list
`submit_form`	`{ query? }`	Submit the active form via submit button or Enter key
`site_search`	`{ query, field? }`	Find and use the current page's search form automatically
`web_search`	`{ query, engine?, limit?, pages? }`	Search Google/Bing/DuckDuckGo with auto-pagination and dedup
`extract_schema`	`{ schema: { fields } }`	Extract structured data via CSS selectors or XPath
`extract_marketplace`	`{ limit?, format? }`	Extract e-commerce listing cards with title, price, image, delivery flags
`human_timing_get`	—	Current consultation timing profile (10 parameters)
`human_timing_set`	`{ consultSpeed?, wpmMin/Max? }`	Adjust timing to speed up or slow down browsing
`human_antispam_check`	—	Check page for anti-bot blocking without throwing
`browser_command` (raw)	`{ type, payload }`	🔐 Run ANY bridge command (requires `BRIDGE_MCP_ALLOW_RAW=1`)

The server also exposes a resource (`agentbridge://api`) that lists all registered command names, and a prompt template (`browser_task`) for asking agents to complete browser-based goals.
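MCP hosts invoke these tools with JSON-RPC 2.0 `tools/call` requests. The sketch below builds three such requests using the tool names and input fields from the table above. The `toolCall` helper and the argument values are illustrative; an MCP host normally constructs these messages itself, and performs an initialization handshake before the first call.

```typescript
// Shape of an MCP `tools/call` request (JSON-RPC 2.0).
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

let nextRequestId = 1;

// Hypothetical helper: wrap a tool name and its input in a request envelope.
function toolCall(name: string, args: Record<string, unknown> = {}): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id: nextRequestId++,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

const requests: ToolCallRequest[] = [
  toolCall("navigate", { url: "https://example.com", autoAnnotate: true }),
  toolCall("click_ref", { ref: 3 }),
  toolCall("type_ref", { ref: 5, text: "hello world", clearFirst: true }),
];

// Over the stdio transport, each message is sent as a single line of JSON.
for (const request of requests) {
  console.log(JSON.stringify(request));
}
```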

🛡️ Anti-Detection System

Feature	How it works (source-verified)
Bezier cursor curves	Every mouse movement uses cubic Bezier curves with 24-90 adaptive steps — no straight lines
Cursor position tracking	Server-side cursor state (`_curX`, `_curY`) — every move starts from last known position, never from (0,0)
Click delay	Random 15-60ms delay between mouse down and mouse up
Typing jitter	Each character typed with variable per-keystroke delay (15-180ms)
Typing jitter standard deviation	0.45 × the mean per-character delay
Timing profile (10 params)	`consultSpeed` (0.25-8), `focusedWpmMin` (80-500), `focusedWpmMax` (80-650), `skimWpmMin` (100-700), `skimWpmMax` (100-850), `minFocusedMs` (0-120000), `maxFocusedMs` (500-180000), `minSkimMs`, `maxSkimMs`, `feedbackIntervalMs`
Stealth script	Patches `navigator.webdriver` → `undefined`, injects full `window.chrome` runtime mock, sets `navigator.plugins` with PDF viewer entries, spoofs `languages`, `platform`, `userAgent`, WebGL vendor/renderer, `navigator.hardwareConcurrency`
Cookie auto-accept	Auto-detects and clicks "Accept all" / "Tout accepter" / "Accepter" on Google, cookie banners, and common consent dialogs
Flash click	Visual click indicator (green circle animation) — visible when not headless
Anti-spam check	Inspects page for known anti-bot patterns without throwing — returns a simple blocked / not-blocked status
Human behavior layer	Every interaction runs through `humanPreClick` → `humanMove` (Bezier) → click → `flashClick` → `assertNoAntiBot`
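Two rows of the table above, the Bezier cursor curves and the typing jitter, lend themselves to a short illustration. The sketch below is written under stated assumptions: the distance-to-steps heuristic, the control-point placement, and the Gaussian delay model are guesses for illustration, and only the numeric bounds (24-90 steps, 15-180ms, standard deviation of 0.45 × mean) come from the table. `humanPath` and `keystrokeDelay` are hypothetical names, not AgentBridge functions.

```typescript
interface Point {
  x: number;
  y: number;
}
type Rng = () => number;

// Evaluate a cubic Bezier curve at parameter t in [0, 1].
function cubicBezier(p0: Point, p1: Point, p2: Point, p3: Point, t: number): Point {
  const u = 1 - t;
  const a = u * u * u;
  const b = 3 * u * u * t;
  const c = 3 * u * t * t;
  const d = t * t * t;
  return {
    x: a * p0.x + b * p1.x + c * p2.x + d * p3.x,
    y: a * p0.y + b * p1.y + c * p2.y + d * p3.y,
  };
}

// Assumed heuristic: one step per 10px of travel, clamped to the documented 24-90 range.
function stepCount(from: Point, to: Point): number {
  const dx = to.x - from.x;
  const dy = to.y - from.y;
  return Math.min(90, Math.max(24, Math.round(Math.sqrt(dx * dx + dy * dy) / 10)));
}

// Build a curved path that starts at the last known cursor position and ends on the target.
function humanPath(from: Point, to: Point, rng: Rng = Math.random): Point[] {
  const dx = to.x - from.x;
  const dy = to.y - from.y;
  // Push both control points sideways (perpendicular to the direct line) by up to 20% of the distance.
  const bow = (rng() - 0.5) * 0.4;
  const c1 = { x: from.x + dx * 0.3 - dy * bow, y: from.y + dy * 0.3 + dx * bow };
  const c2 = { x: from.x + dx * 0.7 - dy * bow, y: from.y + dy * 0.7 + dx * bow };
  const steps = stepCount(from, to);
  const path: Point[] = [];
  for (let i = 0; i <= steps; i++) {
    path.push(cubicBezier(from, c1, c2, to, i / steps));
  }
  return path;
}

// Per-keystroke delay: Gaussian around `meanMs` with sd = 0.45 * mean, clamped to 15-180ms.
function keystrokeDelay(meanMs: number, rng: Rng = Math.random): number {
  const u1 = Math.max(rng(), 1e-12); // avoid log(0)
  const u2 = rng();
  const gaussian = Math.sqrt(-2 * Math.log(u1)) * Math.cos(2 * Math.PI * u2); // Box-Muller transform
  const delay = meanMs + gaussian * 0.45 * meanMs;
  return Math.min(180, Math.max(15, Math.round(delay)));
}

const cursorPath = humanPath({ x: 0, y: 0 }, { x: 400, y: 300 });
console.log(cursorPath.length, "points;", "sample delay:", keystrokeDelay(80), "ms");
```

Passing the random source as a parameter keeps both functions deterministic under test while defaulting to `Math.random` in normal use.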

Stealth patches verified in source (`src/browser/stealth.ts`):

navigator.webdriver → undefined
window.chrome mock with runtime, app, loadTimes, csi
navigator.plugins → 5 plugins including "Chrome PDF Plugin", "PDF Viewer", "Chrome PDF Viewer"
navigator.languages → ["en-US", "en"]
navigator.platform → "Win32"
navigator.hardwareConcurrency → 4 or 8
WebGL vendor/renderer spoofing
__name polyfill for esbuild compatibility
Screen resolution and color depth normalization
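Several of the patches listed above come down to redefining a property with a getter. The sketch below shows that mechanism on a stand-in object so it can run outside a browser; it is not the contents of `src/browser/stealth.ts`, and `overrideGetter` and `applyStealthPatches` are hypothetical names.

```typescript
// Replace a property with a getter that always returns `value`.
function overrideGetter(target: object, property: string, value: unknown): void {
  Object.defineProperty(target, property, {
    get: () => value,
    configurable: true,
    enumerable: true,
  });
}

// Apply a subset of the listed patches to a navigator-like object.
function applyStealthPatches(nav: object, cores: 4 | 8 = 8): void {
  overrideGetter(nav, "webdriver", undefined);
  overrideGetter(nav, "languages", ["en-US", "en"]);
  overrideGetter(nav, "platform", "Win32");
  overrideGetter(nav, "hardwareConcurrency", cores);
}

// Stand-in for `navigator`; in a page, the real object would be passed instead.
const fakeNavigator: Record<string, unknown> = {
  webdriver: true,
  platform: "Linux x86_64",
  hardwareConcurrency: 64,
};
applyStealthPatches(fakeNavigator);
console.log(fakeNavigator["webdriver"], fakeNavigator["platform"], fakeNavigator["hardwareConcurrency"]);
```

In a real page, overrides like these only help if they run before any page script does; with CDP that is typically arranged through `Page.addScriptToEvaluateOnNewDocument`.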

🏆 Anti-Detection Bypass Comparison

Tool	YouTube	Google Search	JS-heavy sites
curl / wget	❌ 429 / captcha	❌ Blocked	❌ JS required
Puppeteer-Extra + StealthPlugin	❌ "Sign in" overlay	❌ Detected	⚠️ Partial
Playwright + stealth	❌ Blocked	❌ Detected	⚠️ Partial
Selenium + undetected-chromedriver	❌ Blocked	❌ Detected	❌ Blocked
Camoufox	❌ Sign-in prompt	⚠️ Partial	⚠️ Partial
yt-dlp (no cookies)	❌ LOGIN_REQUIRED	N/A	N/A
Tor Browser	❌ Blocked by most	❌ CAPTCHA loop	❌ Blocked
AgentBridge + Chrome CDP	✅ Full access	✅ Works	✅ Works

Tested on Ubuntu 26.04 with headless Chromium 149 via the Chrome DevTools Protocol (CDP fallback mode, which works without Playwright).

🔧 Use Cases

Bug Bounty & Security Research — Automate recon on authenticated sessions, bypass WAF, extract findings from JS-heavy apps
Lead Generation — scrape-emails "AI consultants France" --limit=100 --out=leads.csv --fast
Web Scraping — Extract structured data from JS-rendered sites (marketplaces, directories, SERPs)
Market Research — Competitor price monitoring, product catalog scraping, directory extraction
AI Training Data — Collect real-world content from protected sites without getting blocked
YouTube Research — Access video metadata, descriptions, comments, and channel data without cookie/login wall
SaaS Automation — Fill multi-step forms, navigate dashboards, extract reports
Data Pipelines — Integrate with Claude, Cursor, Windsurf via MCP, or build custom workflows with run / batch

🚀 Quick Start

# 1. Install from npm
npm install -g browser-agentbridge-ai
# Only needed when working from a source checkout (requires the project's package.json)
npm run build

# 2. Start the bridge server
agentbridge start
# → WebSocket: ws://localhost:8080/ws/browser-bridge
# → Live GUI:  http://localhost:8080/viewer

# 3. Navigate to any page
agentbridge navigate https://example.com

# 4. Analyze the page (returns numbered elements)
agentbridge annotate

# 5. Interact with numbered elements
agentbridge click 3
agentbridge type 5 "hello world"
agentbridge press Enter

# 6. Extract content
agentbridge extract article --format=json
agentbridge extract-emails https://example.com/contact

# 7. Full lead generation pipeline
agentbridge scrape-emails "AI consultants France" --limit=50 --out=leads.csv --fast

# 8. Web search with results
agentbridge webSearch "latest AI security tools 2026" --limit=20 --out=results.json

# 9. Interactive REPL
agentbridge repl

Without Playwright (CDP Fallback)

Works on any system with Chrome — no Playwright dependency:

# Launch Chrome headless with remote debugging
google-chrome --headless=new --no-sandbox --remote-debugging-port=9222 &
sleep 2

# Get the WebSocket URL
WS_URL=$(curl -s http://127.0.0.1:9222/json/version | \
  python3 -c "import sys,json; print(json.load(sys.stdin)['webSocketDebuggerUrl'])")

# Point AgentBridge at the running Chrome instance
export CHROME_CDP_URL="$WS_URL"
export BRIDGE_HEADLESS=true
npx agentbridge start

MCP Integration

# Start the MCP server (stdin/stdout transport for Claude/Cursor/Windsurf)
npm run mcp

Connects 16 tools to any MCP-compatible host.

🔗 Quick Links

ClawHub: https://clawhub.ai/alexandre-leng/agentbridge
npm: npm install -g browser-agentbridge-ai
GitHub: https://github.com/alexandre-leng/Agent-Browser-Bridge-AI
Author: Alexandre Leng (@alexandre-leng)

📦 Requirements

Node.js ^18.0.0
Browser: Chromium (via Playwright) OR any Chrome-based browser via CDP fallback
RAM: ~100MB for bridge server + browser process
Playwright: Optional — the CDP fallback works on systems where Playwright isn't supported (e.g., Ubuntu 26.04)

License

MIT-0 — Free to use, modify, and redistribute. No attribution required.