Install
openclaw skills install firecrawl-ws

AI-native web scraping, crawling, domain mapping, and structured extraction. Use for converting websites into LLM-ready Markdown, scraping pages with dynamic JS/actions, crawling full domains, or extracting schema-based structured JSON.

This skill extends Manus with the capability to search, scrape, crawl, and extract structured data from any website using Firecrawl [1] [2].

Scrape (/scrape)
Use when you need the text content, Markdown, HTML, or screenshots of a specific webpage [2].
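
As a minimal sketch of a /scrape request body, the snippet below assembles the fields this skill names (url, formats, includeTags, excludeTags, onlyMainContent) into a JSON-ready dictionary. The helper name `build_scrape_payload` is hypothetical, and sending the body (base URL, API version, authentication) is deliberately left out because those details are not given here.

```python
import json


def build_scrape_payload(url, formats=("markdown",), include_tags=None,
                         exclude_tags=None, only_main_content=True):
    """Assemble a JSON body for a /scrape request.

    Hypothetical helper: the field names come from this skill's text,
    everything else (defaults, function name) is an illustration.
    """
    payload = {
        "url": url,
        "formats": list(formats),
        "onlyMainContent": only_main_content,
    }
    # Only include the tag filters when the caller supplied them.
    if include_tags:
        payload["includeTags"] = list(include_tags)
    if exclude_tags:
        payload["excludeTags"] = list(exclude_tags)
    return payload


if __name__ == "__main__":
    body = build_scrape_payload(
        "https://example.com",
        formats=("markdown", "screenshot"),
        exclude_tags=("nav", "footer"),
    )
    print(json.dumps(body, indent=2))
```

Keeping payload construction separate from the HTTP call makes the request easy to inspect and test before any credits are spent.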
- Specify formats (e.g., ["markdown"] or ["markdown", "screenshot"]) [2].
- Filter the returned content with includeTags or excludeTags [2].

Crawl (/crawl)
Use when you need to recursively discover and scrape all pages under a specific domain or path [1] [2].
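
The crawl workflow in this section (bounded request, then polling by jobId) might be sketched as follows. This is an illustration under assumptions: `build_crawl_payload`, `poll_until_completed`, and the `fetch_status` callable are hypothetical names, and the status strings `"completed"` and `"failed"` are assumed values rather than a confirmed response schema.

```python
import time


def build_crawl_payload(url, max_depth=2, limit=50, formats=("markdown",)):
    """Assemble a /crawl request body with depth and page limits."""
    return {
        "url": url,
        "maxDepth": max_depth,   # bound recursion depth
        "limit": limit,          # bound the number of pages (and credits)
        "scrapeOptions": {"formats": list(formats)},
    }


def poll_until_completed(fetch_status, job_id, interval_s=2.0,
                         max_polls=30, sleep=time.sleep):
    """Poll a crawl job until it completes.

    fetch_status is a hypothetical callable: it takes the job id and
    returns a dict with a "status" key (assumed values noted above).
    """
    for _ in range(max_polls):
        result = fetch_status(job_id)
        status = result.get("status")
        if status == "completed":
            return result
        if status == "failed":
            raise RuntimeError(f"crawl job {job_id} failed")
        sleep(interval_s)  # wait before asking again
    raise TimeoutError(f"crawl job {job_id} did not complete in time")
```

Passing `sleep` and `fetch_status` in as arguments keeps the loop independent of any particular HTTP client and lets it be exercised without network access.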
- Provide the starting url [2].
- Set a depth limit (maxDepth) and a page limit (limit) to control token and credit usage [2].
- Use scrapeOptions to ensure each crawled page is parsed with the correct format (e.g., Markdown only) [2].
- Poll the job by its jobId until it is completed [2].

Map (/map)
Use when you need to quickly discover all URLs belonging to a domain without scraping page content [1] [2].
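
A /map request body, plus a client-side filter over the discovered URLs, could look roughly like this. The helper names are hypothetical, and the snippet assumes the map result has already been reduced to a plain list of URL strings; the actual response shape is not described in this skill.

```python
from urllib.parse import urlparse


def build_map_payload(url, search=None, include_subdomains=False):
    """Assemble a /map request body from the fields this skill names."""
    payload = {"url": url, "includeSubdomains": include_subdomains}
    if search:
        payload["search"] = search  # server-side keyword/path filter
    return payload


def filter_by_path_prefix(urls, prefix):
    """Keep only URLs whose path starts with the given prefix.

    Assumes `urls` is a plain list of strings taken from a map result.
    """
    return [u for u in urls if urlparse(u).path.startswith(prefix)]
```

Filtering the mapped URLs first and scraping only the survivors is one way to avoid paying for a full crawl when a small subset of pages is needed.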
- Provide the target url [2].
- Use the search filter to return only URLs matching a specific keyword or path [2].
- Set includeSubdomains to true if you need subdomain discovery [2].

Extract (/extract)
Use when you need to parse raw web pages and extract structured JSON data conforming to a specific schema [3].
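
A hedged sketch of an /extract request follows: it pairs urls and a natural language prompt with a JSON Schema describing the desired output. The `schema` field name, the helper names, and the example product schema are assumptions made for illustration; consult the API reference before relying on them.

```python
def build_extract_payload(urls, prompt, schema=None):
    """Assemble an /extract request body.

    "urls" and "prompt" come from this skill's text; the "schema" key is
    an assumed field name for passing a JSON Schema.
    """
    payload = {"urls": list(urls), "prompt": prompt}
    if schema is not None:
        payload["schema"] = schema
    return payload


def missing_required(record, schema):
    """Return the required schema keys that are absent from a record."""
    return [key for key in schema.get("required", []) if key not in record]


# Hypothetical schema for a product page.
PRODUCT_SCHEMA = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "price": {"type": "number"},
    },
    "required": ["name", "price"],
}
```

A cheap check such as `missing_required` on each returned record can catch incomplete extractions before they reach downstream code.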
- Provide urls and a natural language extraction prompt [3].

For comprehensive API parameters, SDK code templates, and configuration options, read the following reference files:
- references/api_reference.md for complete endpoint request/response schemas, Python SDK templates, and TypeScript/Zod snippets.
- references/self_hosting.md for production-ready Docker Compose configurations, environment variables, and scaling guidelines.

Best practices
- Set onlyMainContent: true to strip out navigation bars, headers, and footers. This reduces downstream LLM token costs and keeps context windows clean [2].
- Use /map before /crawl if you only need to discover pages or filter specific URLs to scrape. Mapping is significantly faster and cheaper than a full crawl [1] [2].
- Retry on rate limits (429) or transient server errors (5xx) to ensure scraping resiliency [4].
- Avoid fixed waitFor delays when scraping dynamic content. Instead, use selector-based waits (e.g., {"type": "wait", "selector": "#loaded-element"}) to minimize request latency [2].
- Use the /crawl endpoint and poll for results or use webhooks [2].

[1] Firecrawl Homepage, "The API to search, scrape, and interact with the web at scale." URL: https://github.com/firecrawl/firecrawl
[2] Firecrawl Documentation, "Advanced Scraping Guide." URL: https://docs.firecrawl.dev/advanced-scraping-guide
[3] Firecrawl Documentation, "Agent Endpoint." URL: https://docs.firecrawl.dev/features/agent
[4] Firecrawl Documentation, "Rate Limits." URL: https://docs.firecrawl.dev/rate-limits
[5] Firecrawl GitHub Repository, "Self-hosting Firecrawl Guide." URL: https://raw.githubusercontent.com/firecrawl/firecrawl/main/SELF_HOST.md
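
The guidance on handling rate limits (429) and transient server errors (5xx) can be illustrated with a small retry helper. Exponential backoff is used here as one common strategy; this skill does not specify the retry policy, so the delays, attempt count, and the `send` callable (a hypothetical function returning a status code and body) are all assumptions.

```python
import time


def is_retryable(status):
    """True for rate limiting (429) and server errors (5xx)."""
    return status == 429 or 500 <= status <= 599


def call_with_retries(send, max_attempts=5, base_delay_s=1.0, sleep=time.sleep):
    """Call send() until it returns a non-retryable status.

    send is a hypothetical zero-argument callable returning
    (status_code, body). Delays double after each failed attempt.
    """
    for attempt in range(max_attempts):
        status, body = send()
        if not is_retryable(status):
            return status, body
        if attempt < max_attempts - 1:
            sleep(base_delay_s * 2 ** attempt)  # 1s, 2s, 4s, ...
    # Out of attempts: hand the last response back to the caller.
    return status, body
```

Returning the last response instead of raising lets the caller decide whether an exhausted retry budget is fatal.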