Install
openclaw skills install desearch-crawlCrawl/scrape and extract content from any webpage URL. Returns the page content as clean text or raw HTML. Use this when you need to read the full contents of a specific web page.
openclaw skills install desearch-crawlExtract content from any webpage URL. Returns clean text or raw HTML.
export DESEARCH_API_KEY='your-key-here'# Crawl a webpage (returns clean text by default)
scripts/desearch.py crawl "https://en.wikipedia.org/wiki/Artificial_intelligence"
# Get raw HTML
scripts/desearch.py crawl "https://example.com" --crawl-format html
| Option | Description |
|---|---|
--crawl-format | Output content format: text (default) or html |
scripts/desearch.py crawl "https://docs.python.org/3/tutorial/index.html"
scripts/desearch.py crawl "https://example.com/page" --crawl-format html
format=text, truncated, default)Artificial intelligence (AI) is the capability of computational systems to perform tasks that typically require human intelligence, such as learning, reasoning, problem-solving, perception, and decision-making...
format=html, truncated)<!DOCTYPE html>
<html>
<head><title>Artificial intelligence - Wikipedia</title></head>
<body>
<p>Artificial intelligence (AI) is the capability of computational systems...</p>
</body>
</html>
text. Use --crawl-format html only when you need to inspect page structure.text format to avoid bloating the agent context with markup.Status 401, Unauthorized (e.g., missing/invalid API key)
{
"detail": "Invalid or missing API key"
}
Status 402, Payment Required (e.g., balance depleted)
{
"detail": "Insufficient balance, please add funds to your account to continue using the service."
}