Install
openclaw skills install scrapling-ai
Use Scrapling to scrape websites with adaptive parsing, Cloudflare bypass, and MCP support. Handles dynamic content, anti-bot detection, and provides clean H...
Use the scrapling CLI to scrape websites with adaptive parsing and anti-bot bypass.
✅ USE this skill when:
❌ DON'T use this skill when:
web_fetch
browser tool
# Install CLI
pipx install scrapling
scrapling --version
# Get clean HTML
scrapling https://example.com -o html
# Get JSON structure
scrapling https://example.com -o json
# Save to file
scrapling https://example.com -o output.html
# Custom headers
scrapling https://example.com --headers "User-Agent: Mozilla/5.0"
# Timeout (seconds)
scrapling https://slow-site.com --timeout 30
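For slow or flaky sites, the timeout flag above can be paired with a retry loop. The following is a minimal sketch, not part of Scrapling: `retry` is a hypothetical helper name, and it assumes the `scrapling` command exits with a non-zero status when a request fails or times out.

```shell
#!/usr/bin/env bash
# retry MAX_ATTEMPTS INITIAL_DELAY_SECONDS COMMAND [ARGS...]
# Hypothetical helper (not part of Scrapling): reruns COMMAND until it exits 0,
# doubling the delay after each failed attempt.
retry() {
  local max_attempts="$1" delay="$2"
  shift 2
  local attempt=1
  until "$@"; do
    if [ "$attempt" -ge "$max_attempts" ]; then
      echo "giving up after $attempt attempts: $*" >&2
      return 1
    fi
    sleep "$delay"
    delay=$((delay * 2))
    attempt=$((attempt + 1))
  done
  return 0
}

# Usage (assumes scrapling exits non-zero on failure):
#   retry 3 2 scrapling https://slow-site.com --timeout 30
```

With the values in the usage comment, the command is tried up to three times, waiting 2 and then 4 seconds between attempts.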
# XPath extraction
scrapling https://example.com -e "//div[@class='content']" -o html
# CSS selector
scrapling https://example.com -e "div.content" -o html
# Extract title, meta description
scrapling https://example.com \
--fields 'title,meta_description' \
-o json
Scrapling supports MCP (Model Context Protocol) for AI agents:
# Start MCP server
scrapling mcp start
Then configure your agent to use the scrape tool via MCP.
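How the agent is configured depends on the MCP client. As a rough sketch, many clients that launch servers over stdio read a JSON file with a `command` and `args` entry per server. The `mcpServers` layout, the `mcp.json` file name, the `MCP_CONFIG_FILE` variable, and the `scrapling` server label below are all assumptions, so check your agent's documentation for the actual location and schema.

```shell
#!/usr/bin/env bash
# Write a hypothetical MCP client config that launches `scrapling mcp start`.
# The "mcpServers" layout and the file name are assumptions, not taken from
# Scrapling's documentation.
config_file="${MCP_CONFIG_FILE:-./mcp.json}"

cat > "$config_file" <<'EOF'
{
  "mcpServers": {
    "scrapling": {
      "command": "scrapling",
      "args": ["mcp", "start"]
    }
  }
}
EOF

echo "Wrote $config_file"
```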
# Scrape a news article as JSON
scrapling https://example.com/news/article-123 \
--fields 'title,author,publish_date,content' \
-o json
# Scrape product listings
scrapling https://shop.example.com/products \
-e "//div[@class='product']" \
-o html
# Scrapling auto-bypasses most protections
scrapling https://protected-site.com -o html
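The article command above can be extended to a list of URLs. This sketch is illustrative only: `slugify_url`, `scrape_articles`, and the `SCRAPLING_RUN` variable are hypothetical names, and it assumes `-o json` writes the JSON to standard output so it can be redirected to a file. By default it only prints the commands it would run.

```shell
#!/usr/bin/env bash
# Hypothetical helpers, not part of Scrapling.

# Turn a URL into a file-system-safe name: strip the scheme and one trailing
# slash, then replace every character outside [A-Za-z0-9.-] with "_".
slugify_url() {
  local url="${1#*://}"
  url="${url%/}"
  printf '%s\n' "$url" | sed 's/[^A-Za-z0-9.-]/_/g'
}

# Print one scrapling command per URL (dry run). Set SCRAPLING_RUN=1 to execute.
# Assumption: `-o json` writes the JSON to stdout so it can be redirected.
scrape_articles() {
  local fields='title,author,publish_date,content'
  local url out
  for url in "$@"; do
    out="$(slugify_url "$url").json"
    if [ "${SCRAPLING_RUN:-0}" = "1" ]; then
      scrapling "$url" --fields "$fields" -o json > "$out"
    else
      printf "scrapling %s --fields '%s' -o json > %s\n" "$url" "$fields" "$out"
    fi
  done
}

scrape_articles https://example.com/news/article-123 https://example.com/news/article-124
```

Keeping the dry run as the default makes it easy to review the generated commands and file names before sending any requests.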
Example JSON output:
{
"title": "Page Title",
"meta_description": "Description text",
"content": "<clean HTML>",
"links": ["http://...", "..."],
"images": [{"src": "...", "alt": "..."}]
}