Install
openclaw skills install @dinghaibin/scraper-proExtract data from websites and APIs for analysis. Use when user needs to collect product prices from e-commerce sites, gather news articles, extract structured data from web pages, build datasets from public sources, or automate data collection for research.
openclaw skills install @dinghaibin/scraper-proExtract structured data from websites and APIs.
# Basic page scrape
python scripts/scrape.py https://example.com --output data.json
python scripts/scrape.py [OPTIONS]
Options:
--url TEXT URL to scrape (required)
--selector TEXT CSS selector for data extraction
--output PATH Output file path
--format FORMAT Output format: json, csv, markdown
--limit NUM Maximum items to scrape
--wait SECS Wait between requests
--login URL Login URL for authenticated scraping
python scripts/scrape.py \
--url "https://example.com/products" \
--selector ".product" \
--output prices.json \
--format json
python scripts/scrape.py \
--url "https://news.example.com/latest" \
--selector "article" \
--output news.md \
--format markdown
Create scrape.yaml for complex scraping:
url: https://example.com/products
selectors:
items: ".product-card"
title: ".product-title"
price: ".price::text"
image: "img::attr(src)"
link: "a::attr(href)"
pagination:
type: click
button: ".next-page"
max_pages: 10
output:
format: json
file: products.json
Ensure you have permission to scrape target websites. Check Terms of Service and robots.txt.