Install
openclaw skills install xcrawl-scraperXCrawl - AI-Powered Web Scraping API / AI 驱动网页爬虫,支持结构化数据提取
openclaw skills install xcrawl-scraper| 功能 | 说明 |
|---|---|
| 🏷️ 网页爬取 | 支持 Markdown、HTML、JSON、Screenshot |
| 🔍 搜索 | 搜索引擎结果爬取 |
| 🗺️ 网站地图 | 自动发现站点所有页面 |
| 🕷️ 站点爬取 | 批量爬取整个站点 |
| 📊 结构化数据 | JSON Schema 自动提取结构化数据 |
| 🌐 代理支持 | 全球代理可选 |
scripts\install.bat
# 1. 安装依赖
pip install xcrawl
# 2. 配置 API Key
python scripts\xcrawl_scraper.py set-key YOUR_API_KEY
python scripts\xcrawl_scraper.py set-key YOUR_API_KEY
python scripts\xcrawl_scraper.py scrape https://example.com markdown
python scripts\xcrawl_scraper.py scrape https://example.com markdown html links
python scripts\xcrawl_scraper.py scrape https://example.com json "提取产品名称和价格"
python scripts\xcrawl_scraper.py search "web scraping"
python scripts\xcrawl_scraper.py map https://example.com
python scripts\xcrawl_scraper.py crawl https://example.com
| 命令 | 说明 |
|---|---|
scrape <URL> [formats...] | 爬取网页 |
search <query> | 搜索 |
map <URL> | 网站地图 |
crawl <URL> | 站点爬取 |
set-key <API_KEY> | 设置 API Key |
config | 显示配置 |
配置文件: scripts/config.json
{
"apiKey": "YOUR_API_KEY",
"apiUrl": "https://run.xcrawl.com",
"timeout": 60,
"defaultFormats": ["markdown"],
"defaultProxy": ""
}
# Example Domain
This domain is for use in illustrative examples in documents.
{
"product_name": "iPhone 15 Pro",
"price": 999,
"currency": "USD"
}
欢迎提交 Issue 和 Pull Request!
MIT License