Apify Scraper Skill
Use this skill when you need to scrape content from sites that block bots — Twitter/X threads, Reddit, LinkedIn, YouTube comments, Google SERP, Amazon, Product Hunt, etc.
When to Use
- A Twitter/X URL is shared and you need the full thread (not just the first tweet)
- You need Reddit thread content without the expensive API
- LinkedIn company/profile data is needed
- YouTube comments or video metadata beyond what the API gives
- Google search results programmatically
- Any site that blocks standard web_fetch
Do NOT use for: sites accessible via normal web_fetch or web_search. Apify costs credits — use it only when needed.
Setup
- API Key:
op://OpenClaw/Apify API Credentials/credential (also in gateway plist as APIFY_API_KEY)
- Dashboard: https://console.apify.com (account: redditech)
- Plan: FREE ($5/mo credit)
- Script:
python3 scripts/apify-run.py <actor> <input_json>
Running an Actor
python3 /Users/loki/.openclaw/workspace/scripts/apify-run.py \
"apidojo/tweet-scraper" \
'{"twitterHandles": ["solanamobile"], "maxItems": 50}'
Key Actors
Twitter/X
apidojo/tweet-scraper — $0.40/1K tweets. Full thread support via conversationIds. Advanced search syntax.
{"conversationIds": ["2034675043033375103"], "maxItems": 50}
or by handle:
{"twitterHandles": ["solanamobile"], "maxItems": 20}
Reddit
YouTube
Google SERP
LinkedIn
anchor/linkedin-profile-scraper — ⚠️ ToS risk. Use sparingly for research only.
Pricing Reference
- 1 CU = 1 GB RAM × 1 hour
- Free tier: $5/mo (~16.7 CU)
- Tweet scraping:
0.035–0.04 CU/1K tweets ($0.01/1K on free tier)
- Some actors charge flat per-result: $0.25–$0.40/1K tweets
- Check usage: https://console.apify.com/billing
Notes
- Results are returned as a dataset — the script polls until complete
- Timeout: 5 minutes default (most actors finish in 30–60s)
- If an actor breaks (community-maintained), check Apify Store for alternatives
- MCP integration pending — Apify MCP server exists but openclaw.json doesn't support
mcpServers key yet (schema validation rejects it). Use this script approach instead.