Decodo Scraper

v1.1.0

Search Google, scrape web pages and Amazon product pages, fetch YouTube subtitles, or pull Reddit posts and subreddits with the Decodo Scraper OpenClaw Skill.


Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for donatasdecodo/decodo-scraper.

Prompt preview: Install & Setup
Install the skill "Decodo Scraper" (donatasdecodo/decodo-scraper) from ClawHub.
Skill page: https://clawhub.ai/donatasdecodo/decodo-scraper
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Canonical install target

openclaw skills install donatasdecodo/decodo-scraper

ClawHub CLI


npx clawhub@latest install decodo-scraper
Security Scan
VirusTotal: Benign
OpenClaw: Benign (high confidence)
Purpose & Capability
The name/description, README, SKILL.md, and the included Python tool consistently describe and implement a Decodo-backed web scraper (Google, Amazon, YouTube subtitles, Reddit, universal pages). The declared runtime requirement (DECODO_AUTH_TOKEN) is appropriate. One small inconsistency: the registry metadata at the top of the report lists no required env vars or primary credential, but SKILL.md and the script both require DECODO_AUTH_TOKEN.
Instruction Scope
SKILL.md and tools/scrape.py only instruct the agent to read a single DECODO_AUTH_TOKEN (or a .env file in the repo root), construct JSON payloads, and POST to scraper-api.decodo.com. The script does not read unrelated system files, other environment variables, or modify other skills or system settings.
Install Mechanism
There is no install spec (the skill is instruction-only), and the included requirements.txt lists only requests and python-dotenv. No arbitrary external downloads, obscure URLs, or archive extraction are present. The user must install the Python dependencies themselves (pip install -r requirements.txt).
Credentials
The only secret required is DECODO_AUTH_TOKEN (a Basic auth token sent in the Authorization header), which is proportional to the task. Note that the registry metadata does not list this required env var while SKILL.md and the code do; this mismatch should be reconciled before trusting the registry entry.
Persistence & Privilege
Skill is not always-enabled and is user-invocable. It does not request persistent elevated privileges, does not modify other skills or global agent configs, and does not request unrelated credentials.
Assessment
This skill implements a Decodo-hosted web-scraping client and requires a Decodo auth token (DECODO_AUTH_TOKEN). Before installing:

1. Verify you trust Decodo (https://decodo.com), because requests and scraped content are sent to scraper-api.decodo.com using your token.
2. Do not store the token in a committed .env file in a public repo; keep it in your environment or a secure secret store.
3. Reconcile the registry metadata mismatch (the registry claims no env vars while SKILL.md and the script require DECODO_AUTH_TOKEN).
4. Install dependencies locally (pip install -r requirements.txt) and inspect the code if you plan to run it in an environment with sensitive data.
5. If you need to prevent autonomous network calls, consider disabling model invocation or invoke the skill only manually.

Overall the skill appears coherent for its described purpose, but proceed only if you trust the external Decodo service and follow best practices for handling the token.

Like a lobster shell, security has layers — review code before you run it.

latest: vk978a29t9sqhf2g34x6xnt5ydd81fnwj
1.5k downloads · 10 stars · 3 versions
Updated 1mo ago
v1.1.0
MIT-0

Decodo Scraper OpenClaw Skill

Use this skill to search Google, scrape any URL, fetch YouTube subtitles, or pull Reddit content via the Decodo Web Scraping API. Search outputs a JSON object of result sections; Scrape URL outputs plain Markdown; Amazon and Amazon search output parsed product-page or search results as JSON (Amazon search takes --query); YouTube subtitles outputs the transcript; Reddit post and Reddit subreddit output post/listing content as JSON.

Authentication: Set DECODO_AUTH_TOKEN (Basic auth token from Decodo Dashboard → Scraping APIs) in your environment or in a .env file in the repo root.
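For orientation, here is a minimal sketch of the request flow the script appears to implement, assuming the usual python-dotenv/requests pattern; the endpoint path and payload field names are assumptions based on this README, not verified API documentation:

```python
# Minimal sketch of the presumed request flow (endpoint path and payload
# field names are assumptions; only the scraper-api.decodo.com host and the
# Basic Authorization header are documented above).
import os

import requests
from dotenv import load_dotenv

load_dotenv()  # picks up DECODO_AUTH_TOKEN from a .env file in the repo root
token = os.environ["DECODO_AUTH_TOKEN"]

response = requests.post(
    "https://scraper-api.decodo.com/v2/scrape",  # hypothetical path
    headers={"Authorization": f"Basic {token}"},
    json={"target": "google_search", "query": "best laptops 2025"},
    timeout=60,
)
response.raise_for_status()
print(response.json())
```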

Errors: On failure the script writes a JSON error to stderr and exits with code 1.
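If you drive the tool from another script, you can surface those failures explicitly. A sketch, assuming nothing about the error object beyond it being JSON:

```python
# Sketch: invoke the tool and report its JSON error output (exit code 1 on failure).
import json
import subprocess
import sys

proc = subprocess.run(
    [sys.executable, "tools/scrape.py",
     "--target", "universal", "--url", "https://example.com"],
    capture_output=True,
    text=True,
)
if proc.returncode != 0:
    try:
        error = json.loads(proc.stderr)  # the script writes a JSON error to stderr
    except json.JSONDecodeError:
        error = {"raw": proc.stderr}
    print("scrape failed:", error)
else:
    print(proc.stdout)
```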


Tools

1. Search Google

Use this to find URLs, answers, or structured search results. The API returns a JSON object whose results key contains several sections (not all may be present for every query):

| Section | Description |
| --- | --- |
| organic | Main search results (titles, links, snippets). |
| ai_overviews | AI-generated overviews or summaries when Google shows them. |
| paid | Paid/sponsored results (ads). |
| related_questions | “People also ask”–style questions and answers. |
| related_searches | Suggested related search queries. |
| discussions_and_forums | Forum or discussion results (e.g. Reddit, Stack Exchange). |

The script outputs only the inner results object (these sections); pagination info (page, last_visible_page, parse_status_code) is not included.

Command:

python3 tools/scrape.py --target google_search --query "your search query"

Examples:

python3 tools/scrape.py --target google_search --query "best laptops 2025"
python3 tools/scrape.py --target google_search --query "python requests tutorial"

Optional: --geo us or --locale en for location/language.
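To consume the output programmatically, pipe it into a small reader. The sketch below assumes each organic entry exposes title and link keys, as the table above suggests; inspect the actual JSON first, since the exact field names are not documented here:

```python
# Sketch: read the search JSON from stdin and list organic results.
# Usage:
#   python3 tools/scrape.py --target google_search --query "laptop" | python3 list_organic.py
# list_organic.py is a hypothetical helper; "title"/"link" keys are assumptions.
import json
import sys

results = json.load(sys.stdin)
for item in results.get("organic", []):
    print(item.get("title"), "->", item.get("link"))
```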


2. Scrape URL

Use this to get the content of a specific web page. By default the API returns content as Markdown (cleaner for LLMs and lower token usage).

Command:

python3 tools/scrape.py --target universal --url "https://example.com"

Examples:

python3 tools/scrape.py --target universal --url "https://example.com"
python3 tools/scrape.py --target universal --url "https://news.ycombinator.com/"
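Because the output is plain Markdown on stdout, saving it for reuse is a simple shell redirect (python3 tools/scrape.py --target universal --url "https://example.com" > page.md) or, from Python, a short wrapper like this sketch:

```python
# Sketch: capture the Markdown output of the universal target and save it.
import subprocess
import sys

proc = subprocess.run(
    [sys.executable, "tools/scrape.py",
     "--target", "universal", "--url", "https://example.com"],
    capture_output=True, text=True, check=True,
)
with open("page.md", "w", encoding="utf-8") as fh:
    fh.write(proc.stdout)  # the universal target emits Markdown on stdout
```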

3. Amazon product page

Use this to get parsed data from an Amazon product page (or another Amazon page). Pass the product page URL as --url. The script sends parse: true and outputs the inner results object (e.g. ads, product details).

Command:

python3 tools/scrape.py --target amazon --url "https://www.amazon.com/dp/PRODUCT_ID"

Examples:

python3 tools/scrape.py --target amazon --url "https://www.amazon.com/dp/B09H74FXNW"

4. Amazon search

Use this to search Amazon and get parsed results (search results list, delivery_postcode, etc.). Pass the search query as --query.

Command:

python3 tools/scrape.py --target amazon_search --query "your search query"

Examples:

python3 tools/scrape.py --target amazon_search --query "laptop"

5. YouTube subtitles

Use this to get subtitles/transcript for a YouTube video. Pass the video ID (e.g. from youtube.com/watch?v=VIDEO_ID) as --query.

Command:

python3 tools/scrape.py --target youtube_subtitles --query "VIDEO_ID"

Examples:

python3 tools/scrape.py --target youtube_subtitles --query "dFu9aKJoqGg"
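If you start from a full watch URL rather than a bare ID, extracting the v parameter takes a few lines; parse_video_id below is a hypothetical helper, not part of the skill:

```python
# Sketch: pull the video ID out of a standard youtube.com/watch URL.
from urllib.parse import parse_qs, urlparse

def parse_video_id(watch_url: str) -> str:
    return parse_qs(urlparse(watch_url).query)["v"][0]

print(parse_video_id("https://www.youtube.com/watch?v=dFu9aKJoqGg"))  # dFu9aKJoqGg
```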

6. Reddit post

Use this to get the content of a Reddit post (thread). Pass the full post URL as --url.

Command:

python3 tools/scrape.py --target reddit_post --url "https://www.reddit.com/r/SUBREDDIT/comments/ID/..."

Examples:

python3 tools/scrape.py --target reddit_post --url "https://www.reddit.com/r/nba/comments/17jrqc5/serious_next_day_thread_postgame_discussion/"

7. Reddit subreddit

Use this to get the listing (posts) of a Reddit subreddit. Pass the subreddit URL as --url.

Command:

python3 tools/scrape.py --target reddit_subreddit --url "https://www.reddit.com/r/SUBREDDIT/"

Examples:

python3 tools/scrape.py --target reddit_subreddit --url "https://www.reddit.com/r/nba/"

Summary

| Action | Target | Argument | Example command |
| --- | --- | --- | --- |
| Search | google_search | --query | python3 tools/scrape.py --target google_search --query "laptop" |
| Scrape page | universal | --url | python3 tools/scrape.py --target universal --url "https://example.com" |
| Amazon product | amazon | --url | python3 tools/scrape.py --target amazon --url "https://www.amazon.com/dp/B09H74FXNW" |
| Amazon search | amazon_search | --query | python3 tools/scrape.py --target amazon_search --query "laptop" |
| YouTube subtitles | youtube_subtitles | --query | python3 tools/scrape.py --target youtube_subtitles --query "dFu9aKJoqGg" |
| Reddit post | reddit_post | --url | python3 tools/scrape.py --target reddit_post --url "https://www.reddit.com/r/nba/comments/17jrqc5/..." |
| Reddit subreddit | reddit_subreddit | --url | python3 tools/scrape.py --target reddit_subreddit --url "https://www.reddit.com/r/nba/" |

Output: Search → JSON (sections). Scrape URL → markdown. Amazon / Amazon search → JSON (results e.g. ads, product info, delivery_postcode). YouTube → transcript. Reddit → JSON (content).
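To exercise several targets in one pass, a small driver loop over (target, flag, value) tuples mirrors the table above; this is a sketch, not part of the skill:

```python
# Sketch: run a few targets back to back, mirroring the summary table.
import subprocess
import sys

JOBS = [
    ("google_search", "--query", "laptop"),
    ("universal", "--url", "https://example.com"),
    ("reddit_subreddit", "--url", "https://www.reddit.com/r/nba/"),
]

for target, flag, value in JOBS:
    proc = subprocess.run(
        [sys.executable, "tools/scrape.py", "--target", target, flag, value],
        capture_output=True, text=True,
    )
    print(target, "ok" if proc.returncode == 0 else "failed")
```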
