Scraper

Pass. Audited by ClawScan on May 1, 2026.

Overview

This is a straightforward scraping helper for user-authorized pages that fetches URLs and saves cleaned text locally, with its main risks clearly disclosed.

This skill appears safe for its stated purpose. Before installing, be aware that it can fetch supplied URLs and save cleaned page contents locally, so use it only for public or authorized pages and avoid storing sensitive content unless you intend to keep it in the local OpenClaw workspace.

Findings (2)

Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.

Low

#ASI02: Tool Misuse and Exploitation

What this means

The agent can make web requests to URLs it is given, which may contact sites or services on the user's behalf.

Why it was flagged

The script fetches a user-supplied URL directly. This is expected for a scraper, but users should ensure the URL is public or authorized because the code does not itself enforce that boundary.

Skill content

parser.add_argument("--url", required=True, help="Public URL to fetch") ... urllib.request.urlopen(req, timeout=20)

Recommendation

Use only with public or explicitly authorized URLs, and do not use it to get around logins, paywalls, captchas, robots.txt restrictions, or rate limits.
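
Because the script does not enforce the public-or-authorized boundary itself, a caller could add a check before passing a URL to it. The following is a minimal sketch, not part of the audited skill: `ALLOWED_HOSTS` and `is_allowed_url` are hypothetical names, and the host list is a placeholder that a user would replace with sites they are authorized to fetch.

```python
from urllib.parse import urlsplit

# Hypothetical allowlist; the audited skill does not define one.
ALLOWED_HOSTS = frozenset({"example.com", "docs.example.com"})


def is_allowed_url(url: str, allowed_hosts=ALLOWED_HOSTS) -> bool:
    """Return True only for http(s) URLs whose host is on the allowlist."""
    parts = urlsplit(url)
    # Reject non-web schemes such as file:// or ftp://.
    if parts.scheme not in ("http", "https"):
        return False
    # .hostname drops any userinfo and port and is already lowercased.
    host = parts.hostname or ""
    return host in allowed_hosts
```

A wrapper would call `is_allowed_url(url)` and skip the fetch when it returns `False`. This only restricts which hosts are contacted; it does not check robots.txt or apply rate limiting.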

Low

#ASI06: Memory and Context Poisoning

What this means

Scraped text and source URLs may persist locally and be visible to later workflows that inspect this workspace.

Why it was flagged

The skill stores extracted page content and job metadata persistently in a local OpenClaw memory/workspace directory. This is disclosed and purpose-aligned, but saved content can remain available after the task.

Skill content

All outputs are stored locally under: `~/.openclaw/workspace/memory/scraper/jobs.json` ... `~/.openclaw/workspace/memory/scraper/output/`

Recommendation

Avoid saving sensitive page contents unless you are comfortable keeping them locally, and delete the scraper memory/output directory when the saved data is no longer needed.
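
Cleanup can be scripted. The sketch below assumes the scraper data lives under `~/.openclaw/workspace/memory/scraper/`, as the quoted skill content states; `clear_scraper_data` and `DEFAULT_SCRAPER_DIR` are hypothetical names and are not provided by the skill.

```python
import shutil
from pathlib import Path

# Location taken from the skill content quoted in this finding.
DEFAULT_SCRAPER_DIR = Path.home() / ".openclaw" / "workspace" / "memory" / "scraper"


def clear_scraper_data(scraper_dir: Path = DEFAULT_SCRAPER_DIR) -> bool:
    """Delete the scraper directory tree; return True if anything was removed."""
    scraper_dir = Path(scraper_dir)
    if not scraper_dir.is_dir():
        # Nothing saved, or already cleaned up.
        return False
    # Removes jobs.json and the output/ directory along with the parent.
    shutil.rmtree(scraper_dir)
    return True
```

Deletion is permanent, so copy anything worth keeping out of the directory before calling `clear_scraper_data()`.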