Scrapling MCP

Security checks across static analysis, malware telemetry, and agentic risk

Overview

This is a coherent web-scraping guidance skill, but it includes powerful anti-bot, crawling, and local persistence features that should be used only on authorized targets.

This skill appears aligned with its web-scraping purpose and does not show hidden exfiltration or destructive behavior. Before installing, be comfortable with installing Scrapling and Playwright, and use the anti-bot, proxy, and spider features only for authorized scraping with clear limits on target sites, concurrency, and retained crawl data.

Static analysis

Static analysis findings are pending for this release.

VirusTotal

VirusTotal findings are pending for this skill version.

View on VirusTotal

Risk analysis

Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.

What this means

If used carelessly, the agent could scrape protected or rate-limited websites in ways that violate site rules or create operational risk.

Why it was flagged

The skill exposes anti-bot scraping functionality through MCP. This is clearly related to the stated scraping purpose, but it is a capability users should restrict to authorized targets.

Skill content
### fetch_stealthy
Anti-bot fetch with Cloudflare bypass.
Recommendation

Use these tools only for sites you own, have permission to test, or are allowed to scrape; add human review for stealth, proxy, and large-crawl actions.

What this means

Installing external packages may introduce dependency or version risk if done in a sensitive environment.

Why it was flagged

The setup uses external package and browser installs. These are expected for Scrapling and Playwright, but the artifacts do not pin versions or provide a registry install spec.

Skill content
pip install scrapling[mcp,playwright]
python -m playwright install chromium
Recommendation

Install in a virtual environment, verify the package source, and pin versions if you need reproducible or production-safe use.

What this means

Local crawl data, checkpoints, or adaptive selector state could retain information longer than expected.

Why it was flagged

The skill documents persistent crawl state. This is useful and purpose-aligned, but stored crawl output or checkpoints may contain scraped content or state reused across runs.

Skill content
Pause/Resume: `crawldir` parameter saves checkpoints
Recommendation

Store crawl data in a known project directory, avoid scraping sensitive personal or private data, and delete checkpoints when they are no longer needed.