Crawlee Web Scraper

Security checks across malware telemetry and agentic risk

Overview

This is a disclosed web-scraping skill, but it is built to bypass blocking protections and lacks clear limits or authorization safeguards.

Install only if you have a legitimate, authorized scraping use case. Use an isolated environment, pin dependencies, limit targets to approved domains, avoid passing sensitive headers or internal URLs, and do not use it to bypass sites that prohibit automated access.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Behavioral ASTexec() Call, eval() Call, Dynamic Import
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access

Findings (4)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 92% confidence
Finding: The skill exposes significant capabilities—network access, shell execution, and file read/write—yet does not declare any permissions or constraints. This creates a transparency and governance gap: users or orchestrators may invoke the skill without understanding its ability to access local files, write output, or run commands, which increases the risk of misuse or unintended data exposure.

Natural-Language Policy Violations

Medium

Confidence: 95% confidence
Finding: The file description explicitly markets the skill for 'bot-detection evasion' and using Crawlee when normal access is blocked, which encourages bypassing site protections. In an agent skill context, that increases the likelihood of unauthorized scraping, policy violations, and use against targets that have already signaled access should be restricted.

Missing User Warnings

Medium

Confidence: 72% confidence
Finding: This helper automatically sends caller-provided URLs and optional headers over the network and silently retries via a second mechanism designed to evade blocking. In an agent setting, that can cause unintended disclosure of sensitive URLs, tokens in headers, or access to internal resources without clear user awareness or consent.

Missing User Warnings

Medium

Confidence: 82% confidence
Finding: The code automatically launches a secondary scraper specifically for bot-detection evasion when the initial request is blocked. In a security-sensitive agent environment, automatically switching to evasive behavior increases legal, policy, and misuse risk and may bypass operator expectations about how network access is performed.

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal