Scrapling Fetch

Security checks across malware telemetry and agentic risk

Overview

This is a disclosed web-scraping skill, but it needs Review because it can bypass site protections, forward URLs to third parties, and charge a billing account, and because it contains a code-execution bug that a crafted URL can trigger.

Install only if you intentionally need authorized scraping of public pages and are comfortable with browser automation dependencies. Avoid private, internal, token-bearing, or untrusted URLs; avoid the paid script until the hardcoded API key, explicit charge confirmation, and URL code-injection bug are fixed.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data Exfiltration: External Transmission, Env Variable Harvesting, File System Enumeration
  • Excessive Agency: Unrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • Trigger Abuse: Overly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • Behavioral AST: exec() Call, eval() Call, Dynamic Import
  • Prompt Injection: Instruction Override, Hidden Instructions, Exfiltration Commands
Findings (10)

subprocess module call

Medium
Category
Dangerous Code Execution
Content
print(json.dumps(result, ensure_ascii=False))
"""
    
    result = subprocess.run(
        [VENV_PYTHON, "-c", code],
        capture_output=True,
        text=True,
Confidence
97% confidence
Finding
result = subprocess.run([VENV_PYTHON, "-c", code], capture_output=True, text=True, timeout=120)

subprocess module call

Medium
Category
Dangerous Code Execution
Content
print(json.dumps(result, ensure_ascii=False))
"""
    
    result = subprocess.run(
        [VENV_PYTHON, "-c", code],
        capture_output=True,
        text=True,
Confidence
97% confidence
Finding
result = subprocess.run([VENV_PYTHON, "-c", code], capture_output=True, text=True, timeout=120)

Context-Inappropriate Capability

Medium
Confidence
86% confidence
Finding
The tool explicitly advertises anti-bot bypass and uses a `StealthyFetcher` for sites identified as protected. In an agent skill, this meaningfully increases abuse potential by enabling automated access patterns designed to evade site defenses and consent boundaries.

Context-Inappropriate Capability

High
Confidence
96% confidence
Finding
The scraping path uses a dynamically constructed Python snippet executed in a subprocess, giving the tool a code-execution capability far beyond ordinary fetching. Because the snippet interpolates untrusted URL data, this becomes a realistic code-injection vector rather than merely an architectural smell.
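
One common way to close this kind of injection is to keep the child program text fixed and pass the URL as a separate command-line argument, so it is only ever treated as data. The sketch below illustrates that pattern; it is not the skill's actual code. `CHILD_CODE` and `run_child` are hypothetical names, and `sys.executable` stands in for the skill's `VENV_PYTHON`.

```python
import json
import subprocess
import sys

# Fixed child program (hypothetical). The URL arrives via sys.argv[1] and is
# never spliced into the source text, so quotes or Python syntax inside the
# URL cannot change what the child executes.
CHILD_CODE = """
import json, sys
url = sys.argv[1]
print(json.dumps({"url": url, "length": len(url)}, ensure_ascii=False))
"""


def run_child(url: str, python: str = sys.executable) -> dict:
    """Run the fixed snippet in a subprocess, passing the URL as an argument."""
    result = subprocess.run(
        [python, "-c", CHILD_CODE, url],  # list form: no shell, URL is one argv item
        capture_output=True,
        text=True,
        timeout=120,
        check=True,
    )
    return json.loads(result.stdout)
```

With this shape, a URL containing quote characters or statements such as `import os` is echoed back unchanged instead of being executed.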

Missing User Warnings

Medium
Confidence
89% confidence
Finding
The README explicitly advertises 'automatic charging' and shows a paid invocation flow, but the only spending notice is a price mention elsewhere; there is no clear warning at the point of use. In an agent/skill context, hidden or insufficiently prominent spend implications can lead to unintended charges by users or automated systems invoking the skill.
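
A point-of-use guard is one way to make the charge explicit. The sketch below is an assumption-laden illustration, not the skill's billing code: `require_charge_consent`, `ChargeNotConfirmed`, and the `SCRAPLING_BILLING_KEY` variable name are all hypothetical. It refuses to proceed unless the caller has confirmed the charge, and it reads the API key from the environment rather than from a hardcoded value.

```python
import os
from typing import Mapping


class ChargeNotConfirmed(RuntimeError):
    """Raised when a paid call is attempted without explicit consent."""


def require_charge_consent(
    price_label: str,
    confirmed: bool,
    env: Mapping[str, str] = os.environ,
    key_var: str = "SCRAPLING_BILLING_KEY",  # hypothetical variable name
) -> str:
    """Return the billing key only after the caller has confirmed the charge."""
    if not confirmed:
        raise ChargeNotConfirmed(
            f"This call is billed ({price_label}); pass confirmed=True to proceed."
        )
    api_key = env.get(key_var, "")
    if not api_key:
        # The key must come from the environment; nothing is embedded in code.
        raise RuntimeError(f"Set {key_var} in the environment before paid calls.")
    return api_key
```

An agent wrapper could call this before any paid request and surface the `ChargeNotConfirmed` message to the user instead of charging silently.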

Missing User Warnings

Medium
Confidence
92% confidence
Finding
The documentation includes external billing API calls that transmit a user identifier and API key to a third-party service, but it does not warn about data handling, retention, or privacy implications. In a skill ecosystem, this omission can cause operators to unknowingly expose identifiers or secrets to external infrastructure.

Vague Triggers

Medium
Confidence
94% confidence
Finding
The trigger list includes very broad phrases such as generic web-fetching and access-failure terms, which can cause the skill to be invoked in contexts the user did not specifically intend. In this skill, that risk is amplified because the tool is designed to fetch arbitrary URLs and route requests through anti-bot bypass mechanisms, increasing the chance of unintended third-party requests and data exposure.

Missing User Warnings

Medium
Confidence
97% confidence
Finding
The documentation references Jina Reader and external billing endpoints but does not clearly warn users that requested URLs, fetched content, and possibly identifiers may be transmitted to third-party services. This is dangerous because users may submit sensitive or private URLs under the assumption that processing is local, resulting in unintended disclosure to external providers.

Missing User Warnings

Medium
Confidence
88% confidence
Finding
The code sends `user_id` and billing events to an external service without any visible user notice, consent flow, or minimization in this file. In a skill context, hidden transmission of identifiers to third parties can create privacy, compliance, and trust issues, especially when tied to payment activity.

Missing User Warnings

Medium
Confidence
90% confidence
Finding
The requested browsing target is forwarded to `r.jina.ai`, meaning user-supplied URLs and potentially sensitive research targets are disclosed to an external service. In a fetching skill, undisclosed third-party relaying materially increases privacy and confidentiality risk.
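
Operators who still use the relay can screen URLs before they leave the machine. The following is a minimal sketch under stated assumptions: `is_safe_to_forward` and the `SENSITIVE_PARAMS` list are hypothetical, the parameter names are illustrative rather than exhaustive, and no DNS resolution is performed, so a public-looking hostname that resolves to a private address would still pass.

```python
import ipaddress
from urllib.parse import parse_qsl, urlsplit

# Illustrative, non-exhaustive list of query parameters that often carry secrets.
SENSITIVE_PARAMS = {
    "token", "access_token", "api_key", "apikey",
    "key", "secret", "password", "signature",
}


def is_safe_to_forward(url: str) -> bool:
    """Heuristic check run before a URL is handed to a third-party relay."""
    try:
        parts = urlsplit(url)
        host = parts.hostname
    except ValueError:
        return False  # malformed URL
    if parts.scheme not in ("http", "https") or not host:
        return False
    if parts.username or parts.password:
        return False  # credentials embedded in the URL
    host = host.lower()
    if host == "localhost" or host.endswith((".local", ".internal", ".localhost")):
        return False
    try:
        ip = ipaddress.ip_address(host)
    except ValueError:
        ip = None  # a hostname, not an IP literal
    if ip is not None and not ip.is_global:
        return False  # loopback, private, link-local, and similar ranges
    for name, _ in parse_qsl(parts.query, keep_blank_values=True):
        if name.lower() in SENSITIVE_PARAMS:
            return False
    return True
```

A check like this reduces accidental disclosure of internal or token-bearing URLs, but it does not change the fact that every URL that passes is still visible to the external service.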

VirusTotal

65/65 vendors flagged this skill as clean.
