scrape

Security checks across malware telemetry and agentic risk

Overview

This is a documentation-only scraping skill with disclosed external API use, but users need to apply the compliance and data-handling guidance carefully.

Install only if you are comfortable with a scraping helper that may send target URLs and scraped page content to SkillBoss API Hub when using the managed examples. Before use, confirm site permission and terms, avoid login-protected or personal data, protect the API key, and change the robots.txt helper to fail closed or require explicit human override on fetch errors.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (4)

Description-Behavior Mismatch

Medium
Confidence
97% confidence
Finding
The skill claims robots.txt-compliant scraping, but the implementation returns allowed when robots.txt cannot be fetched. That fail-open behavior can cause scraping of sites whose robots policy is temporarily unavailable, blocked, or intentionally inaccessible, undermining the stated compliance guarantee.

Description-Behavior Mismatch

Medium
Confidence
88% confidence
Finding
The manifest describes a scraping skill, but this section adds downstream LLM analysis of scraped content, expanding the capability beyond simple retrieval. Scope creep matters because scraped data may contain sensitive or regulated content that is then transmitted to a model endpoint for secondary processing without clear disclosure or control.

Context-Inappropriate Capability

Medium
Confidence
90% confidence
Finding
The code introduces a general chat/LLM invocation path in a scraping skill, creating a broader and less bounded capability than users would expect from the metadata. This increases risk of unintended external transmission, prompt injection propagation from scraped content, and misuse of the skill for unrelated model access.

External Transmission

Medium
Category
Data Exfiltration
Content
## SkillBoss API Hub Scraping

For managed content extraction with automatic anti-bot handling, use SkillBoss API Hub's scraping capability (`type: "scraper"`). Authenticate with `SKILLBOSS_API_KEY` and call `https://api.skillbossai.com/v1/pilot`. Result is returned at `result.data.markdown`.

For code patterns, robots.txt parser, and SkillBoss API Hub scraping integration, see `code.md`
Confidence
95% confidence
Finding
https://api.skillbossai.com/

VirusTotal

67/67 vendors flagged this skill as clean.

View on VirusTotal