Webapp Testing Anthropic

Security checks across malware telemetry and agentic risk

Overview

This webapp-testing skill appears useful, but it needs review because it can run local shell commands and save sensitive browser artifacts without clear safety boundaries.

Review before installing. Use it only for projects and server commands you trust, avoid running commands derived from untrusted input, and avoid capturing full-page screenshots, DOM dumps, or raw console logs from authenticated or production-like sessions unless you have a plan to redact and delete them.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Behavioral ASTexec() Call, eval() Call, Dynamic Import
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access

Findings (5)

subprocess module call

Medium

Category: Dangerous Code Execution
Content: print(f"Starting server {i+1}/{len(servers)}: {server['cmd']}") # Use shell=True to support commands with cd and && process = subprocess.Popen( server['cmd'], shell=True, stdout=subprocess.PIPE,
Confidence: 98% confidence
Finding: process = subprocess.Popen( server['cmd'], shell=True, stdout=subprocess.PIPE, stderr=subprocess.PIPE )

Lp3

Medium

Category: MCP Least Privilege
Confidence: 95% confidence
Finding: The skill instructs the agent to invoke shell commands and run helper scripts that can start arbitrary local servers, but it declares no permissions or safety boundaries. This creates a capability mismatch that can mislead operators and downstream policy systems, increasing the chance of unintended command execution, file creation, or interaction with untrusted local applications.

Missing User Warnings

Medium

Confidence: 92% confidence
Finding: The skill encourages capturing screenshots and browser logs during testing without warning that these artifacts may contain credentials, personal data, session tokens, or internal application details. In a local webapp-testing context, this is especially relevant because developers often test against staging-like data or authenticated sessions.

Missing User Warnings

Medium

Confidence: 97% confidence
Finding: The reconnaissance instructions explicitly tell users to save full-page screenshots and dump rendered page content, which can expose sensitive UI state, secrets embedded in the DOM, CSRF tokens, personal data, or proprietary content. Because this step is framed as a normal inspection workflow without safeguards, it increases the likelihood of unnecessary data collection and leakage.

Missing User Warnings

Medium

Confidence: 90% confidence
Finding: This example persists raw browser console output to a file under /mnt/user-data/outputs without any filtering, minimization, or warning to the user. Console logs often contain sensitive data such as tokens, PII, stack traces, internal URLs, or application secrets, so writing them to disk can create unintended retention and later disclosure risk.

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal