Skill Sandbox

Security checks across malware telemetry and agentic risk

Overview

This is a defensive testing tool, but its sandbox claims are stronger than what its code enforces, so users could run unsafe skills believing they are contained.

Use this only as a lightweight inspection helper, not as a real sandbox. Run unknown skills inside Docker, a VM, or another disposable environment, and do not trust a SAFE report as proof that the skill had no filesystem, network, subprocess, or credential behavior.

SkillSpector

By NVIDIA

Vulnerability Patterns

MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code

Findings (2)

Description-Behavior Mismatch

High

Confidence: 98% confidence
Finding: The sandbox advertises monitoring of filesystem, network, environment, and subprocess activity, but the actual execution path uses subprocess.run with a sanitized environment and never applies the monitored open() or monitored environment wrappers to the child process. As a result, untrusted code can perform runtime file, network, and subprocess activity without those actions being captured in the report, creating a false sense of safety for anyone relying on this tool to vet skills before installation.

Intent-Code Divergence

Medium

Confidence: 96% confidence
Finding: The CLI exposes flags such as --restricted and --monitor-network and prints guidance implying network blocking and enhanced monitoring, but those options are not enforced in execution. This misleading interface can cause operators to believe dangerous actions are being prevented when they are not, increasing the chance that malicious skills are trusted and installed after inadequate testing.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal