Enterprise Legal Guardrails Public

Security checks across malware telemetry and agentic risk

Overview

This skill is a disclosed local legal guardrail, but its wrapper can execute outbound commands after only a review warning and has a broad allow-any override that users should review carefully.

Install only if you need a command-wrapping outbound guardrail, not just a passive checker. For real posting, email, messaging, or publishing workflows, enable --strict, use a narrow --allowed-command list, avoid --allow-any-command in normal use, enable --sanitize-env with explicit keep lists, and verify the GitHub source before running it.

SkillSpector

By NVIDIA

Vulnerability Patterns

Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Behavioral ASTexec() Call, eval() Call, Dynamic Import
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration

Findings (6)

subprocess module call

Medium

Category: Dangerous Code Execution
Content: command_ms = None command_start = time.perf_counter() try: proc = subprocess.run(command, check=False, env=env, timeout=args.command_timeout) command_ms = int((time.perf_counter() - command_start) * 1000) except FileNotFoundError: msg = f"Command not found: {command[0]}"
Confidence: 93% confidence
Finding: proc = subprocess.run(command, check=False, env=env, timeout=args.command_timeout)

Description-Behavior Mismatch

Low

Confidence: 92% confidence
Finding: The guardrail can be disabled entirely via environment flags or bypassed for selected apps through scope settings, causing the tool to return PASS without analyzing outbound content. In a system that relies on this skill as a pre-publish safety control, misconfiguration or adversarial control of runtime settings can silently eliminate legal/compliance review and allow risky content to be sent.

Description-Behavior Mismatch

Medium

Confidence: 91% confidence
Finding: The advertised purpose is legal/compliance filtering, but the implementation is also a command-execution adapter. That mismatch increases risk because operators may deploy or trust the skill as a passive guardrail while it actually has the capability to launch external processes.

Context-Inappropriate Capability

High

Confidence: 99% confidence
Finding: The code provides an explicit bypass that permits any command to run when a reason string and approval token are present, but these are only checked for presence/format and can come from CLI or environment variables. That means the central safety control—the command allowlist—can be programmatically disabled without cryptographic verification or trusted external authorization.

Description-Behavior Mismatch

Medium

Confidence: 83% confidence
Finding: The test suite confirms that the adapter can execute arbitrary wrapped commands, including explicit bypass modes such as --allow-any-command. For a skill described as legal/compliance guardrails for outbound actions, validating broad command-execution behavior materially expands the trusted capability surface beyond content review and makes misuse of the skill more dangerous if exposed to untrusted inputs or agents.

Context-Inappropriate Capability

High

Confidence: 88% confidence
Finding: This file exercises a design where a legal/compliance guardrail adapter also executes arbitrary subprocesses and supports policy bypass through an unsafe escape hatch. In context, that coupling is dangerous because a guardrail-branded skill may be granted trust while actually acting as a command runner, increasing the chance of abuse, privilege misuse, or accidental execution of unintended actions.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal