Security Guard

Security checks across malware telemetry and agentic risk

Overview

This skill tries to prevent secret leaks, but it automatically pulls in personal memory/profile files and its redaction helper can expose short secrets.

Review before installing. The skill is not shown to exfiltrate data or perform destructive actions, and static scan was clean while VirusTotal was pending, but its automatic memory/profile reads and weak short-secret masking are material risks. Install only if you are comfortable tightening or removing the session initialization protocol and fixing the sanitizer so it never reveals complete sensitive values.

SkillSpector

By NVIDIA

Vulnerability Patterns

Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration

Findings (6)

Description-Behavior Mismatch

Medium

Confidence: 95% confidence
Finding: The examples direct users to run commands like `cat ~/.openclaw/openclaw.json | grep token`, which can print full secrets to a terminal or shell history without any masking. In a skill specifically intended to prevent sensitive-data disclosure, teaching users to dump raw credentials from a config file undermines the control boundary and normalizes unsafe secret handling.

Description-Behavior Mismatch

Medium

Confidence: 92% confidence
Finding: The file says session initialization happens only after a user asks for sensitive information, while the skill metadata requires it at the start of every session. This gap can leave early turns unprotected, creating inconsistent enforcement and increasing the chance that sensitive requests are handled before the security context is established.

Context-Inappropriate Capability

Medium

Confidence: 91% confidence
Finding: The initialization example instructs reading `SOUL.md`, `USER.md`, and a dated memory file, which exceeds the narrow purpose of enforcing sensitive-data handling. Expanding scope to unrelated context sources increases the attack surface for prompt injection and may expose unnecessary personal or operational data during a security-sensitive workflow.

Description-Behavior Mismatch

High

Confidence: 99% confidence
Finding: When the input length is less than or equal to show-first + show-last, the script sets SHOW_FIRST and SHOW_LAST so they cover the entire string, leaving AST_COUNT non-positive and ultimately outputting the full secret plus a single asterisk. For a skill explicitly intended to protect sensitive data in all chats, this behavior can disclose short API keys, tokens, PINs, OTPs, or truncated credentials instead of masking them.

Intent-Code Divergence

Medium

Confidence: 94% confidence
Finding: The comments and documented purpose claim the script sanitizes sensitive information, but the implementation does not uphold that guarantee for short inputs. In a security-focused skill, this mismatch is dangerous because downstream users and other components may trust the tool to safely redact secrets and inadvertently leak complete values.

Vague Triggers

Medium

Confidence: 79% confidence
Finding: The activation criteria are very broad ('use when handling requests involving sensitive data or when user asks to bypass security rules'), which can cause the skill to trigger in many normal conversations. Over-broad activation increases the chance of unnecessary file reads, excessive refusals, and unintended application of the session initialization behavior in contexts where it is not needed.

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal