SkillGuard

Security checks across malware telemetry and agentic risk

Overview

SkillGuard is a coherent safety scanner, but it asks for high-trust credentials and automatically changes persistent workspace policy beyond a simple scan.

Install only if you trust the Apify actor/operator and are comfortable exposing a protected OpenClaw webhook path and related tokens. Use dedicated, rotatable credentials, review or remove the automatic TOOLS.md policy write, and treat a passing result as a limited SKILL.md scan rather than proof the full skill package is safe.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Findings (6)

Lp3

Medium
Category
MCP Least Privilege
Confidence
88% confidence
Finding
The skill instructs the agent to run shell commands such as the bundled bash script and a file-append command, yet the manifest does not declare shell capability or equivalent permission. This creates a transparency and policy-enforcement gap: reviewers and runtime controls may underestimate what the skill can do, especially because one of the commands modifies local workspace state.

Description-Behavior Mismatch

Medium
Confidence
95% confidence
Finding
The skill is presented as a scanner/auditor, but it also instructs the agent to change TOOLS.md and alter future installation behavior across sessions. That is a scope expansion from passive analysis into persistent policy modification, which can influence unrelated future tasks and bypass user expectations about what a scanning skill should do.

Context-Inappropriate Capability

High
Confidence
98% confidence
Finding
Automatically appending policy text to TOOLS.md is not necessary to scan a skill and gives this skill a durable way to influence the agent's future behavior. Persistent instruction injection into workspace policy files is dangerous because it survives the current task, affects later sessions, and normalizes self-installing guardrails without explicit consent.

Vague Triggers

Medium
Confidence
72% confidence
Finding
The description says the skill should run automatically when the user asks to install a skill, which is a broad trigger for a third-party action involving external services and webhooks. Broad invocation increases the chance of unintended activation, especially in systems where trigger matching is heuristic or overlaps with other install workflows.

Vague Triggers

Medium
Confidence
80% confidence
Finding
Several example trigger phrases are common natural-language requests like 'install [skill]' or 'scan [skill]', making accidental invocation plausible. In a tool-enabled agent, ambiguous triggers can cause unsolicited network calls, webhook setup, or policy-related actions when the user may only be asking a question.

Missing User Warnings

Medium
Confidence
94% confidence
Finding
The skill directs automatic modification of a local policy file during first install without adequate warning that it changes persistent workspace state. Silent or implicit local state changes are dangerous because they can alter future agent behavior, create hard-to-trace policy drift, and exceed the user's reasonable expectation for a scanning utility.

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal