Skill Review Pro

Security checks across malware telemetry and agentic risk

Overview

This is a disclosed skill-review tool with an optional, user-confirmed fix workflow, not a hidden or automatic modifier.

Install only if you are comfortable with a skill that can read local Skill files and, when you explicitly proceed through its fix workflow, edit the selected Skill. Review the proposed diffs before approving fixes, especially when using broad prompts like "improve" or "enhance".

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (5)

Description-Behavior Mismatch

Medium
Confidence
93% confidence
Finding
The manifest frames the skill as a static reviewer, but the body also includes fix execution and direct modification workflows. This capability mismatch can cause users or calling systems to grant broader trust than intended, and can lead to unintended file changes when the skill is invoked for what appears to be read-only analysis.

Intent-Code Divergence

High
Confidence
97% confidence
Finding
The Chinese responsibility boundary states that the skill must not modify the target Skill, but later sections instruct it to execute fixes. Contradictory safety boundaries are dangerous because downstream agents may follow the more operational instruction and perform writes despite an apparent non-modification guarantee.

Intent-Code Divergence

High
Confidence
97% confidence
Finding
The English section repeats the same contradiction: 'Never modify the target Skill' is followed by explicit fix execution workflows. In bilingual skills, inconsistent or conflicting operating rules increase the chance of unsafe behavior because the model may privilege the later procedural instructions over the earlier boundary.

Vague Triggers

Medium
Confidence
90% confidence
Finding
The trigger phrases are broad and overlap with ordinary review or improvement requests, making accidental activation likely. In a skill that can transition from review into fix workflows, loose matching increases the risk of invoking higher-impact behavior without the user clearly intending it.

Vague Triggers

Medium
Confidence
94% confidence
Finding
The fix-related modes can be triggered by generic words like 'improve' or 'directly fix', which are common in harmless discussion. Because these phrases can switch the skill into a repair workflow, ambiguous activation creates a pathway from advisory interaction to modification-oriented behavior without sufficiently explicit consent.

VirusTotal

63/63 vendors flagged this skill as clean.

View on VirusTotal