skill-creator

Security checks across malware telemetry and agentic risk

Overview

The skill mostly matches its stated skill-building purpose, but its local review viewer can terminate other local processes and it loads third-party scripts into pages containing eval outputs.

Install only if you are comfortable with a skill that can run local Python/shell workflows, spawn eval agents, call Claude/Anthropic tooling, and write review artifacts. Prefer the static viewer mode for sensitive evaluations, avoid using occupied ports, stop any background viewer after review, and do not embed confidential outputs in the CDN-backed viewer unless you accept that external script dependency.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • Rogue AgentSelf-Modification, Session Persistence
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • Behavioral ASTexec() Call, eval() Call, Dynamic Import
  • MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
Findings (9)

Lp3

Medium
Category
MCP Least Privilege
Confidence
91% confidence
Finding
The skill instructs the agent to read and write files, execute shell commands, launch Python scripts, and use environment-dependent tooling, but it does not declare any permissions or capability boundaries. That mismatch increases the risk of over-privileged execution and makes it harder for a host system or reviewer to reason about what the skill is allowed to do.

Context-Inappropriate Capability

High
Confidence
98% confidence
Finding
The script unconditionally identifies processes bound to the requested port and sends SIGTERM to them before starting its own server. That gives a review utility unnecessary host-management power and can disrupt unrelated local services, developer tools, or security controls if the user points it at a commonly used port.

Intent-Code Divergence

Medium
Confidence
89% confidence
Finding
The top-level description says the tool only generates and serves a review page, but the implementation also kills processes on the selected port. This mismatch is security-relevant because operators may run the tool assuming it is read-only/non-destructive, increasing the chance of accidental disruption.

Context-Inappropriate Capability

Medium
Confidence
93% confidence
Finding
This viewer loads executable JavaScript and fonts from third-party CDNs at runtime, which creates an unnecessary external trust dependency for a local eval-review tool. If the CDN, dependency, or delivery path is compromised, an attacker could inject script into the review page and access embedded run data, reviewer feedback, or manipulate the UI and saved results.

Vague Triggers

Medium
Confidence
80% confidence
Finding
The top-level description is broad enough to trigger on many generic requests about creating, modifying, testing, benchmarking, or optimizing skills. Over-broad activation can cause the wrong skill to run in unrelated contexts, leading to unnecessary file operations, shell execution, or workflow changes the user did not explicitly request.

Vague Triggers

High
Confidence
94% confidence
Finding
The instruction to make descriptions 'pushy' and trigger on broad related mentions actively encourages ambiguous or excessive skill activation. In context, this is more dangerous because the skill can spawn subagents, execute scripts, write files, and launch background viewers, so accidental triggering can amplify operational and security risk beyond a harmless prompt mismatch.

Vague Triggers

Medium
Confidence
72% confidence
Finding
The file defines two materially different analysis modes in one prompt surface with only prompt-supplied parameters distinguishing them. An LLM or orchestration layer can misclassify the task and follow the wrong instruction set, causing benchmark runs to produce prescriptive skill-improvement advice or post-hoc comparisons to omit required structured output; in a multi-agent pipeline, this can corrupt downstream decisions and evaluations.

Missing User Warnings

Medium
Confidence
94% confidence
Finding
The script logs the full prompt, model response, and especially the model's hidden 'thinking' content to disk in a JSON transcript. Those fields can contain sensitive data from skill contents, eval queries, prior history, and internal reasoning that users typically do not expect to be persisted, creating a confidentiality and prompt-leakage risk if logs are later accessed, shared, or committed.

Session Persistence

Medium
Category
Rogue Agent
Content
4. **Launch the viewer** with both qualitative outputs and quantitative data:
   ```bash
   nohup python <skill-creator-path>/eval-viewer/generate_review.py \
     <workspace>/iteration-N \
     --skill-name "my-skill" \
     --benchmark <workspace>/iteration-N/benchmark.json \
Confidence
88% confidence
Finding
nohup

VirusTotal

VirusTotal findings are pending for this skill version.

View on VirusTotal