Skill Creator (Ming)

Security checks across malware telemetry and agentic risk

Overview

The skill mostly does what it claims, but it includes review-worthy local automation that can stop unrelated services and encourages broad skill triggering.

Install only if you are comfortable with a skill that can create and modify skill files, run local Python helpers, call the local Claude CLI, and launch a localhost review viewer. Use a dedicated workspace, avoid putting secrets in eval prompts or outputs, prefer the static viewer mode when possible, choose an unused viewer port, and review generated skill descriptions so they do not trigger too broadly.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • Rogue AgentSelf-Modification, Session Persistence
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • Behavioral ASTexec() Call, eval() Call, Dynamic Import
Findings (8)

Lp3

Medium
Category
MCP Least Privilege
Confidence
88% confidence
Finding
The skill instructs the agent to read and write files, execute shell commands, inspect environment context, and launch scripts, but it does not declare permissions or constrain those capabilities. That creates a least-privilege gap: if this skill is triggered unexpectedly or on untrusted input, it can perform high-impact local actions without an explicit trust boundary.

Context-Inappropriate Capability

Medium
Confidence
98% confidence
Finding
The script forcibly kills whatever process is listening on the requested port before starting its own server, without verifying ownership, provenance, or whether termination is safe. In a developer workstation or shared environment, this can disrupt unrelated services, destroy active work, or be abused to terminate security-sensitive local processes if a user runs the tool with a chosen port.

Vague Triggers

Medium
Confidence
92% confidence
Finding
The skill explicitly recommends making descriptions more 'pushy' and to trigger even when users do not explicitly ask for the skill. This increases the chance of overbroad activation, causing the agent to invoke a powerful filesystem-and-shell-capable workflow for adjacent or ambiguous requests where it is unnecessary or unsafe.

Vague Triggers

Medium
Confidence
87% confidence
Finding
Telling the agent to infer where the user is in the process and 'jump in' creates an ambiguous activation boundary. In context, this skill can run evaluations, write files, spawn background jobs, and execute scripts, so ambiguity materially raises the risk of unintended actions being taken without sufficiently clear user intent.

Vague Triggers

Medium
Confidence
88% confidence
Finding
This file defines two distinct operating modes in one agent file: a post-hoc skill-comparison analyzer and a benchmark-results analyzer. Because the top-level description and instructions are broad and multiplexed, an orchestrator or triggering system may invoke this skill in unintended contexts, causing it to read the wrong inputs, produce the wrong output schema, or analyze benchmark data when improvement analysis was expected (and vice versa).

Missing User Warnings

Medium
Confidence
95% confidence
Finding
Spreadsheet content is converted to HTML with SheetJS and then inserted with innerHTML, which can expose the page to script or markup injection if a crafted spreadsheet produces unsafe HTML. This viewer handles untrusted eval artifacts, so rendering attacker-controlled spreadsheet-derived HTML in the same DOM context is more dangerous than in a trusted-only workflow.

Missing User Warnings

Medium
Confidence
95% confidence
Finding
The code sends full skill content, eval results, history, and possibly prior model outputs to an external `claude` process without any explicit user-facing consent or visibility at the point of transmission. If those inputs contain proprietary prompts, internal test data, secrets, or sensitive customer text, this creates a real confidentiality risk through unintended disclosure to an external model/service.

Session Persistence

Medium
Category
Rogue Agent
Content
4. **Launch the viewer** with both qualitative outputs and quantitative data:
   ```bash
   nohup python <skill-creator-path>/eval-viewer/generate_review.py \
     <workspace>/iteration-N \
     --skill-name "my-skill" \
     --benchmark <workspace>/iteration-N/benchmark.json \
Confidence
90% confidence
Finding
nohup

VirusTotal

39/39 vendors flagged this skill as clean.

View on VirusTotal