Skill Creator

Security checks across malware telemetry and agentic risk

Overview

This is a coherent skill-building helper, but it needs review because its helper scripts can send skill content to Claude, start a local background server, and terminate whatever process is using the viewer port.

Install only if you are comfortable with a skill that edits and packages skills, runs Python helpers and Claude CLI subprocesses, creates local review artifacts, and may transmit skill content and eval data to Claude. Prefer the static viewer mode or a foreground server, avoid running it in sensitive repositories without reviewing what will be sent externally, and check for any existing service on the viewer port before launching it.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Rogue AgentSelf-Modification, Session Persistence
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration

Findings (7)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 95% confidence
Finding: The skill instructs the agent to read and write files, invoke shell commands, inspect environment context, launch background processes, and package artifacts, but it declares no permissions or compatibility constraints. That mismatch can cause the skill to run with broader-than-expected capabilities and gives users no visibility into the operational or security-sensitive actions it may take.

Context-Inappropriate Capability

Medium

Confidence: 97% confidence
Finding: The helper unconditionally finds any process listening on the requested port and sends SIGTERM to it before starting the local review server. That can terminate unrelated local services owned by the user, causing denial of service, data loss, or disruption if the port is in use by something important.

Vague Triggers

Medium

Confidence: 84% confidence
Finding: The skill explicitly tells authors to make descriptions 'pushy' and to trigger on broad adjacent contexts even when the user does not explicitly ask for the skill. That increases over-triggering risk, causing this high-capability skill to activate in situations where shell, file, and evaluation workflows are unnecessary, which expands the attack surface and may lead to unintended file operations or process execution.

Missing User Warnings

Low

Confidence: 89% confidence
Finding: The page loads Google Fonts and the SheetJS library from third-party CDNs, which causes users' browsers to make outbound requests to external domains and exposes metadata such as IP address, user agent, and timing information. It also introduces a supply-chain and availability dependency: if a CDN is compromised, blocked, or unavailable, the viewer may leak data or fail to render correctly.

Missing User Warnings

Medium

Confidence: 86% confidence
Finding: The script sends full skill content, evaluation failures, history, and potentially user-derived queries to an external `claude` subprocess without any consent, redaction, or policy check in this file. If those inputs contain proprietary code, secrets, or sensitive prompts, they may be disclosed to an external model service unintentionally, creating a confidentiality and compliance risk.

Missing User Warnings

Low

Confidence: 95% confidence
Finding: The script automatically opens a generated HTML report in the user's default browser without explicit confirmation. Even though the path is local and generated by the tool, this creates an unexpected side effect, may leak information through the browser environment, and can be unsafe in headless, shared, or sensitive execution contexts.

Session Persistence

Medium

Category: Rogue Agent
Content: 4. **Launch the viewer** with both qualitative outputs and quantitative data: ```bash nohup python <skill-creator-path>/eval-viewer/generate_review.py \ <workspace>/iteration-N \ --skill-name "my-skill" \ --benchmark <workspace>/iteration-N/benchmark.json \
Confidence: 88% confidence
Finding: nohup

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal