Skill Creator Claude

Security checks across malware telemetry and agentic risk

Overview

This appears to be a legitimate skill-building toolkit, but it needs Review because it can run local automation, stop local processes, send skill/eval content to Anthropic, and persist sensitive transcripts.

Install only if you are comfortable with a developer-focused skill that runs local Python tooling, uses subagents or the Claude CLI where available, opens local review pages, and may send skill/eval content to Anthropic during description optimization. Avoid using it with confidential skills, customer data, or secrets unless you review the scripts, prefer static/offline viewer output, avoid ports used by important local services, and disable or avoid detailed optimization logging for sensitive work.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access

Findings (5)

Context-Inappropriate Capability

Medium

Confidence: 97% confidence
Finding: The code unconditionally finds and sends SIGTERM to any process listening on the requested port before starting its own server. This can terminate unrelated local services owned by the same user, causing denial of service or accidental disruption of developer tools, and the risk is elevated because this skill is intended to create and run tooling automatically.

Missing User Warnings

Low

Confidence: 88% confidence
Finding: The README instructs users to set ANTHROPIC_API_KEY but does not warn that it is a sensitive secret or provide basic handling guidance. In a skill intended for broad cross-platform installation, this omission can lead users to expose the key in shell history, logs, screenshots, shared environments, or committed config files, increasing the risk of credential leakage and API abuse.

Vague Triggers

Medium

Confidence: 84% confidence
Finding: The guidance explicitly tells authors to make descriptions 'pushy' and trigger on broad mentions even when the user does not explicitly ask for the skill. In a skill that can execute shell commands, create files, run benchmarks, and launch background processes, over-triggering increases the chance that privileged actions occur in contexts where the user expected a normal conversational response rather than automation.

Missing User Warnings

Medium

Confidence: 95% confidence
Finding: The script sends potentially sensitive local data to Anthropic, including full skill content, evaluation results, prior attempt history, and user query text embedded in failures, without any notice, consent flow, redaction, or data-classification guardrails. In a skill-creation context, those inputs can easily contain proprietary prompts, internal benchmarks, or user-derived text, so silent exfiltration to an external API is a real privacy and confidentiality risk.

Missing User Warnings

Medium

Confidence: 98% confidence
Finding: The code extracts the model's hidden 'thinking' content and writes it to disk along with the full prompt, response, and rewritten response. This creates a durable local transcript containing sensitive inputs, model internals, and potentially confidential evaluation data, increasing the chance of later disclosure through logs, backups, shared workspaces, or source control mistakes.

VirusTotal

67/67 vendors flagged this skill as clean.

View on VirusTotal