Self-Improving Meta

Security checks across malware telemetry and agentic risk

Overview

This skill is not malicious, but it needs review because it can persistently steer or change agent prompts, hooks, memory, and skill files with inconsistent approval boundaries.

Install only if you intentionally want an agent to help maintain its own prompt, hook, memory, and skill infrastructure. Keep hooks project-scoped, avoid empty or global matchers, do not log secrets or raw command output, and require an explicit reviewed diff before any edit to shared prompt files, memory, hooks, rules, or skill definitions.

SkillSpector

By NVIDIA

Vulnerability Patterns

Rogue AgentSelf-Modification, Session Persistence
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration

Findings (7)

Intent-Code Divergence

Medium

Confidence: 88% confidence
Finding: The document first forbids auto-modifying core prompt files without explicit approval, then elsewhere encourages direct application of fixes to the governed files. In a self-improving skill that targets agent infrastructure, this contradiction can normalize unauthorized edits to high-impact control files and weaken operator review gates.

Intent-Code Divergence

Medium

Confidence: 90% confidence
Finding: The skill says important learnings should be promoted only after explicit human review, but later states broadly applicable fixes may be promoted directly to affected files. Because this skill governs infrastructure used by future sessions, that inconsistency creates a real risk of silent self-modification of trusted agent control surfaces.

Vague Triggers

Medium

Confidence: 86% confidence
Finding: An empty hook matcher causes the activator to run on every user prompt, dramatically broadening the skill's automatic activation surface. For a meta-skill that can influence infrastructure and persistent learnings, this increases the chance of unnecessary prompting, noisy persistence, and unintended workflow manipulation.

Vague Triggers

Medium

Confidence: 87% confidence
Finding: The advanced hook setup repeats the same empty matcher pattern, extending broad activation to additional events including PostToolUse. This can cause the skill to inspect or react to far more activity than necessary, increasing accidental logging and persistent meta-behavior changes.

Vague Triggers

Medium

Confidence: 88% confidence
Finding: The template tells authors to include trigger conditions but does not require them to be narrow, testable, or accompanied by exclusions. In a meta-skill that modifies shared agent infrastructure, vague triggers can cause skills to activate too broadly and unintentionally alter prompts, hooks, rules, or memory behavior across many sessions.

Vague Triggers

Medium

Confidence: 95% confidence
Finding: The empty matcher causes the UserPromptSubmit hook to fire on every prompt, greatly expanding the hook’s activation scope. In a self-improving infrastructure skill, this increases the chance of unnecessary prompt interception, context pollution, and unreviewed persistence of meta-behavior across normal workflows.

Session Persistence

Medium

Category: Rogue Agent
Content: ### Option 1: Project-Level Configuration Create `.claude/settings.json` in your project root (activator-only recommended): ```json {
Confidence: 87% confidence
Finding: Create `.claude/settings.json` in your project root (activator-only recommended): ```json { "hooks": { "UserPromptSubmit": [ { "matcher": "", "hooks": [ {

VirusTotal

47/47 vendors flagged this skill as clean.

View on VirusTotal