Self Improving Agent

Security checks across malware telemetry and agentic risk

Overview

This skill appears purpose-aligned, but it needs review because it can persist user and project context into long-term agent memory with broad automatic triggers.

Install only if you want the agent to keep durable learning and memory files. Prefer project-scoped hooks with narrow matchers, avoid global every-prompt activation, and review or redact entries before they are promoted into MEMORY.md, AGENTS.md, SOUL.md, or TOOLS.md.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Rogue AgentSelf-Modification, Session Persistence
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection

Findings (15)

Intent-Code Divergence

Medium

Confidence: 98% confidence
Finding: The document's security note is internally inconsistent: the hook configuration explicitly invokes shell scripts via a command hook, yet the note claims the scripts only output text and do not run commands. This can mislead users into underestimating the execution risk of enabling hooks, especially since hook scripts run with the agent's permissions and can be modified or replaced.

Vague Triggers

Medium

Confidence: 88% confidence
Finding: The activation criteria are broad enough that the skill may trigger during many normal interactions, including routine failures, corrections, or tool issues. In a skill that writes to persistent learning files and memory, over-broad invocation increases the chance of storing sensitive, low-quality, or adversarially induced content without deliberate user consent.

Missing User Warnings

Medium

Confidence: 92% confidence
Finding: The README advertises automatic error logging, learning capture, memory integration, and rule formation, but does not warn about privacy, retention, or the risk of persisting sensitive data from prompts, tool output, or user corrections. This creates a real security and privacy issue because an agent may silently store confidential information or poisoned instructions into durable memory.

Missing User Warnings

Medium

Confidence: 91% confidence
Finding: The skill instructs the agent to persist conversation-derived errors, corrections, and learnings into local files immediately, but it does not require informing the user or obtaining consent before storing that data. This creates a privacy and data-retention risk because sensitive user statements, mistakes, or contextual details may be written to durable storage without the user's awareness.

Missing User Warnings

Medium

Confidence: 93% confidence
Finding: The logging guidance captures user requests, feedback, and missing-capability signals in persistent files without warning that these entries may contain personal, proprietary, or sensitive operational information. Over time this can build an undocumented dossier of user interactions that may later be exposed to other tools, agents, or operators.

Vague Triggers

Medium

Confidence: 87% confidence
Finding: An empty matcher causes the hook to trigger on every prompt, greatly expanding when the command runs. In a self-improvement skill, that broad scope increases exposure to unnecessary command execution, prompt-context injection, and accidental persistence of behavior across routine interactions.

Vague Triggers

Medium

Confidence: 92% confidence
Finding: Using an empty matcher in a user-level configuration makes the command run for all prompts across sessions, which broadens impact beyond a single project. That persistent global activation increases the chance of unintended execution and makes any future script change affect all work indiscriminately.

Vague Triggers

Medium

Confidence: 86% confidence
Finding: Although labeled minimal, this setup still uses an empty matcher, so it remains broadly triggered on every prompt. Reducing the number of hooks lowers overhead but does not reduce the core risk of indiscriminate command execution on normal interaction flow.

Vague Triggers

Medium

Confidence: 88% confidence
Finding: The Codex CLI example also uses an empty matcher, causing the command to run on every prompt submission. Because this is documentation intended for reuse, it can propagate an overly broad and unsafe default across environments.

Vague Triggers

Medium

Confidence: 88% confidence
Finding: The trigger descriptions are broad enough to encourage frequent, automatic logging of errors, surprises, and knowledge gaps without clear scoping or minimization rules. In a self-improvement skill that writes to persistent files, this can lead to over-collection of sensitive user, workspace, or operational context and create privacy and data retention risk.

Missing User Warnings

Medium

Confidence: 93% confidence
Finding: The documentation instructs the agent to log learnings to persistent `.learnings/` storage but does not warn against storing sensitive prompts, personal data, credentials, or session-specific content. Because this skill is explicitly designed to capture failures and corrections, the omission makes accidental long-term retention of sensitive information more likely.

Missing User Warnings

Medium

Confidence: 95% confidence
Finding: The file documents transcript access and cross-session messaging capabilities without any privacy boundary, consent requirement, or minimization guidance. In a memory-oriented skill, this materially increases the risk that information from one session will be copied into another session or into persistent memory, enabling unintended disclosure across contexts.

Ssd 3

Medium

Confidence: 95% confidence
Finding: The skill explicitly tells the agent to consolidate lessons into long-term memory files and to promote information into durable documents such as MEMORY.md, AGENTS.md, SOUL.md, and TOOLS.md. Because these records can include user-provided corrections, preferences, and context, the design increases the chance of retaining personal or sensitive information far beyond the original conversation.

Ssd 3

Medium

Confidence: 97% confidence
Finding: The recommended MEMORY.md structure explicitly includes storing information 'about the boss' such as user preferences, habits, and important information in a persistent file. This is a concrete privacy risk because it normalizes profiling and durable storage of personal data without any stated consent, minimization, or access-control safeguards.

Session Persistence

Medium

Category: Rogue Agent
Content: ### Option 1: Project-Level Configuration Create `.claude/settings.json` in your project root: ```json {
Confidence: 79% confidence
Finding: Create `.claude/settings.json` in your project root: ```json { "hooks": { "UserPromptSubmit": [ { "matcher": "", "hooks": [ { "type": "command",

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal