Self Improving Agent

Security checks across malware telemetry and agentic risk

Overview

This self-improvement skill is purpose-aligned, but it encourages durable logging and cross-session sharing of potentially sensitive conversation and error context without enough redaction or consent guidance.

Review before installing. Use this only for projects where persistent learning memory is acceptable, keep hooks project-scoped where possible, avoid global hook activation, and require sanitized summaries only. Do not log secrets, credentials, tokens, raw prompts, customer or personal data, full stack traces, private URLs, or verbatim transcript content; review anything before promoting it into always-loaded agent files.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code

Findings (8)

Intent-Code Divergence

Medium

Confidence: 98% confidence
Finding: The document’s security section inaccurately minimizes risk by stating the scripts 'only output text' and 'don’t modify files or run commands,' while the same document configures those scripts to execute as command hooks. This can mislead users into enabling automatically executed scripts with undue trust, increasing the chance they deploy code with broader capabilities than advertised.

Missing User Warnings

Medium

Confidence: 94% confidence
Finding: The skill instructs persistent logging of corrections, errors, and context into markdown files without any privacy screening, secret redaction, or sensitivity warning. In real usage, these logs can easily capture API keys, personal data, proprietary code details, or internal operational information and retain them indefinitely.

Missing User Warnings

Medium

Confidence: 87% confidence
Finding: The document instructs users to log learnings, errors, and tool issues into persistent workspace files, but it does not warn against storing secrets, prompts, credentials, internal paths, or sensitive session content. In an agent environment, failures and corrections often contain sensitive operational context, so this creates a realistic risk of unintended retention and later disclosure.

Ssd 3

Medium

Confidence: 96% confidence
Finding: The skill normalizes storing user-provided details and sharing learnings across sessions, but it provides no scoping, minimization, or access-control guidance. That creates a straightforward path for natural-language data exfiltration into durable files and other agent contexts where sensitive content may spread far beyond the original session.

Ssd 3

High

Confidence: 98% confidence
Finding: The logging template explicitly asks for full context, which encourages copying rich conversational and operational details into plain markdown. Plain-language summaries often still contain secrets, internal architecture details, incident specifics, or personal information, making the leak surface both broad and durable.

Ssd 3

High

Confidence: 99% confidence
Finding: Recording the exact inputs or parameters used during failures is dangerous because commands often include secrets, environment values, database URLs, file paths, or customer identifiers. Persisting these details in markdown materially increases the chance of credential leakage and operational exposure.

Ssd 3

High

Confidence: 97% confidence
Finding: Presenting transcript reading and message passing as normal mechanisms for sharing learnings creates a direct avenue to propagate sensitive information from one session to another. Because transcripts can contain user prompts, outputs, code, and operational details, this significantly enlarges the blast radius of any sensitive disclosure.

Ssd 3

Medium

Confidence: 92% confidence
Finding: The guidance to 'promote aggressively' into long-lived instruction files increases persistence and distribution of whatever was logged earlier, including potentially sensitive or low-quality content. This turns one-time session data into durable ambient context that may influence future sessions and leak further.

VirusTotal

61/61 vendors flagged this skill as clean.

View on VirusTotal