Self-Improving Agent

Security checks across malware telemetry and agentic risk

Overview

This instruction-only skill openly stores correction patterns locally so an agent can improve over time, but users should review what gets saved.

Install only if you want persistent local agent rules. Review or approve each new RULES.md entry, keep entries general, avoid secrets or client-specific personal details, and periodically prune or delete outdated rules.

SkillSpector

By NVIDIA

Vulnerability Patterns

Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep

Findings (3)

Missing User Warnings

Medium

Confidence: 93% confidence
Finding: The skill describes an automatic loop where user corrections are logged into RULES.md and reused in future sessions, but it does not require explicit user notice or consent before modifying persistent files. This can lead to silent state changes, unexpected persistence of user content, and abuse where adversarial or mistaken corrections alter future agent behavior.

Missing User Warnings

Medium

Confidence: 97% confidence
Finding: The instruction to log ANY user correction to RULES.md lacks filtering for secrets, personal data, or sensitive workflow details. Because corrections may contain client names, email preferences, calendar habits, or other confidential information, this creates a persistent privacy and data-retention risk that can expose sensitive content across future sessions.

Ssd 3

Medium

Confidence: 98% confidence
Finding: Persistent logging of all corrections creates a long-lived memory file that can accumulate sensitive user-provided information without safeguards, minimization, or access controls. In this skill's context, the danger is increased because the feature is explicitly designed to learn from every correction over time, making privacy leakage and prompt/data poisoning more likely.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal