hermes-learning-loop

Security checks across malware telemetry and agentic risk

Overview

This is not malware, but it asks the agent to automatically save task lessons and modify long-term memory or skills without clear user approval or privacy limits.

Install only if you intentionally want a persistent self-learning journal for the agent. Require approval before writes to memory or skills, review proposed diffs, and avoid storing secrets, customer data, private prompts, exact file paths, credentials, or sensitive task details.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code

Findings (5)

Vague Triggers

Medium

Confidence: 86% confidence
Finding: The skill says it should trigger after complex tasks, periodic reviews, and extracting lessons from errors, which are broad conditions that could cause the skill to activate in many unrelated contexts. Over-broad invocation can lead to unnecessary persistent logging and memory updates, increasing the chance of capturing sensitive user data or interfering with normal task execution.

Vague Triggers

Medium

Confidence: 91% confidence
Finding: Terms like 'complex task,' 'non-trivial reusable workflow,' and 'must persist when user corrected the process' are ambiguous and leave too much discretion to the agent. In practice, this can cause excessive or inconsistent retention of interaction details, including corrections that may contain sensitive operational or personal information.

Ssd 3

Medium

Confidence: 95% confidence
Finding: The skill directs the agent to keep daily logs of valuable operations and lessons learned in persistent memory files. Because these logs are plain-language summaries of real tasks, they can easily capture confidential prompts, user corrections, internal workflows, credentials, business context, or sensitive file names, creating long-term data retention and secondary disclosure risk.

Ssd 3

Medium

Confidence: 98% confidence
Finding: This section requires immediate persistence of errors and user corrections, which are especially likely to include sensitive details because they often contain the exact content that failed or was corrected. Storing such material in long-term memory expands exposure beyond the original session and can later leak through retrieval, summaries, or unrelated task context.

Ssd 3

Medium

Confidence: 96% confidence
Finding: The periodic review process instructs the agent to read recent memory logs and summarize them into MEMORY.md, which propagates previously retained sensitive information into a broader and more frequently loaded memory surface. This increases both the persistence and the accessibility of confidential content, making accidental reuse or leakage more likely across future sessions.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal