Back to skill

Security audit

Feihong Self Improving Agent

Security checks across malware telemetry and agentic risk

Overview

This skill appears purpose-aligned, but it needs review because it encourages durable agent memory, broad logging of task context, and optional automatic hooks without strong redaction or scoping controls.

Install only if you want agents to keep durable learning notes. Before enabling hooks or cross-session sharing, add a local rule to summarize and redact secrets, tokens, customer data, private URLs, raw prompts, and full command outputs. Keep .learnings out of version control unless every entry is intentionally shareable.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Rogue AgentSelf-Modification, Session Persistence
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (7)

Vague Triggers

Medium
Confidence
84% confidence
Finding
The trigger phrases are common in normal conversation and are mapped to automatic logging actions without requiring sensitivity checks, consent, or contextual validation. This makes it easy for benign conversational content, corrections, or user-provided facts to be unnecessarily persisted into long-term files, increasing privacy and data-retention risk.

Missing User Warnings

Medium
Confidence
92% confidence
Finding
The guide instructs users to enable a user-level UserPromptSubmit hook that executes a local shell script on every prompt across all sessions. Although the script may be intended to be harmless, persistent global command execution materially expands the attack surface and is not paired with an immediate, prominent warning that every prompt submission triggers code execution.

Missing User Warnings

Medium
Confidence
88% confidence
Finding
The setup examples configure command-type hooks that run shell scripts, but the initial instructions do not immediately disclose that normal prompt submission and Bash tool use will cause local executable scripts to run. This can lead users to enable automation without understanding that they are introducing automatic command execution into their workflow.

Ssd 3

Medium
Confidence
92% confidence
Finding
The skill explicitly instructs the agent to persist user corrections, requests, and promoted learnings into durable workspace and project memory files. In a security context, that creates a clear risk of retaining sensitive prompts, business context, credentials, or private workflow details in places that may later be reused, exposed to other agents, or committed to source control.

Ssd 3

Medium
Confidence
95% confidence
Finding
The cross-session guidance encourages reading other sessions' transcripts and sending learnings between sessions without any access-control, need-to-know, redaction, or consent requirements. That can expose sensitive user content across session boundaries and amplify a leak from one session into multiple agents or workspaces.

Ssd 3

Medium
Confidence
94% confidence
Finding
The logging templates instruct the agent to capture full context, inputs, parameters, environment details, and literal error output. Those fields commonly contain credentials, API keys, filesystem paths, internal URLs, user data, stack traces, and operational details that become durable artifacts and may be indexed, shared, or committed later.

Session Persistence

Medium
Category
Rogue Agent
Content
└── FEATURE_REQUESTS.md
```

### Create Learning Files

```bash
mkdir -p ~/.openclaw/workspace/.learnings
Confidence
81% confidence
Finding
Create Learning Files ```bash mkdir -p ~/.openclaw

VirusTotal

63/63 vendors flagged this skill as clean.

View on VirusTotal

Static analysis

No suspicious patterns detected.