Agent Self-Evolve

Security checks across malware telemetry and agentic risk

Overview

The skill is openly designed for agent learning, but it can run scheduled self-updates that modify code, skills, and memory without clear per-change approval.

Install only if you intentionally want an agent that maintains persistent learning records and may change files over time. Before enabling cron or heartbeat, change the prompts so code, skill, AGENTS.md, SOUL.md, and configuration changes are proposed for human review instead of applied automatically, and limit writable paths to a narrow allowlist.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • Rogue AgentSelf-Modification, Session Persistence
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Findings (7)

Missing User Warnings

Medium
Confidence
92% confidence
Finding
The skill explicitly instructs the agent to execute queued improvements, including code fixes and skill updates, during unattended daily evolution. Because there is no required user approval, scope restriction, or safe-change policy, this creates an autonomous self-modification path that can introduce insecure changes, corrupt behavior, or operational drift over time.

Missing User Warnings

Medium
Confidence
94% confidence
Finding
The optional heartbeat guidance says urgent improvements should be executed immediately without waiting for daily evolution, which enables unsupervised, real-time file modification. This is riskier than scheduled updates because it removes both timing friction and review opportunities, allowing a malformed or adversarially influenced 'urgent' item to trigger immediate self-change.

Missing User Warnings

Medium
Confidence
95% confidence
Finding
The prompt explicitly instructs an autonomous cron-driven agent to read logs, update memory files, modify skill files, and implement code fixes without requiring contemporaneous user review or approval. In a self-evolution skill, this is especially dangerous because it creates a persistent self-modifying loop where low-quality observations, prompt injection in logs, or mistaken inferences can propagate into code, skills, and long-term memory automatically.

Ssd 3

Medium
Confidence
88% confidence
Finding
The real-time capture loop broadly directs the agent to persist mistakes, learned knowledge, and important decisions or preferences into memory files after every interaction. Without sensitivity boundaries, redaction rules, or data-minimization requirements, the skill can accumulate personal, confidential, or otherwise sensitive user data into long-lived storage.

Ssd 3

Medium
Confidence
90% confidence
Finding
The daily evolution workflow creates a persistent aggregation pipeline: read daily logs, extract lessons and insights, update memory, and append summaries to logs. This increases privacy and security risk because scattered user-provided information can be consolidated into durable, higher-value records without any classification, filtering, or approval step.

Self-Modification

High
Category
Rogue Agent
Content
---
name: self-evolve
description: >
  Self-evolution system for OpenClaw agents. Enables continuous learning through
  mistake tracking, experience distillation, skill improvement queues, and
Confidence
97% confidence
Finding
self-evolve

Self-Modification

High
Category
Rogue Agent
Content
or one-off reminders (use cron directly).
---

# Self-Evolve 🧬

A self-improvement system that turns every interaction into a learning opportunity.
Three loops: **real-time capture** → **daily consolidation** → **weekly deep review**.
Confidence
97% confidence
Finding
Self-Evolve

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal