Metaclaw

Security checks across malware telemetry and agentic risk

Overview

MetaClaw is a coherent local memory-management skill, but it should be used carefully because it can read and overwrite local memory files and can send session content to a local Ollama service.

Install only if you want a local OpenClaw memory system that can read and update your workspace memory. Start with --dry-run and auto-fix preview modes, review diffs before using --allow-write, confirm your local Ollama service is trusted, and do not add the cron or heartbeat hooks unless you are comfortable with unattended memory maintenance.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Rogue AgentSelf-Modification, Session Persistence
Behavioral ASTexec() Call, eval() Call, Dynamic Import
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration

Findings (15)

os.system() or os exec-family call

High

Category: Dangerous Code Execution
Content: # Try LCM via openclaw command (if available) try: result = os.popen("lcm_describe 2>/dev/null").read() if result: return result except Exception as e:
Confidence: 90% confidence
Finding: result = os.popen("lcm_describe 2>/dev/null").read()

Lp3

Medium

Category: MCP Least Privilege
Confidence: 95% confidence
Finding: The skill documentation describes capabilities that read and write local files, invoke shell commands, install dependencies, and use network-capable components, yet no explicit permissions or safety boundaries are declared. This is dangerous because operators may treat the skill as lower-risk than it is and execute commands that modify memory data, install packages, or access external/local services without informed consent.

Tp4

High

Category: MCP Tool Poisoning
Confidence: 92% confidence
Finding: The declared description presents the skill as generic memory management and evolution, but the content also includes static-analysis actions, auto-fix that can rewrite skill files, dependency installation, and references to external local LLM-backed processing. This mismatch increases the chance that a user will invoke broader and riskier behavior than expected, including self-modification and writes to user memory or skill assets.

Context-Inappropriate Capability

Medium

Confidence: 84% confidence
Finding: The script's stated job is extracting memory from session content, but it also executes an external command to retrieve content. That expands attack surface and can pull data from an unexpected source or execute a malicious helper if the environment is tampered with, making the behavior riskier than simple file parsing. In a skill handling conversational memory, this can expose or ingest unintended data.

Description-Behavior Mismatch

Medium

Confidence: 95% confidence
Finding: The script includes an auto-fix path that can modify skill files, which expands its capability from evaluation into write operations. Even though writes are gated behind --allow-write, this still creates a potentially dangerous surface because an evaluation tool may be invoked in automation or by users who do not fully understand that it can alter repository contents.

Context-Inappropriate Capability

Medium

Confidence: 97% confidence
Finding: The code deletes old backup files and empty files under references/, which is a destructive capability not necessary for a pure evaluator. In a skill-management context, this is more dangerous because repositories may contain intentionally empty sentinel files or backups needed for recovery, so invoking the tool with write permissions can cause silent data loss.

Missing User Warnings

Medium

Confidence: 92% confidence
Finding: The README advertises automatic memory extraction and consolidation that can update persistent memory files, but it does not clearly warn users that stored data may be modified automatically. This increases the risk of unintended data alteration, overwriting, or privacy-impacting persistence because users may run the feature assuming it is read-only.

Missing User Warnings

Medium

Confidence: 97% confidence
Finding: The README includes an auto-fix command with `--allow-write` that can modify skill files, but it does not prominently warn that execution changes code or configuration on disk. Users may invoke it expecting analysis only, creating risk of unintended self-modification, corruption of skill contents, or supply-chain style abuse if the fixer behavior is unsafe.

Missing User Warnings

Medium

Confidence: 90% confidence
Finding: The skill advertises automatic memory extraction into files like preferences.md and project-state.md and automatic consolidation, but it does not clearly warn that user data will be created, altered, or merged. In a memory-management context this is especially sensitive because silent writes can corrupt user state, introduce inaccurate summaries, or overwrite manually curated information.

Missing User Warnings

Medium

Confidence: 93% confidence
Finding: The cron-based auto-dream consolidation schedules unattended execution of a script that may modify memory files, but the documentation does not clearly warn about automatic state changes during scheduled runs. Unattended writes increase risk because modifications can occur outside user review, making corruption or undesired summarization harder to detect and revert.

Missing User Warnings

Medium

Confidence: 88% confidence
Finding: The script transmits full session content to the Ollama HTTP API for processing without any explicit user warning or consent mechanism. Even though the endpoint is localhost, conversational memory may contain secrets or personal data, and local model servers are still separate processing components that can log, retain, or expose submitted content. In a memory-management skill, this context increases sensitivity because the data is explicitly user/session memory.

Self-Modification

High

Category: Rogue Agent
Content: python3 scripts/skill-eval.py --layer1 skills/metaclaw/ # Auto-Fix (mit --allow-write zum Schreiben) python3 scripts/skill-eval.py --auto-fix --allow-write skills/metaclaw/ ``` ### Output
Confidence: 96% confidence
Finding: write skill

Self-Modification

High

Category: Rogue Agent
Content: python3 scripts/skill-eval.py --auto-fix skills/metaclaw/ # Auto-Fix (mit Schreiben) python3 scripts/skill-eval.py --auto-fix --allow-write skills/metaclaw/ ``` ### Quality Targets
Confidence: 96% confidence
Finding: write skill

Session Persistence

Medium

Category: Rogue Agent
Content: ### Cron-Job (Optional) ```bash # crontab -e # Tägliche Auto-Dream Consolidation (03:00) 0 3 * * * "$HOME/.openclaw/workspace/scripts/memory-digest-enhanced.sh" --auto-dream ```
Confidence: 91% confidence
Finding: crontab -e

Session Persistence

Medium

Category: Rogue Agent
Content: ### Cron Integration ```bash # crontab -e # Auto-Dream tägliche Consolidation (Optional) 0 3 * * * "$HOME/.openclaw/workspace/scripts/memory-digest-enhanced.sh" --auto-dream ```
Confidence: 91% confidence
Finding: crontab -e

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal