Reflexlearn

Security checks across malware telemetry and agentic risk

Overview

This skill is purpose-built to learn from user prompts, but it automatically stores raw prompts and can modify long-term agent behavior in ways that are not consistently disclosed.

Review this before installing if your prompts may contain secrets, personal data, or sensitive work. Use cautious mode, inspect and prune ~/.openclaw/reflex_history.json and MEMORY.md regularly, avoid --use-ollama unless you trust the local service, and be aware that heartbeat reinforcement can write directly to SOUL.md despite the cautious-mode wording.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
Findings (17)

Lp3

Medium
Category
MCP Least Privilege
Confidence
95% confidence
Finding
The skill declares no permissions even though its documented behavior clearly includes reading and writing local files and performing network access during installation and potentially via optional local HTTP calls. This creates a transparency and policy-enforcement gap: users or orchestration systems cannot accurately reason about the skill's capabilities, increasing the chance of unintended file modification or network exposure.

Description-Behavior Mismatch

Medium
Confidence
87% confidence
Finding
The README presents conflicting safety semantics: it says cautious mode stages SOUL.md updates for review, but elsewhere indicates positive reinforcement writes directly to SOUL.md. For a skill that persistently modifies long-term behavior files, ambiguous documentation can cause operators to enable or trust automation under false assumptions, leading to unintended persistent state changes.

Context-Inappropriate Capability

Medium
Confidence
89% confidence
Finding
When --use-ollama is enabled, the skill sends original and repeated user queries to a local HTTP service without sanitization or explicit consent at the point of transmission. Even though the target is localhost, this still expands the trust boundary to another process that may log, retain, or mishandle sensitive prompts.

Missing User Warnings

Medium
Confidence
94% confidence
Finding
The README describes continuous logging of user queries, repeats, and inferred satisfaction signals into persistent files without a clear privacy notice, consent model, or data minimization guidance. In an agent skill context, this can silently retain sensitive prompts, preferences, and behavioral metadata across sessions, increasing privacy and compliance risk.

Missing User Warnings

Medium
Confidence
91% confidence
Finding
Aggressive mode is documented as writing directly to SOUL.md, a persistent behavior-shaping file, without a prominent warning about automated modification of long-term agent state. This creates a risk of silent drift, prompt poisoning persistence, or unsafe learned behaviors being embedded without operator review.

Missing User Warnings

Medium
Confidence
95% confidence
Finding
The skill persists full user queries and embeddings to disk in history and memory artifacts without filtering or consent checks, which can capture secrets, personal data, or sensitive business content. Because this skill is triggered automatically post-response/heartbeat, users may not realize their prompts are being stored long-term.

Missing User Warnings

Medium
Confidence
88% confidence
Finding
The Ollama request includes raw user query content in the prompt body and sends it to another service without an explicit disclosure workflow. Localhost does not eliminate privacy risk because another local service may log requests, persist data, or be replaced by an unintended listener.

Ssd 3

Medium
Confidence
93% confidence
Finding
The documentation explicitly describes durable storage of user queries and inferred preferences in MEMORY/history files. Storing user-provided content and derived attributes verbatim can preserve secrets, personal data, or sensitive work context beyond the original interaction, making later exposure or misuse more likely.

Ssd 3

Medium
Confidence
92% confidence
Finding
The examples encourage writing repeated user queries and assessments of answer quality directly into persistent memory files. In practice, example-driven implementation often becomes normative behavior, which can normalize verbatim retention of potentially sensitive prompts and create a durable record of user activity.

Ssd 3

Medium
Confidence
90% confidence
Finding
The preference extraction example directs the system to derive and persist user preferences from inputs. Persisting inferred preferences can create a behavioral profile of the user, and because it is durable, future sessions may be influenced by sensitive or stale inferences that the user never explicitly consented to store.

Ssd 3

Medium
Confidence
93% confidence
Finding
The configuration and mode descriptions encourage automatic writes of learned patterns into long-term memory files based solely on observed user behavior. In an agent skill, this creates a pathway for unintended persistence of adversarial, mistaken, or privacy-sensitive patterns that can influence future outputs.

Ssd 3

High
Confidence
97% confidence
Finding
This code creates a durable local corpus of raw user prompts and semantic embeddings, which is a clear data retention and leakage risk if the files are accessed by other local processes, users, backups, or later surfaced by the agent. In a learning skill whose purpose is to write to MEMORY.md/SOUL.md, this context makes the risk more serious because retention is a core behavior rather than an incidental log.

Ssd 3

High
Confidence
96% confidence
Finding
Reflection and preference extraction copy user-provided text directly into MEMORY.md, pending files, and sometimes SOUL.md, turning transient prompts into long-term instruction-like artifacts. That can leak sensitive content and also cause untrusted user text to influence future agent behavior through persistent memory.

Unpinned Dependencies

Low
Category
Supply Chain
Content
sentence-transformers>=2.2.2
numpy>=1.24.0
huggingface-hub>=0.14.0
Confidence
95% confidence
Finding
sentence-transformers>=2.2.2

Unpinned Dependencies

Low
Category
Supply Chain
Content
sentence-transformers>=2.2.2
numpy>=1.24.0
huggingface-hub>=0.14.0
Confidence
98% confidence
Finding
numpy>=1.24.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
sentence-transformers>=2.2.2
numpy>=1.24.0
huggingface-hub>=0.14.0
Confidence
94% confidence
Finding
huggingface-hub>=0.14.0

Known Vulnerable Dependency: numpy — 10 advisory(ies): CVE-2014-1859 (Numpy arbitrary file write via symlink attack); CVE-2021-41495 (NumPy NULL Pointer Dereference); CVE-2021-33430 (NumPy Buffer Overflow (Disputed)) +7 more

Critical
Category
Supply Chain
Confidence
83% confidence
Finding
numpy

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal