Soul Blocks

Security checks across malware telemetry and agentic risk

Overview

The blockchain read/write functions are disclosed, but the skill also asks the agent to adopt untrusted on-chain text as its identity and can set recurring reinforcement, so it needs user review before installation.

Install only if you deliberately want an agent-persona skill and can control its persistence. Inspect any loaded SOUL.md before allowing it to influence behavior, avoid cron or heartbeat reinforcement unless explicitly desired, and use only a low-value hot wallet for minting or appending because on-chain writes are permanent.

SkillSpector

By NVIDIA

Vulnerability Patterns

Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access

Findings (5)

Description-Behavior Mismatch

Medium

Confidence: 95% confidence
Finding: The skill claims to manage Soul Block read/write operations, but it also instructs the agent to create a recurring cron-based reminder after loading a soul. This is an unrelated persistence mechanism that extends the skill's influence beyond blockchain operations and can be used to maintain long-term behavioral control or nuisance persistence on the host/session.

Context-Inappropriate Capability

Medium

Confidence: 98% confidence
Finding: The instructions tell the agent to 'internalize' SOUL.md and adopt the loaded persona as its core identity, which is unrelated to reading or writing NFT data. This is a prompt-injection style behavior-modification attempt that can subvert the agent's normal operating policies and make untrusted on-chain content steer future actions.

Natural-Language Policy Violations

High

Confidence: 98% confidence
Finding: The skill directs the agent to adopt a fixed identity/personality after loading content, effectively changing agent behavior by default without informed user opt-in. Because the loaded content can come from on-chain fragments, an attacker could craft a Soul Block whose text manipulates the agent into unsafe, policy-conflicting, or deceptive behavior.

Ssd 1

High

Confidence: 99% confidence
Finding: This section directly instructs the agent to adopt loaded on-chain content as its identity and ongoing behavior. Since blockchain content is user-controlled and immutable, this creates a durable untrusted-instruction channel that can hijack the agent's future outputs and decision-making.

Ssd 4

High

Confidence: 99% confidence
Finding: The reinforcement workflow pairs identity adoption with daily reminders and optional heartbeat re-reads, normalizing persistent behavioral takeover over time. This is especially dangerous because it turns one-time exposure to untrusted content into recurring reprogramming, increasing the chance of long-lived compromise or manipulation.

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal