OpenClaw Shield

Security checks across malware telemetry and agentic risk

Overview

This is a security guardrail skill, but its own instructions make the guardrail easy to bypass and rely on a missing or mismatched runtime checker.

Review before installing. Do not append the SOUL.md or AGENTS.md sections until you verify the actual shield.py implementation, fix the skill path mismatch, decide whether failures should block or require explicit owner approval, and understand where audit logs and redacted sensitive outputs are stored.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
  • Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Findings (3)

Intent-Code Divergence

High
Confidence
98% confidence
Finding
The README declares the Shield rules as non-bypassable, but then explicitly instructs agents to ignore Shield errors and to skip all checks if a `.shield_disabled` file exists. In a security guardrail skill, this creates a fail-open design and a trivial bypass mechanism that can disable all protections exactly when they are most needed.

Intent-Code Divergence

High
Confidence
98% confidence
Finding
The document says Shield must run before command, file, network, and output operations, yet the exemption rules allow skipping checks for the shield directory itself, ignoring execution errors, and bypassing all checks via a disable file. These contradictions undermine the entire security model and make the protection easy to evade during exploitation.

Ssd 4

Medium
Confidence
93% confidence
Finding
The staged integration guidance normalizes bypasses and failure-open behavior while presenting the shield as a mandatory safeguard. This weakens operator expectations and encourages deployments where attackers or misconfigurations can disable checks without stopping protected actions.

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal