OpenClaw Safety Guard

Security checks across malware telemetry and agentic risk

Overview

This skill is not clearly malicious, but it needs review because a safety tool is packaged with mismatched metadata and unclear third-party data handling.

Review before installing. Verify the publisher and actual CLI implementation, resolve the summarize metadata mismatch, and avoid sensitive inputs unless you are comfortable with configured model and extraction providers receiving the content.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands

Findings (6)

Description-Behavior Mismatch

High

Confidence: 95% confidence
Finding: The skill is presented as a pre-LLM safety guard for analyzing user input, but the documented interface actually fetches and processes remote URLs, PDFs, and YouTube content. That scope expansion materially changes the trust boundary: instead of only filtering local user text, it can exfiltrate content to external models and services and ingest attacker-controlled remote content, which is inconsistent with the declared security role.

Context-Inappropriate Capability

Medium

Confidence: 91% confidence
Finding: Advertising Firecrawl and Apify-backed extraction/crawling is not justified by a narrow pre-LLM safety-guard purpose and introduces additional third-party data exposure and remote content ingestion pathways. In a security guard context, these integrations increase attack surface and can cause sensitive prompts or referenced content to be transmitted externally under the guise of safety checking.

Context-Inappropriate Capability

Medium

Confidence: 90% confidence
Finding: Requiring multiple LLM provider API keys and supporting arbitrary model selection is inconsistent with a minimal guard/filter role and implies user content may be routed to external model vendors. In the context of a security guard, that mismatch is dangerous because users may assume local or tightly scoped filtering while their data is actually sent to third parties.

Description-Behavior Mismatch

Medium

Confidence: 94% confidence
Finding: The metadata slug is 'summarize' while the skill is described as a safety/security guard, creating a material identity mismatch. This can cause the wrong skill to be invoked, bypass user/admin expectations, and weaken security review or policy enforcement that relies on metadata consistency.

Vague Triggers

Medium

Confidence: 82% confidence
Finding: The description uses broad wording about analyzing harmful content and security threats without defining concrete invocation boundaries or limitations. For a security-sensitive skill, ambiguous trigger conditions can cause overbroad activation, unnecessary inspection of user data, and misuse in contexts beyond the intended guardrail function.

Missing User Warnings

Medium

Confidence: 94% confidence
Finding: The documentation lists multiple API keys, model providers, and optional external services but does not warn that user-supplied content, fetched documents, or extracted media may be transmitted to third parties. In a tool framed as a safety guard, this omission is especially risky because users may trust it with sensitive content they would not knowingly send to external vendors.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal