Agent Safety

Security checks across malware telemetry and agentic risk

Overview

This is a disclosed local safety-scanning skill with a Git hook and health-check script; it has workflow and privacy considerations but no artifact-backed malicious behavior.

Install this only if you want local shell-based scanning and a repository pre-commit hook that can block commits. Run scans only on files or repos you intend to publish, check whether an existing pre-commit hook would be overwritten, and treat health-check output as local security/workspace information that should not be shared casually.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands

Findings (6)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 87% confidence
Finding: The skill invokes shell scripts but does not declare any permissions, which creates a transparency and trust problem for users and any policy engine relying on declared capabilities. In a security-sensitive skill, hidden or undeclared shell execution increases the risk of unexpected filesystem, git, and system-inspection actions being run without adequate review.

Tp4

High

Category: MCP Tool Poisoning
Confidence: 95% confidence
Finding: The documented purpose is outbound publication safety, but the skill also advertises broader host inspection and system health monitoring, including workspace inspection, update queries, and security posture checks. That mismatch is dangerous because users may grant trust for a narrow scanning task while the skill performs materially broader reconnaissance on the local machine, increasing privacy exposure and expanding the attack surface.

Description-Behavior Mismatch

Medium

Confidence: 93% confidence
Finding: This section performs functionality outside the skill’s declared purpose by checking the current AI model and querying npm for the latest OpenClaw version. The npm lookup introduces outbound network activity and environmental discovery in a script presented as a local health check for an outbound-safety skill, which expands trust and data-exposure boundaries beyond what users would reasonably expect.

Description-Behavior Mismatch

Medium

Confidence: 91% confidence
Finding: This block inventories OS patch status, firewall state, and SIP status, which exceeds the manifest’s stated role of scanning agent output and blocking unsafe git commits. Even though the checks are local, they collect sensitive host-security posture information that could later be exposed through logs, agent output, or misuse in a broader workflow.

Context-Inappropriate Capability

Medium

Confidence: 96% confidence
Finding: The command `npm view openclaw version` performs a network-based registry lookup that is not necessary for the skill’s advertised git-level secret scanning and outbound safety purpose. Unexpected network access from a security-related helper script is dangerous because it can disclose metadata about the environment and violates least-privilege and user-expectation boundaries.

Missing User Warnings

Low

Confidence: 84% confidence
Finding: The network version check lacks any user-facing notice that running the health check will contact an external service. While the data sent may be limited, undisclosed outbound connections from a tool marketed around safety can erode trust and create privacy or policy-compliance issues in restricted environments.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal