OpenClaw Benchmark

Security checks across malware telemetry and agentic risk

Overview

This is a coherent OpenClaw benchmark skill that runs bounded tests and writes local benchmark reports, with no evidence of deception, exfiltration, or destructive behavior.

Install only if you are comfortable with a benchmark that runs simple shell commands, performs a couple of network-based tests, and saves local reports and baseline JSON files. Review or delete ~/Downloads/OpenClaw-Benchmark and /tmp/bench_metrics.json if you do not want benchmark history left on disk.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Behavioral ASTexec() Call, eval() Call, Dynamic Import
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code

Findings (3)

Missing User Warnings

Medium

Confidence: 91% confidence
Finding: The skill explicitly instructs use of `exec` to create and delete a local file under `/tmp` without any user-facing warning or consent checkpoint. Although the commands are limited and appear benchmark-related, they still modify the local system and normalize shell execution inside a skill, which can surprise users and create risk if the pattern is copied or later expanded.

Missing User Warnings

Medium

Confidence: 93% confidence
Finding: The skill directs writing metrics to `/tmp/bench_metrics.json` and generating HTML reports in the user's Downloads directory, causing persistent filesystem changes without prominent disclosure. Even if the data is operational rather than secret, writing benchmark artifacts to disk can expose environment details, create unwanted residue, and violate user expectations for a read-only benchmarking workflow.

Missing User Warnings

Low

Confidence: 89% confidence
Finding: The baseline management step copies benchmark JSON into `~/Downloads/OpenClaw-Benchmark/baselines/`, creating durable run-history files without a user warning. This is lower severity than arbitrary command execution, but it still persists system and model metadata on disk and may unintentionally accumulate sensitive environment information over time.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal