Clawhub Skill Deploy Pilot

Security checks across malware telemetry and agentic risk

Overview

This deployment skill mostly fits its purpose, but it advertises important safety controls that are missing while allowing high-impact Docker, SSH, and shell-command execution.

Install only if you will treat every stack definition, health check, and hook as trusted code. Do not rely on the advertised approval, rollback, blue-green, LXC, cron, or history features without verifying or adding them, and run the tool with the least Docker, SSH, and Proxmox privileges needed.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
Behavioral ASTexec() Call, eval() Call, Dynamic Import
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands

Findings (10)

subprocess module call

Medium

Category: Dangerous Code Execution
Content: command = check.get("command", "echo ok") try: result = subprocess.run( ["ssh", target, command], capture_output=True, text=True,
Confidence: 88% confidence
Finding: result = subprocess.run( ["ssh", target, command], capture_output=True, text=True, timeout=30 )

subprocess module call

Medium

Category: Dangerous Code Execution
Content: script = check.get("script", "") try: result = subprocess.run( script, shell=True, capture_output=True,
Confidence: 99% confidence
Finding: result = subprocess.run( script, shell=True, capture_output=True, text=True, timeout=check.get("timeout", 30

subprocess module call

Medium

Category: Dangerous Code Execution
Content: for hook in stack.get("hooks", {}).get("pre", []): print(f"{Colors.status_wait()} Running pre-hook: {hook}") try: subprocess.run(hook, shell=True, check=True) print(f"{Colors.status_ok()} Pre-hook completed") except subprocess.CalledProcessError as e: print(f"{Colors.status_fail()} Pre-hook failed: {e}")
Confidence: 99% confidence
Finding: subprocess.run(hook, shell=True, check=True)

subprocess module call

Medium

Category: Dangerous Code Execution
Content: for hook in stack.get("hooks", {}).get("post", []): print(f"{Colors.status_wait()} Running post-hook: {hook}") try: subprocess.run(hook, shell=True, check=True) print(f"{Colors.status_ok()} Post-hook completed") except subprocess.CalledProcessError as e: print(f"{Colors.status_fail()} Post-hook failed: {e}")
Confidence: 99% confidence
Finding: subprocess.run(hook, shell=True, check=True)

Intent-Code Divergence

Medium

Confidence: 86% confidence
Finding: The docstring advertises approval workflows and zero-downtime controls, but the implementation contains no approval enforcement and exposes options to bypass checks. This mismatch can create a false sense of security, leading operators to use the tool in sensitive production contexts without the governance controls they believe are present.

Intent-Code Divergence

Medium

Confidence: 87% confidence
Finding: The tool claims rollback support, but rollback is effectively a stub and auto-rollback only stops containers rather than restoring a prior known-good state. In production deployment tooling, this can worsen outages and recovery time because operators may rely on rollback behavior that does not actually exist.

Missing User Warnings

Medium

Confidence: 95% confidence
Finding: The skill explicitly documents `--skip-health-check` and `--force` as normal deployment options in an infrastructure automation tool, but does not place strong inline warnings at the point of use about bypassing approval and safety validation. In this context, those flags can directly enable unsafe or unauthorized production changes, increasing the likelihood of outages, bad deploys, or policy bypass by users following examples verbatim.

Missing User Warnings

Medium

Confidence: 95% confidence
Finding: The custom script health-check feature silently permits arbitrary shell command execution without prominently disclosing that risk. In this context, a 'health check' sounds harmless, which makes the feature more dangerous because users may import or trust configurations that actually contain executable payloads.

Missing User Warnings

Medium

Confidence: 95% confidence
Finding: Pre- and post-deploy hooks execute shell commands but are not clearly disclosed as arbitrary code execution mechanisms. In deployment tooling, hidden execution surfaces are especially risky because imported stack definitions may appear declarative while actually running attacker-controlled commands.

Vague Triggers

Medium

Confidence: 84% confidence
Finding: The skill metadata uses broad, generic trigger terms such as "deploy" and "automation" without any apparent scoping or activation constraints. In an agent ecosystem, this can cause the skill to activate during routine DevOps conversations and expose high-impact capabilities like Docker, SSH, or Proxmox execution when the user did not explicitly intend to invoke deployment automation.

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal