Huo15 Plan Mode

Security checks across malware telemetry and agentic risk

Overview

This skill is not malicious, but it promises a dangerous-command confirmation safety layer that the artifacts do not actually enforce.

Install only if you treat this as an advisory helper, not a complete safety boundary. Do not rely on it as the sole protection against destructive commands until confirmation storage, reply handling, and gated execution are implemented and tested. Review the HEARTBEAT.md changes and the eval-based integration wrapper before enabling it in a real workspace.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Findings (5)

Lp3

Medium
Category
MCP Least Privilege
Confidence
93% confidence
Finding
The skill documents shell-capable scripts and executable workflow steps, but the metadata declares no permissions or capability boundaries. In an agent ecosystem, this can cause the skill to be installed or trusted without users understanding it can invoke shell actions, including destructive commands after confirmation. The risk is increased because the skill explicitly brokers dangerous operations such as deletion, force-push, and process termination.

Description-Behavior Mismatch

Medium
Confidence
97% confidence
Finding
The script advertises a safety control for dangerous operations, but the main dispatcher only classifies commands and prints confirmation text; it never blocks or mediates actual command execution. In an agent setting, this can create a false sense of protection, leading integrators or users to rely on it as an enforcement mechanism when dangerous commands may still run elsewhere without verified approval.

Intent-Code Divergence

Medium
Confidence
94% confidence
Finding
The documentation claims the script can execute or cancel operations and shows an `execute` mode, but no such action exists in the code. This mismatch is dangerous because downstream agents may assume the script provides an enforcement layer that it does not actually implement, undermining safety guarantees around destructive commands.

Natural-Language Policy Violations

Medium
Confidence
78% confidence
Finding
The confirmation flow requires replies of '是' or '否' without offering language flexibility or equivalent accepted responses. In safety-critical confirmation logic, restricting confirmation to one language can lead to misunderstandings, failed cancellations, or accidental execution if users respond naturally in another language and the parser misinterprets or retries unsafely.

Autonomous Decision Making

Medium
Category
Excessive Agency
Content
| 配置 | 说明 |
|------|------|
| `mode` | strict=严格, normal=普通 |
| `autoConfirmLowRisk` | 低危操作自动执行 |
| `notifyOnCancel` | 取消时通知用户 |

---
Confidence
89% confidence
Finding
autoConfirm

VirusTotal

VirusTotal findings are pending for this skill version.

View on VirusTotal