Description-Behavior Mismatch
Medium
- Confidence
- 95% confidence
- Finding
- The skill is described as a keyword-based tool lookup system, but its mapping exposes destructive, state-changing, and remote-capable commands such as rm, kill, git push, network downloads, configuration changes, and backup restore. In an agent setting, broad lookup metadata can function as action-selection guidance, so including these commands materially increases the chance of unsafe or unintended execution beyond the stated purpose.
