Description-Behavior Mismatch
Medium
- Confidence
- 96% confidence
- Finding
- The skill explicitly states it will run arbitrary shell commands from improve.md and inherit whatever credentials are present, which creates a real command-execution and privilege-exposure risk. Even though this is disclosed as a prerequisite/security note, the surrounding skill positions itself as a bounded optimization tool, so users may underestimate that checks and scoring commands can reach external systems, mutate infrastructure, or exfiltrate secrets.
