Description-Behavior Mismatch
Medium
- Confidence
- 93% confidence
- Finding
- The skill claims to be a command generator but also instructs the agent to generate and locally execute a bash pre-flight script. Executing dynamically generated shell, even for validation, expands the trust boundary and creates a code-execution surface on the agent host if user-controlled values are embedded unsafely.
