Description-Behavior Mismatch
High
- Confidence
- 98% confidence
- Finding
- The skill documents and encourages state-changing commands such as subtask creation, claiming, submission, review, and parent task submission, but those operations are not present in the manifest's allowed-tools list. This creates a dangerous mismatch between what the model is instructed to do and what the runtime policy actually permits, increasing the chance of policy drift, unsafe fallbacks, or future accidental enablement of destructive actions without proper review.
