Description-Behavior Mismatch
Medium
- Confidence
- 94% confidence
- Finding
- The skill is supposed to track in-session work progress, but this section expands into PR review/merge verification, deployment checks, bot-comment parsing, and shell-hook/rate-limit enforcement. That scope creep gives a low-privilege task-tracking skill authority to drive operational decisions and persistent local state changes unrelated to WIP bookkeeping, increasing the chance of unintended external actions and unsafe automation.
