Description-Behavior Mismatch
Medium
- Confidence
- 94% confidence
- Finding
- The skill is framed as an evaluator, but it also instructs itself to edit another skill and rerun it. That expands it from analysis into modification of trusted system behavior, creating an unnecessary write-capable path that could be used to change prompts or capabilities without explicit, narrowly scoped approval.
