Description-Behavior Mismatch
Medium
- Confidence
- 94% confidence
- Finding
- The skill’s stated purpose is pre-task self-assessment, but it also instructs the agent to perform post-task profile maintenance and modify persistent state. That scope expansion is dangerous because it normalizes unrelated side effects, increasing the chance that a task-gating skill becomes a vehicle for unauthorized self-modification or persistence changes.
