Description-Behavior Mismatch
Medium
- Confidence: 93%
- Finding: The document is framed as a testing workflow, but it also instructs the agent to implement code fixes after analyzing failures. This expands the skill from read-only validation into code modification, increasing the chance of unintended or unauthorized changes to user code under the guise of testing.
