Description-Behavior Mismatch
High
- Confidence
- 95% confidence
- Finding
- The skill explicitly presents itself as read-only, but then instructs the agent to fix artifacts and re-validate them. This mismatch can cause an agent or operator to grant broader write authority than expected, leading to unintended modification of reviewed content and weakening trust in the skill’s safety boundaries.
