Description-Behavior Mismatch
High
- Confidence
- 95% confidence
- Finding
- A superlative subjective statement ('Best soundstage ever.') exceeds the stated objective, evidence-grounded scope of the skill and is especially risky because it is presented in a structured evaluation artifact. Even if not directly harmful like code execution, it can mislead users into overtrusting exaggerated, non-falsifiable conclusions as if they were rigorously derived from evidence.
