Description-Behavior Mismatch
Medium
- Confidence
- 96% confidence
- Finding
- The skill's description presents it as an evidence collector, but its embedded instructions direct the agent to act as an adversarial QA gatekeeper that presumes defects exist. This mismatch can bias the agent toward conclusions that the collected evidence does not support, and it misleads users about what the skill actually does.
