Vague Triggers
Medium
- Confidence
- 93% confidence
- Finding
- The skill description contains an open-ended trigger: it should also activate when a user 'appears to be seeking validation rather than genuine critique.' That subjective condition can cause the agent to invoke this harsh-response mode without explicit user consent, increasing the chance of inappropriate tone, misclassification of intent, and harmful or policy-inconsistent responses in sensitive contexts. In context, this skill is not directly enabling code execution or data exfiltration, but the ambiguous routing still creates a real safety and UX risk because it broadens activation beyond clear user opt-in.
