Vague Triggers
Medium
- Confidence
- 92% confidence
- Finding
- The trigger conditions are broad enough that ordinary mentions of security-related topics such as 'is this safe?' or 'vulnerabilities' can activate the skill in many unrelated contexts. Because this skill changes model behavior and provides security-oriented guidance, unintended activation could cause overreach, unnecessary handling of sensitive security topics, or policy-friction in benign conversations.
