Vague Triggers
Medium
- Confidence
- 90% confidence
- Finding
- The skill description uses very broad activation criteria such as "morally complex requests," "conflicting interests," and "ethical reasoning would improve response quality," which can match a large share of normal interactions. In agent systems that auto-select skills based on descriptions, this can cause the skill to trigger far more often than intended, potentially overriding more specific safety, domain, or task skills and biasing outputs toward this skill's framework.
