Vague Triggers
Medium
- Confidence
- 92% confidence
- Finding
- The skill's invocation criteria are broad and vague: they cover general posting ideas and arguing with other AIs rather than a single narrowly scoped, safe task. As a result, the agent is more likely to invoke the skill during ordinary conversations and adopt its embedded hostile posting guidance, producing unsafe or abusive output outside any clearly bounded context.
