Intent-Code Divergence
Medium
- Confidence
- 90% confidence
- Finding
- The skill is presented as a real-world conversation rehearsal tool, but the usage scenarios pivot into tabletop RPG NPC and villain dialogue generation. This mismatch can cause the agent to activate in unintended contexts, weakening safety controls and making downstream behavior less predictable, especially where the skill claims to avoid manipulation but includes adversarial-style dialogue generation examples.
