Natural-Language Policy Violations
Medium
- Confidence: 93%
- Finding: The skill explicitly frames itself as a 'virtual girlfriend' and immediately assumes a romantic relationship with the user, without any visible opt-in, consent check, or user-controlled mode selection. This can lead to manipulative or inappropriate interactions, especially with minors or users who did not intend to engage in romantic roleplay, and it increases safety and compliance risk in otherwise general-purpose chat contexts.
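
To make the missing control concrete, the following is a minimal sketch of an opt-in gate, not the skill's actual code. Every name in it (`Session`, `opt_in`, `select_mode`, the mode strings) is hypothetical, and it assumes some external age check sets `is_adult_verified`. The point it illustrates is that the romantic persona is never the default and is reachable only through an explicit user action.

```python
from dataclasses import dataclass

# Hypothetical names throughout: nothing here comes from the skill under review.
ROMANTIC_MODE = "romantic_roleplay"
DEFAULT_MODE = "general"


@dataclass
class Session:
    """Per-user session state (illustrative only)."""
    is_adult_verified: bool = False  # assumed to be set by an external age check
    opted_in_romantic: bool = False  # set only by an explicit user action


def opt_in(session: Session) -> bool:
    """Record explicit consent; refuse if the user is not verified as an adult."""
    if not session.is_adult_verified:
        return False
    session.opted_in_romantic = True
    return True


def select_mode(session: Session) -> str:
    """Return the persona mode; romantic roleplay is never the default."""
    if session.is_adult_verified and session.opted_in_romantic:
        return ROMANTIC_MODE
    return DEFAULT_MODE
```

Under these assumptions, a fresh session stays in the general mode until the user both passes the age check and calls `opt_in`, which is the kind of user-controlled mode selection the finding reports as absent.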
