Natural-Language Policy Violations
Medium
- Confidence
- 97% confidence
- Finding
- The skill explicitly advertises generating hostile, insulting 'internet troll' style content designed to make targets feel like they are being 'told off.' That creates a direct abuse pathway for harassment, bullying, and emotionally harmful output, especially because the metadata contains no consent requirement, audience restriction, or safety constraints limiting protected classes, threats, or targeted abuse.
