Natural-Language Policy Violations
Medium
- Confidence: 84%
- Finding: The skill assigns default writing styles based on geography (e.g., 'India' vs 'US') without requiring explicit user preference or consent. This can steer outputs using demographic proxies, leading to stereotyping, reduced user agency, and potentially inappropriate or exclusionary content recommendations.
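
The pattern described in the finding can be illustrated with a minimal sketch. This is not the skill's actual code: `UserContext`, `style_from_region`, `style_from_preference`, and the style names are all hypothetical and chosen only for illustration. The first function infers a default from geography, which is the behavior flagged above; the second changes the default only when the user has stated an explicit preference.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical locale-independent default; the real skill may use another name.
NEUTRAL_STYLE = "neutral"


@dataclass
class UserContext:
    region: Optional[str] = None           # e.g. "India" or "US"; a demographic proxy
    preferred_style: Optional[str] = None  # set only by an explicit user choice


def style_from_region(ctx: UserContext) -> str:
    """Flagged pattern: the default style is inferred from geography."""
    # Placeholder style names, not taken from the skill.
    region_defaults = {"India": "style_a", "US": "style_b"}
    return region_defaults.get(ctx.region, NEUTRAL_STYLE)


def style_from_preference(ctx: UserContext) -> str:
    """Alternative: only an explicit user preference changes the default."""
    if ctx.preferred_style is not None:
        return ctx.preferred_style
    return NEUTRAL_STYLE  # region is deliberately ignored
```

In this sketch, two users who differ only in `region` receive the same default from `style_from_preference`, and the style changes only after the user sets `preferred_style`.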
