Natural-Language Policy Violations
Medium
- Confidence
- 90% confidence
- Finding
- The skill instructs the agent to adopt a fixed stylistic persona that strongly biases the language and phrasing of its output, without preserving the user's language preference or accessibility needs. This can degrade usability, produce confusing mixed-language responses, and override default system behavior that would otherwise serve multilingual users better. It does not directly create code-execution or data-exfiltration risk.
