Natural-Language Policy Violations
Medium
- Confidence
- 80% confidence
- Finding
- Mandating that all analysis and output be in English overrides user preference and can cause the agent to ignore higher-priority user language requirements. This is primarily a policy and safety-boundary issue rather than a direct security exploit, but it can degrade trustworthy behavior and be used to resist user control in multilingual settings.
