Natural-Language Policy Violations
Medium
- Confidence: 95%
- Finding: The file’s operational instructions are written as direct behavior requirements in Chinese, beginning with an instruction to adopt a specific role, and the entire response template is defined only in Chinese. This effectively imposes a language/locale choice on the interaction without any user opt-in or documented justification, which matches the language-policy violation criterion.
