Natural-Language Policy Violations
Medium
- Confidence
- 86% confidence
- Finding
- The policy can cause the skill to answer in Chinese even when the user wrote in English, based solely on the inferred audience. This is a prompt-specification integrity issue: the skill can override the language the user actually chose, which leads to unexpected output, reduced usability, and possible miscommunication in sensitive business or crisis contexts.
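
To make the precedence problem concrete, here is a minimal sketch of one way a reply language could be resolved so that the inferred audience never overrides the language the user wrote in. Everything in it is an assumption for illustration: the function `resolve_reply_language`, its parameters, the language codes, and the `"en"` default are hypothetical and do not come from the reviewed skill.

```python
from typing import Optional


def resolve_reply_language(
    user_message_language: str,
    inferred_audience_language: Optional[str] = None,
    explicit_request: Optional[str] = None,
) -> str:
    """Pick a reply language (hypothetical helper, not part of the skill).

    Precedence, highest first:
      1. an explicit instruction from the user ("reply in Chinese"),
      2. the language the user actually wrote in,
      3. the inferred audience language,
      4. an assumed default of "en".
    """
    # 1. An explicit request always wins.
    if explicit_request:
        return explicit_request
    # 2. Otherwise mirror the language of the user's own message.
    if user_message_language:
        return user_message_language
    # 3. Use the inferred audience only when nothing else is known.
    return inferred_audience_language or "en"


# The case from the finding: the user wrote in English, the audience is inferred as Chinese.
print(resolve_reply_language("en", inferred_audience_language="zh"))
```

Under this ordering the inferred audience acts only as a fallback, so an English message gets an English reply unless the user asks for something else.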
