Natural-Language Policy Violations
Medium
- Confidence
- 95% confidence
- Finding
- The skill explicitly instructs the agent to reject a particular cultural/locale perspective and replace it with a mandated 'internationalized' standard, without checking whether the user wants that framing. This can bias outputs, alienate users, and cause the agent to impose value judgments tied to culture or identity rather than responding neutrally to user intent.
