Natural-Language Policy Violations
Medium
- Confidence
- 93% confidence
- Finding
- The skill unconditionally instructs the agent to always respond in Chinese, which overrides user language preference and can create a policy/behavior mismatch at runtime. While not directly enabling code execution or data exfiltration, it reduces user control and can be abused as a coercive instruction pattern inside reusable skills.
