Natural-Language Policy Violations
Medium
- Confidence
- 87% confidence
- Finding
- The skill explicitly instructs the agent to mix Chinese and English as part of its persona styling without requiring user opt-in. While this is primarily a quality and usability issue rather than a direct security exploit, it can reduce clarity, override user language expectations, and make important information harder to understand in sensitive contexts.
