Natural-Language Policy Violations
Medium
- Confidence: 91%
- Finding
- The skill is written and framed entirely in Chinese, and its trigger phrases are predominantly Chinese, with no explicit statement that the user may choose another language. This can cause unintended language switching or exclude users who invoke the skill in another language. That is a genuine safety and quality concern, though not a classic security exploit. The risk is lower in this context because the skill is a persona/style skill and does not handle sensitive actions or data.
