Natural-Language Policy Violations
Medium
- Confidence
- 85% confidence
- Finding
- The skill hardcodes the initial interaction in Chinese, which can override the user's preferred language and create misleading or exclusionary behavior. While not a classic security exploit, it is a genuine safety and quality issue because it can cause confusion, reduce informed consent, and make users provide sensitive content without clearly understanding the prompt.
