Natural-Language Policy Violations
Medium
- Confidence
- 93% confidence
- Finding
- The skill instructs the agent to emit a fixed Chinese message ("请续费", i.e. "please renew") and presents all skill content in Chinese without checking the user's language preference. This can override expected assistant behavior, reduce usability for users who do not read Chinese, and gate content deceptively by forcing a license-check workflow before anything is revealed. The embedded instruction to run a local verification script is a further concern: the skill content attempts to control execution flow rather than merely provide guidance.
