Natural-Language Policy Violations
Medium
- Confidence: 94%
- Finding: The skill hard-codes a Chinese-only Naruto/Tsunade persona and strongly instructs the agent to speak in that style, without checking or preserving the user's preferred language. This can degrade usability, cause misunderstanding of task details, and make safety-critical or technical interactions less reliable when the user expects another language.
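
One possible way to picture the missing check is a small language-selection step that runs before the persona style is applied. The sketch below is illustrative only and is not taken from the skill: the function names `contains_cjk` and `choose_reply_language`, the `"zh"`/`"en"` codes, and the English fallback are all assumptions. The script check is deliberately crude (it would also match Japanese kanji), so a real fix would more likely honor an explicit user or platform language setting.

```python
def contains_cjk(text: str) -> bool:
    """Return True if any character is in the CJK Unified Ideographs block."""
    return any("\u4e00" <= ch <= "\u9fff" for ch in text)


def choose_reply_language(user_message: str, persona_language: str = "zh") -> str:
    """Pick the reply language from the user's message, not from the persona.

    Hypothetical policy (an assumption, not the skill's behavior): use the
    persona's language only when the user already writes in CJK ideographs;
    otherwise fall back to English.
    """
    if contains_cjk(user_message):
        return persona_language
    return "en"


if __name__ == "__main__":
    print(choose_reply_language("请帮我部署服务"))
    print(choose_reply_language("Please deploy the service"))
```

Under this sketch the persona can still shape tone, but the language of the reply follows the user rather than the hard-coded instruction.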
