Natural-Language Policy Violations
Medium
- Confidence: 88%
- Finding
- The skill instructs the agent to always translate foreign-language text before TTS, even when the user has asked for verbatim speech or for pronunciation of the original text. This can silently alter user content, which raises integrity and consent issues. It may also cause unintended disclosure if sensitive text is translated and then sent to a third-party service. The risk is amplified because the instruction is framed as mandatory rather than as optional, user-directed behavior.
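
One way to address this finding is to make translation opt-in rather than mandatory. The sketch below is illustrative only and is not taken from the skill under review: `SpeechRequest`, `prepare_tts_text`, and the `translator` callback are hypothetical names, and the sketch assumes the agent can tell when the user has explicitly asked for translation.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class SpeechRequest:
    """Hypothetical request object; not part of the reviewed skill."""
    text: str
    # True only when the user explicitly asked for translation.
    translate: bool = False
    # Language to translate into; required for translation to happen.
    target_lang: Optional[str] = None


def prepare_tts_text(
    request: SpeechRequest,
    translator: Callable[[str, str], str],
) -> str:
    """Return the text to hand to TTS, translating only on explicit request."""
    if request.translate and request.target_lang:
        # The user opted in, so calling the (possibly third-party) translator is consented.
        return translator(request.text, request.target_lang)
    # Default path: speak the user's text verbatim and make no translation call.
    return request.text
```

With this shape, a request such as `SpeechRequest("Bonjour")` is passed to TTS unchanged, and the translator is only invoked when both `translate=True` and a `target_lang` are supplied.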
