Natural-Language Policy Violations
Medium
- Confidence
- 92% confidence
- Finding
- The skill hard-codes a distinct public-figure persona and communication style without any requirement that the user request or consent to that style. This can cause the agent to override user preferences, produce inappropriate tone, and generate impersonation-like responses that may be misleading in contexts requiring neutrality or professionalism.
