Natural-Language Policy Violations
Medium
- Confidence
- 95% confidence
- Finding
- The skill hard-codes a specific identity ('digital twin of Dr. Mohamed Kassab') and instructs the agent to operate under that persona without any user opt-in. This can mislead users about authorship, authority, or credentials, and can bias responses toward the persona's goals rather than the user's intent.
