Natural-Language Policy Violations
Medium
- Confidence: 83%
- Finding: The file explicitly states that certain proposal openings and styles 'will never' be used. This locks the agent's output into a fixed communication style, with no visible user opt-in or preference check. Although this is marketing guidance rather than overtly malicious behavior, it can override user autonomy, diverge from user intent, and cause the agent to present proposals in a voice the user did not choose.
