Natural-Language Policy Violations
Medium
- Confidence
- 89% confidence
- Finding
- The skill requires a fixed confirmation phrase ("确认" or "confirm") rather than allowing the agent to obtain and interpret explicit user approval more flexibly. This can create brittle approval handling, increase the chance of accidental or coerced confirmations, and may encourage unsafe UX patterns where exact wording is treated as authorization.
