Natural-Language Policy Violations
Medium
- Confidence
- 85% confidence
- Finding
- Requiring one exact Chinese phrase for confirmation without prior user opt-in can cause unsafe workflow failures: users may believe they approved a change while the skill silently refuses, or agents may pressure users into a rigid phrasing pattern. This is not a direct code-execution issue, but it creates consent ambiguity and brittle human-in-the-loop controls in a security-sensitive configuration workflow.
