Natural-Language Policy Violations
High
- Confidence: 97%
- Finding: The input draft explicitly contains prohibited public-facing content: a definitive guarantee ('we guarantee') and a request to share confidential account numbers. Both directly contradict the declared constraints. In a publishing-governance skill this is dangerous: unless the guard reliably blocks and sanitizes such content, it could be propagated, tested, or mishandled, causing legal, privacy, and trust harms.
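The kind of check this finding expects from the guard can be illustrated with a minimal sketch. It assumes a simple pattern-based rule set; the names `PROHIBITED_PATTERNS`, `find_violations`, and `is_publishable` are hypothetical and do not come from the skill under review, whose actual guard logic is not shown here.

```python
import re

# Hypothetical rule set covering only the two violations named in the finding.
# A real guard would likely need broader patterns and human review.
PROHIBITED_PATTERNS = {
    "definitive_guarantee": re.compile(r"\bwe\s+guarantee\b", re.IGNORECASE),
    "account_number_request": re.compile(
        r"\b(?:share|send|provide|give)\b[^.?!]*\baccount\s+numbers?\b",
        re.IGNORECASE,
    ),
}


def find_violations(draft: str) -> list[str]:
    """Return the name of every prohibited pattern that matches the draft."""
    return [
        name
        for name, pattern in PROHIBITED_PATTERNS.items()
        if pattern.search(draft)
    ]


def is_publishable(draft: str) -> bool:
    """Treat a draft as publishable only when no prohibited pattern matches."""
    return not find_violations(draft)
```

Pattern matching of this sort catches only literal phrasings, so it is best read as a first-pass filter rather than a complete policy check; paraphrased guarantees or indirect requests for account data would pass through it.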
