Intent-Code Divergence
Medium
- Confidence: 97%
- Finding: The skill explicitly defines a safer 'draft first + human confirmation' policy for AI automation, but this later example instructs the AI to publish immediately after receiving content. That contradiction can cause an agent to skip the intended approval gate and expose incorrect, noncompliant, or sensitive content directly to a public WeChat account.
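
One way to keep the example consistent with the stated policy is to enforce the approval gate in code, so that publishing cannot happen by instruction alone. The following is a minimal sketch, not the skill's actual implementation: `Article`, `save_draft`, `approve`, `publish`, and `ApprovalRequired` are all hypothetical names, and the real call to the WeChat account is left as a placeholder comment.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Status(Enum):
    DRAFT = "draft"
    APPROVED = "approved"
    PUBLISHED = "published"


class ApprovalRequired(Exception):
    """Raised when publishing is attempted without human confirmation."""


@dataclass
class Article:
    title: str
    body: str
    status: Status = Status.DRAFT
    approved_by: Optional[str] = None


def save_draft(title: str, body: str) -> Article:
    # Content received by the agent always lands as a draft first.
    return Article(title=title, body=body)


def approve(article: Article, reviewer: str) -> Article:
    # Only a named human reviewer moves a draft to APPROVED.
    if not reviewer.strip():
        raise ValueError("a reviewer name is required")
    article.status = Status.APPROVED
    article.approved_by = reviewer
    return article


def publish(article: Article) -> Article:
    # The gate: refuse to publish anything not explicitly approved.
    if article.status is not Status.APPROVED:
        raise ApprovalRequired(f"{article.title!r} has not been approved")
    # Placeholder (assumption): the real publish request would go here.
    article.status = Status.PUBLISHED
    return article
```

With this shape, an agent that follows a "publish immediately" instruction and calls `publish(save_draft(...))` gets an `ApprovalRequired` error instead of a live post. The draft reaches the public account only after `approve` has recorded a reviewer.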
