Intent-Code Divergence
Medium
- Confidence
- 97% confidence
- Finding
- The skill explicitly states that host-initiated instructions injected into the user-message slot should be followed exactly and can override the normal conversational flow. That creates an instruction-priority inversion where untrusted or weakly authenticated host content can bypass the skill's own confirmation, intent-validation, and safety constraints, enabling unintended actions such as email sends, rule changes, or action execution.
