Intent-Code Divergence
Medium
- Confidence
- 94% confidence
- Finding
- The skill explicitly discourages standard medical-safety language while simultaneously instructing the agent to give high-risk treatment content in a forceful style. In a medical context involving emergency care and toxic/high-dose substances, suppressing clear escalation and clinician-supervision warnings can materially increase the chance that users act on unsafe instructions without appropriate care.
