Description-Behavior Mismatch
Medium
- Confidence
- 93% confidence
- Finding
- The archived audit explicitly preserves a stress-case failure where the skill drifted outside its declared scope and missed required boundary guidance. For an academic-writing skill in a medical context, scope drift can cause the agent to produce content beyond formatting/structuring into higher-risk advisory or unsupported professional material, undermining safety claims.
