Description-Behavior Mismatch
High
- Confidence
- 95% confidence
- Finding
- The script's behavior materially diverges from the skill's declared purpose: it launches a Prism mock server rather than performing Drift-based API verification. In an agent setting, this can cause the system to run the wrong toolchain, produce misleading "successful" outcomes, and mask the absence of real conformance testing, which is especially dangerous because the skill description explicitly targets verification and 'keep running until everything passes' workflows.
