Description-Behavior Mismatch
Medium
- Confidence
- 95% confidence
- Finding
- The flow explicitly writes a detailed assessment report to `results/exam-{sessionId}-full.md` and maintains a history index, creating persistent local artifacts from a user interaction. In a self-assessment skill, this is risky because it stores session metadata and performance information without any explicit consent, retention policy, or indication that disk state will be modified.
