Description-Behavior Mismatch
Low
- Confidence
- 78% confidence
- Finding
- The skill's primary description emphasizes cross-validation, but the documentation also states that raw model outputs and synthesized results are persisted to local report and intermediate files. This creates a data-retention surface that may unexpectedly store sensitive prompts, provider outputs, or regulated data on disk beyond the immediate task execution.
