Missing User Warnings
Medium
- Confidence
- 94% confidence
- Finding
- The skill explicitly instructs users to send prompts and model outputs to multiple third-party endpoints and later save artifacts such as responses, rubrics, reports, and checkpoints, but it does not prominently warn that potentially sensitive task data, generated queries, and model responses will leave the local environment and be persisted to disk. This can cause unintentional disclosure of confidential inputs, evaluation content, or proprietary outputs when users assume benchmarking is local-only.
