Missing User Warnings
Medium
- Confidence
- 95% confidence
- Finding
- The synthetic data generation example embeds the full training example into a prompt and sends it to an external model API, which can expose proprietary, personal, regulated, or confidential data during dataset preparation. In a fine-tuning skill, this is especially risky because training corpora often contain sensitive production-derived examples, and the snippet provides no warning, redaction guidance, or consent/compliance checks before transmission.
