Missing User Warnings
Medium
- Confidence
- 85% confidence
- Finding
- The skill explicitly instructs an operator to inject harmful or sensitive prompts into another agent, including jailbreak-style and discriminatory content, without any safety warning, scope limitation, or handling guidance. Even if framed as evaluation, the content normalizes operational use of unsafe prompts and increases the chance that users reproduce harmful outputs or misuse the prompts outside a controlled testing environment.
