Natural-Language Policy Violations
Medium
- Confidence
- 89% confidence
- Finding
- The file explicitly instructs the evaluator to run each judge prompt through specific external LLMs ('Claude/GPT') without any indication of user consent, configurability, or policy checks. In a skill-evaluation pipeline, this can cause unauthorized routing of potentially sensitive prompt content to third-party models and can override user or platform model-selection constraints.
