Natural-Language Policy Violations
Medium
- Confidence: 90%
- Finding: The skill explicitly instructs the agent to ALWAYS use a specific high-end model unless the user names another one. This removes the agent's normal handling of user choice and cost/performance tradeoffs. It can override user intent, increase spend, and steer downstream behavior, even though a high-end model is not technically necessary for most requests.
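
To make the flagged behavior concrete, here is a minimal sketch contrasting the pattern described in the finding with a tradeoff-aware alternative. The model identifiers, the function names, and the `needs_high_end` flag are all hypothetical and do not come from the skill under review; this illustrates the difference in selection logic only and is not a prescribed fix.

```python
from typing import Optional

# Hypothetical model identifiers, used only for illustration.
HIGH_END_MODEL = "high-end-model"
ECONOMY_MODEL = "economy-model"


def select_model_forced(user_model: Optional[str]) -> str:
    """The flagged pattern: always the high-end model unless the user names one."""
    return user_model or HIGH_END_MODEL


def select_model_tiered(user_model: Optional[str], needs_high_end: bool) -> str:
    """A tradeoff-aware alternative (assumed design, not taken from the skill)."""
    # An explicit user choice always wins.
    if user_model:
        return user_model
    # Otherwise choose by task need instead of defaulting to the costliest tier.
    return HIGH_END_MODEL if needs_high_end else ECONOMY_MODEL
```

In the forced variant, every request that does not name a model is routed to the most expensive tier. In the tiered variant, the high-end model is used only when the task is judged to need it, which is the cost/performance handling the finding says the skill removes.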
