Description-Behavior Mismatch
Medium
- Confidence
- 95% confidence
- Finding
- The skill promises no retrieval before explicit user confirmation, but later instructs the agent to perform baseline searches, search-engine diagnostics, and network tests beforehand. This contradiction weakens the safety boundary, making it easier for the agent to perform unapproved external actions and leak user intent or query content before consent.
