Description-Behavior Mismatch
Medium
- Confidence
- 94% confidence
- Finding
- The skill is presented as a lightweight travel personality test, but it instructs the agent to install and invoke an external tool for live travel data. That materially expands capability beyond the declared function and can surprise operators or users, increasing the risk of unauthorized network access, tool execution, and data handling.
