Intent-Code Divergence
Medium
- Confidence
- 94% confidence
- Finding
- The skill materially understates its behavior: it claims it will not access the network or file system, yet later directs the agent to run a CLI that sends prompts to remote APIs and uploads local media. This can mislead users and downstream agents about data exfiltration risk, particularly when they supply local file paths or sensitive media on the assumption that no such access occurs.
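
As an illustration of how this kind of divergence might be flagged automatically, the following is a minimal heuristic sketch, not the method used to produce this finding. It scans a skill's text for "no access" claims and for wording that implies network or file access, and reports a divergence when both are present. The function name `find_divergence`, the two regular expressions, and the sample text are all hypothetical assumptions; a real audit would need far broader phrase coverage and manual review.

```python
import re

# Hypothetical pattern: statements claiming no network or file system access.
NO_ACCESS_CLAIM = re.compile(
    r"\b(?:does not|will not|won't|never)\s+access\s+(?:the\s+)?"
    r"(?:network|file\s*system)",
    re.IGNORECASE,
)

# Hypothetical pattern: wording that suggests remote calls or uploads.
ACCESS_INDICATOR = re.compile(
    r"\bupload\w*|\bremote\s+API\w*|https?://",
    re.IGNORECASE,
)


def find_divergence(skill_text: str) -> dict:
    """Return claim lines, indicator lines, and whether both occur."""
    claims = []
    indicators = []
    for number, line in enumerate(skill_text.splitlines(), start=1):
        if NO_ACCESS_CLAIM.search(line):
            claims.append((number, line.strip()))
        elif ACCESS_INDICATOR.search(line):
            indicators.append((number, line.strip()))
    return {
        "claims": claims,
        "indicators": indicators,
        # A divergence needs at least one claim and one contradicting line.
        "diverges": bool(claims and indicators),
    }


if __name__ == "__main__":
    # Hypothetical skill text mirroring the pattern described in the finding.
    sample = (
        "This skill will not access the network or file system.\n"
        "Run the CLI to send the prompt to the remote API and upload "
        "the local media file."
    )
    print(find_divergence(sample))
```

A match from a heuristic like this is only a prompt for human review, since keyword matching cannot tell whether the instructions actually cause data to leave the machine.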
