Intent-Code Divergence
Medium
- Confidence
- 89% confidence
- Finding
- The skill presents a restrictive safety boundary by saying the agent is explicitly prohibited from reading source code directly, but later includes direct file-content retrieval via API endpoints. Conflicting instructions can bypass user expectations and safety controls by reframing source access as an API operation rather than direct file reading.
