Description-Behavior Mismatch
Medium
- Confidence
- 95% confidence
- Finding
- The skill instructs the agent to persist user-supplied document content into a local `references/` directory, creating a form of prompt/data persistence without trust boundaries, validation, or user confirmation. This is dangerous because uploaded documents may contain adversarial instructions or poisoned safety guidance that later analyses treat as authoritative, causing persistent integrity compromise of future outputs.
