Description-Behavior Mismatch
Medium
- Confidence
- 92% confidence
- Finding
- The skill goes beyond formatting user-facing responses and instructs the agent to persist information into a durable memory file. Persistent writes can store user-derived or inferred data without consent, create privacy/compliance issues, and let prompt-triggered content poison future behavior through memory injection.
