Intent-Code Divergence
Medium
- Confidence
- 95% confidence
- Finding
- The function claims to extract memories from a conversation, but it concatenates the user message with the assistant reply and persists matches from either one. As a result, model-generated or prompt-injected assistant text can be stored as durable memory. This enables self-reinforcing false facts, memory poisoning, and persistence of sensitive or generated content that the user never intended to save.
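
To make the divergence concrete, here is a minimal sketch in Python. The reviewed function is not shown in this finding, so every name here (`MEMORY_PATTERN`, `extract_memories_divergent`, `extract_memories_user_only`) and the trigger phrase itself are hypothetical stand-ins, not the project's actual code. The first function mirrors the flagged behavior; the second shows one possible remediation, which restricts extraction to the user's own text.

```python
import re
from typing import List

# Hypothetical trigger pattern; the reviewed code's real patterns are not shown.
MEMORY_PATTERN = re.compile(
    r"\bremember that (.+?)(?:\.|$)", re.IGNORECASE | re.MULTILINE
)


def extract_memories_divergent(user_message: str, assistant_reply: str) -> List[str]:
    """Mirrors the flagged behavior: both turns are concatenated before matching."""
    combined = f"{user_message}\n{assistant_reply}"
    # Matches from the assistant's turn are indistinguishable from the user's.
    return [m.strip() for m in MEMORY_PATTERN.findall(combined)]


def extract_memories_user_only(user_message: str, assistant_reply: str) -> List[str]:
    """One possible fix: only user-authored text is eligible for persistence."""
    # assistant_reply stays in the signature for compatibility but is ignored.
    return [m.strip() for m in MEMORY_PATTERN.findall(user_message)]


if __name__ == "__main__":
    user = "Please remember that I prefer metric units."
    assistant = "Sure. Remember that the user lives in Paris."
    print(extract_memories_divergent(user, assistant))
    print(extract_memories_user_only(user, assistant))
```

With these inputs, the divergent version also returns the assistant's unverified claim about where the user lives, while the user-only version returns just the stated preference. If assistant content must be considered at all, tagging each candidate with its source role and requiring explicit user confirmation before persisting is another option, though that goes beyond this sketch.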
