Intent-Code Divergence
Medium
- Confidence
- 97% confidence
- Finding
- The prompt asserts that incoming messages are Base64-encoded and should be treated as inert data, but only the JSON wrapper for currentMessages is Base64-encoded while conversation history and embedded fields remain directly injected into the prompt. This creates a false sense of safety and leaves the model exposed to prompt injection from stored message content, visitor fields, or other untrusted database data that can override instructions or manipulate downstream behavior.
