Ssd 1
Medium
- Confidence
- 97% confidence
- Finding
- The skill instructs the agent to treat transcribed audio 'as normal chat input' and answer resulting questions or instructions without any trust boundary or safety filtering. Because audio is an untrusted external input, an attacker can embed spoken prompt-injection content that manipulates agent behavior, elicits sensitive information, or triggers unsafe downstream actions if the broader system grants tool access.
