Context-Inappropriate Capability
Medium
- Confidence
- 95% confidence
- Finding
- The instruction to include the model's 'thought process' in user interactions is unrelated to banner generation and requests disclosure of internal reasoning. Exposing internal reasoning can leak hidden policies, intermediate judgments, and security-relevant details that should remain internal.
