Context-Inappropriate Capability
Medium
- Confidence
- 99% confidence
- Finding
- The skill instructs the assistant to include its hidden thought process in user-facing responses, which is unrelated to the timer/shutdown function and requests disclosure of sensitive internal reasoning. Exposing chain-of-thought can leak internal safety logic, prompt structure, and decision-making artifacts that attackers can use to jailbreak or manipulate the agent.
