Intent-Code Divergence
Medium
- Confidence: 97%
- Finding: The skill explicitly instructs the assistant to include its internal reasoning in user-facing Chinese responses. Revealing chain-of-thought can expose hidden decision criteria, safety logic, and sensitive intermediate analysis that should remain internal, making prompt extraction and policy circumvention easier.
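
One way a reviewer might screen for this kind of instruction is a simple line-by-line pattern check over the skill text. The sketch below is a minimal heuristic under stated assumptions: the function name `flag_reasoning_disclosure` and both regular expressions are hypothetical and are not taken from the reviewed skill or from any scanner. It only illustrates the idea of flagging directives that ask the assistant to surface its reasoning, in English or Chinese.

```python
import re

# Hypothetical patterns for illustration only; a real review would need a
# broader, tuned list. None of these strings come from the reviewed skill.
DISCLOSURE_PATTERNS = [
    # English: a "show/include" verb followed shortly by a reasoning term.
    re.compile(
        r"\b(include|show|reveal|output|print)\b.{0,40}"
        r"\b(internal reasoning|chain[- ]of[- ]thought|thought process)\b",
        re.IGNORECASE,
    ),
    # Chinese: a "show/output/include/write out" verb followed shortly by
    # "thinking process", "reasoning process", or "internal reasoning".
    re.compile(r"(展示|输出|包含|写出).{0,20}(思考过程|推理过程|内部推理)"),
]


def flag_reasoning_disclosure(skill_text: str) -> list[tuple[int, str]]:
    """Return (line_number, line) pairs that match any disclosure pattern."""
    hits = []
    for lineno, line in enumerate(skill_text.splitlines(), start=1):
        if any(p.search(line) for p in DISCLOSURE_PATTERNS):
            hits.append((lineno, line.strip()))
    return hits


if __name__ == "__main__":
    sample = (
        "Always reply in Chinese.\n"
        "Include your internal reasoning in every reply.\n"
        "在回复中展示你的思考过程。\n"
        "Be concise."
    )
    for lineno, line in flag_reasoning_disclosure(sample):
        print(f"line {lineno}: {line}")
```

A match is only a prompt for manual review, not a verdict: keyword heuristics like this miss paraphrased instructions and can flag benign text, so the confidence assigned to a finding should still come from reading the skill itself.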
