Description-Behavior Mismatch
Medium
- Confidence: 93%
- Finding: The skill’s stated purpose is transcript export, but it also instructs the agent to score and critique both the user and the model. This extends processing of sensitive conversation data beyond what the user would likely expect, and it generates unnecessary profiling content that may be stored on disk alongside the transcript.
