Description-Behavior Mismatch
Medium
- Confidence
- 90% confidence
- Finding
- The skill's description claims its output is 'always clearly labeled' and 'never impersonating real people,' but a later workflow step instructs the model to stay inside the target framework for subsequent responses. This creates a disclosure gap: follow-up responses can be persona-shaped without carrying any attribution, which reduces transparency and increases the risk of deceptive anthropomorphic simulation.
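The disclosure gap described above can be illustrated with a minimal sketch that flags responses lacking an attribution label. The label text, the `unlabeled_turns` helper, and the sample conversation are all hypothetical; the finding does not specify the skill's actual label wording or output format.

```python
from typing import List

# Hypothetical label; the skill's real disclosure wording is not given in the finding.
LABEL = "[Simulated perspective]"


def unlabeled_turns(responses: List[str], label: str = LABEL) -> List[int]:
    """Return the indices of responses that do not begin with the disclosure label."""
    return [
        i
        for i, text in enumerate(responses)
        if not text.lstrip().startswith(label)
    ]


# Illustrative conversation: only the first response is labeled, and the
# follow-ups stay "inside the framework" without attribution.
conversation = [
    "[Simulated perspective] First, consider the incentives at play.",
    "The next step is to invert the problem.",
    "Always ask what could go wrong.",
]

gaps = unlabeled_turns(conversation)
print(gaps)
```

Any index in `gaps` marks a response a user could receive without persistent attribution, which is the condition this finding flags.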
