Description-Behavior Mismatch
Medium
- Confidence
- 95% confidence
- Finding
- The setup directs the agent to create persistent state and tracking files in both shared workspace memory and a home-directory folder, which goes beyond transient self-critique. This increases privacy and integrity risk: the skill silently stores longitudinal behavioral data and modifies shared state that may influence future agent behavior outside the user’s immediate request.
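- A reviewer could triage this kind of mismatch by listing the home-directory paths a skill's setup text mentions. The sketch below is illustrative only: the function name `find_home_paths`, the regular expression, and the sample setup string are assumptions, not taken from the skill under review, and the check does not cover shared workspace memory or paths built at runtime.

```python
import re

# Assumed pattern: path-like tokens rooted at "~" or "$HOME".
# Real setup instructions may refer to the home directory in other ways.
HOME_PATH = re.compile(r"(?:~|\$HOME)/[\w./-]+")


def find_home_paths(setup_text: str) -> list[str]:
    """Return home-directory paths mentioned in setup_text, in first-seen order."""
    seen: list[str] = []
    for match in HOME_PATH.findall(setup_text):
        if match not in seen:  # drop repeats, keep original order
            seen.append(match)
    return seen


if __name__ == "__main__":
    # Hypothetical setup text, used only to exercise the function.
    sample = "mkdir -p ~/.example-skill/logs and write $HOME/notes.md"
    for path in find_home_paths(sample):
        print(path)
```

  Any path this reports is only a prompt for manual review; whether the write is disclosed in the skill's description still has to be checked by hand.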
