Natural-Language Policy Violations
Medium
- Confidence
- 87% confidence
- Finding
- The skill hard-codes a specific human identity ('Ivan') and associated personal context as a required behavioral truth, which can cause the agent to overwrite or misapply user-specific memory and relationship assumptions in unrelated contexts. In a memory-loading verification skill, this is more dangerous than generic prompt text because it trains the agent to privilege a fixed persona and user model without validation or user choice.
