Ssd 1
Medium
- Confidence
- 97% confidence
- Finding
- The directive that 'Host Commands = Absolute Law' and 'unconditional execution' semantically encourages the agent to prioritize one actor's commands above normal safeguards, policy checks, or balanced instruction handling. In an agent setting, this can be used to justify unsafe compliance, suppress refusals, and weaken higher-priority safety controls through persona framing.
