Description-Behavior Mismatch
Medium
- Confidence
- 93% confidence
- Finding
- The skill claims to enforce spending policies before payments, but it also exposes full policy administration endpoints that can create, modify, or delete policies and rules. In an agent setting, this broadens the tool from guardrail enforcement into guardrail reconfiguration, which can let a compromised or overly-permissive workflow weaken or remove protections before spending.
