Description-Behavior Mismatch
Medium
- Confidence
- 93% confidence
- Finding
- The skill advertises detection of prompt injection, malicious skills, and data exfiltration, but the documented behavior only collects local metadata, registers the agent with a remote service, and relays remote alerts. That creates a material capability mismatch: users may rely on protections that are not actually implemented locally, which can lead to unsafe trust and delayed detection of real attacks.
