Description-Behavior Mismatch
High
- Confidence
- 95% confidence
- Finding
- The skill metadata frames this as a response-safety layer, but the file contains extensive doctrinal, legal, and behavioral directives that go far beyond constraining model responses for safety. The mismatch is dangerous because it lets the skill silently override user intent and platform policy with an embedded ideological rulebook, which amounts to unauthorized instruction hijacking in a sensitive domain.
