Description-Behavior Mismatch
Medium
- Confidence
- 90% confidence
- Finding
- The example scenes go beyond storyboarding and task planning: they include runnable shell commands for environment inspection, package installation, file creation, and destructive cleanup. In an agent skill context, an agent may treat these templates as instructions to carry out, so they can normalize or trigger real system modifications unrelated to the skill's declared purpose. This raises the risk of unintended execution and impact on the host.
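
As an illustration of how the four command categories named in the finding might be surfaced during review, the following is a minimal sketch, not the scanner that produced this finding. The `RISK_PATTERNS` table, the `flag_commands` helper, and the specific commands matched are all assumptions chosen for the example; a real check would need a much broader and more careful pattern set.

```python
import re

# Hypothetical category -> pattern table (an assumption for illustration).
# The category names mirror the finding; the matched commands are examples only.
RISK_PATTERNS = {
    "environment inspection": re.compile(r"\b(uname|whoami|printenv)\b"),
    "package installation": re.compile(r"\b(pip3?|npm|apt(-get)?|brew)\s+install\b"),
    "file creation": re.compile(r"\b(touch|mkdir|tee)\b"),
    "destructive cleanup": re.compile(r"\brm\s+-[a-zA-Z]*[rf]"),
}


def flag_commands(template_text):
    """Return (line_number, category, line) for each line matching a pattern."""
    findings = []
    for lineno, line in enumerate(template_text.splitlines(), start=1):
        for category, pattern in RISK_PATTERNS.items():
            if pattern.search(line):
                findings.append((lineno, category, line.strip()))
    return findings


if __name__ == "__main__":
    # Made-up template text, not taken from the skill under review.
    sample = "Scene 1: check the machine\nuname -a\nScene 2: tidy up\nrm -rf build\n"
    for lineno, category, line in flag_commands(sample):
        print(f"line {lineno}: {category}: {line}")
```

A pattern match only marks a line for human review; it does not show that an agent would actually execute the command.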
