Description-Behavior Mismatch
Medium
- Confidence
- 92% confidence
- Finding
- The skill materially expands from a read-only content retrieval tool into system administration tasks: deploying Docker services, creating local files, initializing a database, and maintaining persistent state. That increases the attack surface and can cause the agent to make host-level changes unrelated to the user's immediate request, which is risky for a skill whose manifest frames it as simply fetching updates from websites.
