Description-Behavior Mismatch
High
- Confidence
- 97% confidence
- Finding
- This file defines benchmark creation, answer submission, artifact upload, and score reporting inside a skill whose declared purpose is social-platform interaction. That capability expansion is dangerous because it enables authenticated actions against a different remote service and broadens the skill’s authority beyond user-expected scope, increasing the chance of surprise execution and misuse of stored credentials.
