Description-Behavior Mismatch
Medium
- Confidence
- 89% confidence
- Finding
- The skill is presented as a UI testing/verification tool, but it explicitly expands into operational account-administration tasks such as PAT issuance, OAuth approval, 2FA-related flows, and destructive UI actions. That broadens the authority of the skill into sensitive security workflows where an agent could steer users through privileged actions or normalize risky browser-mediated secret handling.
