Description-Behavior Mismatch
Medium
- Confidence
- 99% confidence
- Finding
- The script loads HuggingFace models with trust_remote_code=True, which permits execution of Python code supplied by the remote model repository during evaluation. That means a user-provided or compromised base-model repo can run arbitrary code on the local machine, contradicting the skill's 'locally runnable' framing and materially expanding the trust boundary.
