benchmarking
Security checks across malware telemetry and agentic risk
Overview
This skill is a plain Markdown guide for designing and running model benchmarks, with no executable code or hidden install behavior.
Install only if you want an agent to help create or run model benchmark artifacts. Before using execution mode, confirm the model roster and provider access, because benchmark runs can consume quota and create local result files.
SkillSpector
By NVIDIA
Vulnerability Patterns
- Prompt Injection: Instruction Override, Hidden Instructions, Exfiltration Commands
- Data Exfiltration: External Transmission, Env Variable Harvesting, File System Enumeration
- Privilege Escalation: Excessive Permissions, Sudo/Root Execution, Credential Access
- Supply Chain: Unpinned Dependencies, External Script Fetching, Obfuscated Code
- Excessive Agency: Unrestricted Tool Access, Autonomous Decision Making, Scope Creep
VirusTotal
61/61 vendors flagged this skill as clean.
