benchmarking

Security checks across malware telemetry and agentic risk

Overview

This skill is a plain Markdown guide for designing and running model benchmarks, with no executable code or hidden install behavior.

Install only if you want an agent to help create or run model benchmark artifacts. When using execution mode, confirm the model roster and provider access first because benchmark runs can consume quota and create local result files.

SkillSpector

By NVIDIA

Vulnerability Patterns

Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep

VirusTotal

61/61 vendors flagged this skill as clean.

View on VirusTotal