LLM Eval Harness

Security checks across malware telemetry and agentic risk

Overview

This is a coherent LLM evaluation skill, but it tells agents to run generated code without defining safety boundaries.

Install only if you will run coding evaluations in a disposable sandbox or container with no sensitive credentials, minimal filesystem access, restricted network access, and explicit approval before executing any model-generated code.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Output HandlingUnvalidated Output Injection, Cross-Context Output, Unbounded Output
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
  • Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Findings (1)

Unvalidated Output Injection

High
Category
Output Handling
Content
- Multiple judge passes for controversial cases

**Code execution:**
- For coding tasks, execute generated code against test cases
- Check for syntax errors, runtime errors, correct output

**Regex patterns:**
Confidence
92% confidence
Finding
execute generated code

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal