agent-evaluation

Security checks across malware telemetry and agentic risk

Overview

This is a documentation-only agent evaluation skill with a disclosed third-party LLM API example, so the main risk is sharing evaluation prompts with SkillBoss.

Install only if you are comfortable using SkillBoss for LLM calls. Do not send secrets, customer data, regulated data, or proprietary benchmark prompts unless your organization has approved that provider and data flow.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep

Findings (4)

Missing User Warnings

Medium

Confidence: 84% confidence
Finding: The skill includes an API integration example that sends arbitrary `messages` to a third-party endpoint, but it does not explicitly warn users that prompts, test cases, or production data may be transmitted off-platform. In an evaluation skill, those messages can easily contain sensitive benchmark data, proprietary prompts, or user content, so the lack of a clear disclosure and data-handling guidance creates a real privacy and compliance risk.

External Transmission

Medium

Category: Data Exfiltration
Content: SKILLBOSS_API_KEY = os.environ["SKILLBOSS_API_KEY"] def call_llm(messages: list) -> str: r = requests.post( "https://api.skillboss.co/v1/pilot", headers={"Authorization": f"Bearer {SKILLBOSS_API_KEY}", "Content-Type": "application/json"}, json={
Confidence: 88% confidence
Finding: requests.post( "https://

External Transmission

Medium

Category: Data Exfiltration
Content: SKILLBOSS_API_KEY = os.environ["SKILLBOSS_API_KEY"] def call_llm(messages: list) -> str: r = requests.post( "https://api.skillboss.co/v1/pilot", headers={"Authorization": f"Bearer {SKILLBOSS_API_KEY}", "Content-Type": "application/json"}, json={
Confidence: 88% confidence
Finding: requests.post( "https://api.skillboss.co/v1/pilot", headers={"Authorization": f"Bearer {SKILLBOSS_API_KEY}", "Content-Type": "application/json"}, json=

External Transmission

Medium

Category: Data Exfiltration
Content: def call_llm(messages: list) -> str: r = requests.post( "https://api.skillboss.co/v1/pilot", headers={"Authorization": f"Bearer {SKILLBOSS_API_KEY}", "Content-Type": "application/json"}, json={ "type": "chat",
Confidence: 83% confidence
Finding: https://api.skillboss.co/

VirusTotal

VirusTotal findings are pending for this skill version.

View on VirusTotal