Pilot Quarantine

Security checks across malware telemetry and agentic risk

Overview

This incident-response skill is not malicious, but it can disconnect live agents despite saying immediate disconnects are out of scope.

Install only if operators understand this is active containment, not just passive quarantine. Require manual review before running the disconnect examples, and prefer a version that clearly separates temporary trust suspension from immediate session termination.

SkillSpector

By NVIDIA

Vulnerability Patterns

MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code

Findings (4)

Description-Behavior Mismatch

Medium

Confidence: 95% confidence
Finding: The skill claims quarantine should not be used for immediate disconnects, yet the quarantine example both untrusts the agent and forcibly disconnects active sessions. This mismatch can mislead operators and cause more disruptive actions than intended, especially if an agent is only under investigation rather than confirmed malicious.

Description-Behavior Mismatch

Medium

Confidence: 96% confidence
Finding: The enforcement loop repeatedly disconnects quarantined agents even though the manifest says immediate disconnect is out of scope for this skill. That creates a behavioral contradiction and can turn a temporary isolation mechanism into an ongoing active denial of connectivity without clear operator consent.

Intent-Code Divergence

Medium

Confidence: 93% confidence
Finding: The top-level guidance says not to use this skill for immediate disconnect, but the embedded examples later perform disconnect operations anyway. Such contradictory instructions are dangerous in operational security tooling because they undermine operator expectations and increase the chance of unintended service disruption.

Missing User Warnings

Medium

Confidence: 91% confidence
Finding: The example issues untrust and disconnect commands that directly change live network state, but it does so without an explicit warning, dry-run, or confirmation step. In a security response context, these actions can interrupt legitimate agents, sever active sessions, and complicate incident investigation if triggered too quickly or by mistake.

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal