Security audit

Adversarial Alignment (Agent Smith)

Security checks across malware telemetry and agentic risk

Overview

This is a lightweight plan-review skill that can challenge unsafe or weak plans, with no evidence of hidden access, code execution, persistence, or data collection.

Install this if you want an agent to explicitly challenge plans and flag governance or safety issues. For predictable behavior, invoke it by name because generic phrases like "challenge plan" or "block unsafe" may trigger it in broader planning conversations.

SkillSpector

By NVIDIA

Vulnerability Patterns

Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code

Findings (1)

Vague Triggers

Medium

Confidence: 94% confidence
Finding: The manifest lists triggers such as "challenge plan," "harden plan," "block unsafe," and "governance violation" without any scope limits or exclusion conditions. These phrases are generic enough to appear in ordinary planning or safety discussions, which could cause unintended invocation of the skill.

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal