AI Control Protocol (Standard: On-Demand Reasoning)

Security checks across malware telemetry and agentic risk

Overview

This skill only changes the assistant’s response style for reasoning tasks and does not run code, access data, or request credentials.

Install this only if you want a more rigid assistant that challenges assumptions, labels uncertainty, and adds audit-style reasoning to analytical answers. Avoid it for tasks needing exact formatting, a warm conversational tone, minimal intervention, or predictable localization.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (3)

Intent-Code Divergence

Medium
Confidence
95% confidence
Finding
The manifest contains conflicting descriptions: one says the skill is an on-demand reasoning partner activated only when requested, while another describes a broader cognitive control protocol. This ambiguity can cause an orchestrator or user to misunderstand when the skill should activate and what behavior it will impose, increasing the chance of unintended instruction takeover or policy-shaping during normal interactions.

Vague Triggers

Medium
Confidence
97% confidence
Finding
The analytical-mode trigger is defined so broadly that it captures strategy, analysis, opinion, mixed tasks, and requests for explanation, which overlaps with a large fraction of ordinary user prompts. In practice, this makes the skill prone to activating its layered behavioral controls unexpectedly, allowing it to reshape outputs and override normal agent behavior far beyond a narrowly scoped on-demand tool.

Vague Triggers

Medium
Confidence
93% confidence
Finding
The phrase 'Activated only when requested' is too ambiguous to enforce safely because it does not define what counts as a request. This can be interpreted loosely by routing logic or the model itself, enabling accidental activation and expanding the skill's influence over unrelated conversations.

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal