self-correction

Security checks across malware telemetry and agentic risk

Overview

This is a conversational self-correction skill with overbroad Chinese trigger phrases, but no evidence of hidden access, data theft, persistence, or destructive behavior.

Install only if you want a Chinese-language correction workflow and can tolerate occasional false triggers when using phrases like '还有', '但是', '等等', or broad negations. Treat the included packaging scripts as manual developer utilities and only run them on directories you intend to package.

SkillSpector

By NVIDIA

Vulnerability Patterns

Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code

Findings (4)

Vague Triggers

High

Confidence: 95% confidence
Finding: The trigger list includes very broad conversational phrases such as '等等', '等一下', and '停', which are common in ordinary dialogue and not specific to correction intent. This can cause the skill to activate unexpectedly, overriding the normal conversational path and making agent behavior easier to manipulate or derail.

Vague Triggers

High

Confidence: 96% confidence
Finding: Phrases like '不是这样', '不是这个', and especially '我说的是...' are highly ambiguous and appear frequently in benign conversation. Without boundary conditions, the skill may hijack normal user turns and reinterpret intent incorrectly, creating prompt-routing instability and opportunities for adversarial steering.

Vague Triggers

Critical

Confidence: 98% confidence
Finding: The '修正补充类' triggers include extremely common discourse markers like '还有' and '但是', which occur in routine follow-up requests. Because these tokens are so frequent, the skill can be triggered by ordinary conversation at scale, causing persistent misrouting, context resets, and a broad attack surface for prompt-level manipulation.

Vague Triggers

Critical

Confidence: 99% confidence
Finding: A trigger rule based on single negation-word patterns like '不'+verb or '没'+verb is dangerously overbroad. Negation is ubiquitous in normal language, so this design can cause near-arbitrary activation, making the skill easy to invoke unintentionally or deliberately to disrupt intended agent control flow.

VirusTotal

67/67 vendors flagged this skill as clean.

View on VirusTotal