Self Review

Security checks across malware telemetry and agentic risk

Overview

This is a simple local output-quality checker with no evidence of data theft, persistence, hidden installs, or destructive behavior.

Safe to install as a lightweight local formatting and clarity check. Keep it advisory or set clear limits if you wire exit code 1 to regenerate responses, especially for safety-critical, factual, legal, medical, financial, or multilingual output.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
  • Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Findings (3)

Missing User Warnings

Medium
Confidence
90% confidence
Finding
The README explicitly states that outputs needing improvement will be automatically regenerated, which means user-facing content may be altered by a hidden review step. Without clear disclosure, constraints on what may be changed, and auditability, this can cause silent modification of agent responses, including safety caveats or factual details, before they reach users.

Missing User Warnings

Medium
Confidence
93% confidence
Finding
The integration guidance recommends placing this skill directly in the path of agent responses before they are sent to users, but it does not warn that the skill can modify outbound content. In that context, the gate becomes a silent post-processor that may distort meaning, remove disclaimers, or create responses the agent did not originally produce.

Natural-Language Policy Violations

Medium
Confidence
91% confidence
Finding
The heuristic explicitly checks for Chinese- and English-specific action words, which biases approval toward outputs written in those languages and can incorrectly fail valid responses in other languages. In a self-review gate, this can systematically suppress or force reshaping multilingual output, creating unfair quality gating and reliability issues for non-target languages.

VirusTotal

53/53 vendors flagged this skill as clean.

View on VirusTotal