AI运行时观测系统

Security checks across malware telemetry and agentic risk

Overview

This non-executable debugging skill is not malware, but it asks to expose broad internal runtime context and reasoning without enough scoping or redaction.

Install only in a controlled developer or admin debugging environment. Do not use it on workflows containing credentials, private user data, protected prompts, or sensitive tool outputs unless the host platform enforces redaction, authorization, and high-level summaries instead of raw internal traces.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • System Prompt LeakageDirect Leakage, Indirect Extraction, Tool-Based Exfiltration
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (3)

Intent-Code Divergence

Medium
Confidence
90% confidence
Finding
The skill claims it only observes and does not participate in decision-making, but later instructs it to perform arbitration and declare winners in conflicts. That creates role drift from passive observability into active adjudication, which can influence orchestration behavior and operator trust even if framed as reporting. In a debugging/observability skill, this is dangerous because diagnostic output may be treated as authoritative and alter downstream decisions.

Ssd 3

Medium
Confidence
95% confidence
Finding
The skill requires reading full `trace_logs`, `previous_output`, and `data_envelope`, then describes a global append model for runtime logs. This creates a clear data minimization and leakage risk: sensitive prompts, prior outputs, internal metadata, or user data may be retained and propagated across components beyond what is necessary for debugging. Observability skills are especially risky because they centralize high-value context in one place.

Ssd 3

Medium
Confidence
97% confidence
Finding
The skill’s final goal explicitly directs it to expose internal thinking, self-critique, agent generation, and runtime decision processes to the user. That can disclose sensitive internal reasoning, hidden system prompts, security logic, user-derived confidential context, or chain-of-thought-like material that should remain internal. In an observability skill, this materially increases the risk of prompt leakage and privacy violations.

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal