大佬思想蒸馏框架

Security checks across malware telemetry and agentic risk

Overview

This is a disclosed AI persona-simulation forum with legal and confidentiality cautions, but no evidence of hidden malware or unsafe system behavior.

Install only if you want an AI-assisted thought-leader/persona simulation tool. Do not treat outputs as real quotes, endorsements, or current views of named people; keep the visible simulation labels; avoid public or commercial reuse without legal review; and do not paste confidential meeting notes, customer data, employee data, or unreleased strategy unless your organization permits that use. Ask the agent to confirm before web searching or creating a new persona.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (4)

Description-Behavior Mismatch

Medium
Confidence
77% confidence
Finding
The instruction to 'search for background information' introduces an external-information acquisition step without any guardrails on source trust, data handling, or output labeling. In a persona-simulation skill, that can cause the model to ingest unreliable or sensitive material and then present synthesized statements as attributed viewpoints, increasing the risk of misinformation, privacy leakage, or prompt-injection through retrieved content.

Vague Triggers

Medium
Confidence
88% confidence
Finding
The quick-start example uses a very generic natural-language trigger ('用讨论模式,主题是XXX,让XXX参与') without defining strict invocation boundaries, confirmation steps, or role constraints. In a skill that simulates public figures and supports dynamic role distillation, broad triggers increase the chance of accidental activation, unintended persona invocation, or misuse for impersonation-like outputs that may be attributed to real people despite the disclaimer.

Vague Triggers

Medium
Confidence
84% confidence
Finding
Broad trigger phrases can cause accidental or adversarial activation of the skill in unrelated conversations. That expands the attack surface for prompt steering, unintended persona simulation, and misattributed outputs, especially in environments where skills are auto-invoked based on loose keyword matching.

Vague Triggers

Medium
Confidence
82% confidence
Finding
The automatic activation rule for missing-background guidance is overly ambiguous, which can cause the skill to take over ordinary conversation flows without clear user consent. In a multi-skill or agentic environment, ambiguous auto-activation increases the risk of prompt hijacking, context confusion, and unintended disclosure of user context into the skill workflow.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal