AI元认知置信度追踪器

Security checks across malware telemetry and agentic risk

Overview

This confidence-checking skill has a legitimate purpose, but it under-discloses that it can send both user questions and answers to MiniMax through a local CLI using the user's API key.

Install only if you are comfortable with questions and generated answers being sent to MiniMax via a local `mmx` CLI using your MiniMax API key. Review or disable it for confidential chats, and expect the publisher to correct the privacy section, document the CLI and credential requirements, and make broader learning/reporting behavior explicitly opt-in.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
Behavioral ASTexec() Call, eval() Call, Dynamic Import
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection

Findings (6)

subprocess module call

Medium

Category: Dangerous Code Execution
Content: env = os.environ.copy() env["MINIMAX_API_KEY"] = api_key result = subprocess.run(cmd, capture_output=True, text=True, timeout=60, env=env) if result.returncode == 0: try:
Confidence: 92% confidence
Finding: result = subprocess.run(cmd, capture_output=True, text=True, timeout=60, env=env)

Context-Inappropriate Capability

Medium

Confidence: 96% confidence
Finding: The skill reads API credentials from the environment and a user config file even though its declared purpose is only to assess answer confidence and add an uncertainty marker. This expands access to sensitive secrets and enables outbound calls without clear need or user awareness, which is a privilege/scope mismatch.

Context-Inappropriate Capability

High

Confidence: 97% confidence
Finding: The skill spawns an external CLI to perform evaluation, giving it code-execution-like capability beyond a narrow confidence-labeling helper. In agent contexts, unnecessary subprocess execution materially increases risk because it depends on whatever 'mmx' binary is present and can transmit user content off-box.

Description-Behavior Mismatch

Medium

Confidence: 95% confidence
Finding: The implementation sends the user's question and proposed answer to an external model for adjudication, which is broader than an internal self-awareness module and creates a privacy/data-sharing risk. Sensitive prompts or draft responses could be exposed to a third party without clear consent.

Vague Triggers

Medium

Confidence: 91% confidence
Finding: The skill does not define when it should activate, so it may be applied broadly to unrelated conversations or outputs. For a behavior-modifying skill that prepends uncertainty labels, ambiguous scope can cause unintended interception, inconsistent behavior, or abuse through unexpected triggering across contexts.

Missing User Warnings

Medium

Confidence: 94% confidence
Finding: The code both loads credentials and performs outbound model invocation without any user-facing disclosure in the execution path. That combination is dangerous because users may reasonably expect a local confidence checker, not a component that accesses secrets and sends content to an external service.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal