hxl-code-reviewer

Security checks across malware telemetry and agentic risk

Overview

This is a coherent MCP server-building guide with an optional evaluator; no hidden malicious behavior was found, but the evaluator should only be run on test or read-only systems.

Install only if you intend to build or evaluate MCP servers. Run the optional evaluator in a virtual environment with pinned dependencies, use synthetic or read-only test data, avoid production credentials, and review generated reports before sharing them because tool inputs and outputs may be included.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection

Findings (9)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 86% confidence
Finding: The skill instructs the agent to create MCP servers and evaluation artifacts, including writing files and interacting with MCP-capable tooling, but it declares no permissions. This mismatch can cause unsafe execution in environments that rely on explicit permission declarations, because users and policy systems are not clearly informed that the skill may write files or interact with external MCP services.

Tp4

High

Category: MCP Tool Poisoning
Confidence: 94% confidence
Finding: The stated description presents the skill as a development guide, but the behavior described by analysis includes acting as an MCP client, invoking remote tools, running evaluations, calling Anthropic APIs, and generating reports. This description-behavior gap is security-relevant because operators may approve the skill expecting passive guidance while it actually enables active external interaction, data transmission, and potentially costly or risky tool execution.

Context-Inappropriate Capability

Medium

Confidence: 95% confidence
Finding: The guide explicitly demonstrates collecting an API key from the user via `ctx.elicit(..., input_type="password")` without any security framing, storage restrictions, or recommendation to prefer preconfigured secrets/token brokers. In a skill intended to teach MCP server construction, this normalizes credential harvesting patterns and can lead developers to build tools that solicit, handle, and possibly expose sensitive secrets unnecessarily.

Context-Inappropriate Capability

Medium

Confidence: 90% confidence
Finding: The resource example exposes local file contents through a URI template (`file://documents/{name}`) and reads from `./docs/{name}` with no validation, allowlist, or path normalization. In a guide focused on external API/service integration, this broadens scope toward local file access and teaches a pattern that could become path traversal or unintended local data exposure when copied into real MCP servers.

Missing User Warnings

Medium

Confidence: 94% confidence
Finding: The example requests an API key in plain language and presents it as a normal tool interaction, but provides no warning about secret handling, safer alternatives, or downstream risks like logging, prompt leakage, or reuse across sessions. Documentation that omits these caveats can directly influence insecure implementations by downstream developers.

Ssd 3

Medium

Confidence: 96% confidence
Finding: The guidance explicitly instructs tools to request and use user API keys, which is a sensitive-data handling pattern that can easily be abused or implemented insecurely. Because this is prescriptive instructional content, it has elevated risk: readers may reproduce the pattern verbatim without adding safeguards, resulting in credential theft or accidental disclosure.

Ssd 3

Medium

Confidence: 98% confidence
Finding: The evaluation system prompt explicitly instructs the model to disclose every tool input and output in <summary> tags. Since those summaries are persisted into the final report, any secrets present in prompts, headers, API responses, tool arguments, or tool results can be echoed into logs or output files, creating a clear sensitive-data exposure risk.

Known Vulnerable Dependency: anthropic — 2 advisory(ies): CVE-2026-34450 (Claude SDK for Python has Insecure Default File Permissions in Local Filesystem ); CVE-2026-34452 (Claude SDK for Python: Memory Tool Path Validation Race Condition Allows Sandbox)

Low

Category: Supply Chain
Confidence: 89% confidence
Finding: anthropic

Known Vulnerable Dependency: mcp — 3 advisory(ies): CVE-2025-53366 (MCP Python SDK vulnerability in the FastMCP Server causes validation error, lead); CVE-2025-66416 (Model Context Protocol (MCP) Python SDK does not enable DNS rebinding protection); CVE-2025-53365 (MCP Python SDK has Unhandled Exception in Streamable HTTP Transport, Leading to )

High

Category: Supply Chain
Confidence: 97% confidence
Finding: mcp

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal