tavily-extract-zh

Security checks across malware telemetry and agentic risk

Overview

This skill is advertised as a URL text extractor, but the bundled client also exposes broader search and AI synthesis commands.

Review before installing. Use it only if you are comfortable giving the bundled client your AISA_API_KEY and allowing URLs, queries, and retrieved content to be sent to AISA endpoints beyond simple extraction. If installed, invoke only the extract subcommand for URL body extraction and avoid private or sensitive URLs unless you trust that service’s handling of the data.

SkillSpector

By NVIDIA

Vulnerability Patterns

Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration

Findings (7)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 85% confidence
Finding: The skill declares required environment variables and invokes a Python client that performs outbound API calls, but it does not declare explicit permissions for env and network access. This weakens sandboxing and review controls because the runtime capabilities exceed what the manifest transparently communicates, increasing the chance of unintended secret use or network exfiltration.

Tp4

High

Category: MCP Tool Poisoning
Confidence: 93% confidence
Finding: The skill is presented as a narrow Tavily page-extraction tool, but the referenced behavior indicates much broader search, model-querying, synthesis, and multi-source aggregation capabilities. This mismatch is dangerous because users and policy engines may authorize a low-risk extraction skill while the implementation can perform materially different network actions and process data in ways not disclosed.

Intent-Code Divergence

Medium

Confidence: 82% confidence
Finding: The run instructions point to a generic repository client rather than a dedicated extraction-only script, which suggests operators may execute tooling with capabilities beyond the skill's stated purpose. In practice this creates a confused-deputy risk: a seemingly limited skill can inherit broader search or querying features from the shared client and make unintended external requests.

Description-Behavior Mismatch

High

Confidence: 97% confidence
Finding: The file advertises and implements a broad multi-search and research CLI, while the skill metadata says it should only extract clean page text from provided URLs. This mismatch expands the skill's capabilities beyond user and platform expectations, increasing the risk of unintended data transmission, unauthorized web querying, and misuse in contexts that assumed a narrow extraction-only tool.

Description-Behavior Mismatch

High

Confidence: 98% confidence
Finding: The command parser exposes multiple subcommands for web search, scholar search, Tavily search, Perplexity-style querying, and multi-source synthesis even though the skill is described as URL extraction only. In an agent setting, hidden or undocumented capabilities are dangerous because they can be invoked to send user prompts and research targets to external services outside the expected scope.

Context-Inappropriate Capability

Medium

Confidence: 90% confidence
Finding: The verity workflow performs parallel multi-source retrieval, computes confidence scores, and sends aggregated results for AI synthesis via another endpoint. For a URL-body extraction skill, this is unnecessary privilege and data flow expansion that can expose user queries and collected content to additional services without clear justification.

Intent-Code Divergence

Medium

Confidence: 94% confidence
Finding: The module docstring explicitly presents the code as a multi-search engine client, contradicting the manifest's extraction-only purpose. Documentation mismatch is a security issue in agent ecosystems because review, policy gating, and user consent often rely on the declared function of the skill.

VirusTotal

67/67 vendors flagged this skill as clean.

View on VirusTotal