Deep Research Agent

Security checks across malware telemetry and agentic risk

Overview

This is a coherent deep-research skill that uses expected web search, LLM provider keys, sub-agents, and report writing, but users should understand the privacy, file-output, and dependency-management caveats.

Install only if you are comfortable sending research topics, URLs, and fetched page content to Tavily and your chosen LLM provider. Run the backend in a dedicated virtual environment or sandbox, monitor API usage and cost, and check generated /research_request.md and /final_report.md outputs for sensitive content.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
  • Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
Findings (18)

Lp3

Medium
Category
MCP Least Privilege
Confidence
92% confidence
Finding
The skill instructs use of environment variables for API keys and web/network-backed tooling, but it does not declare corresponding permissions. That mismatch can bypass expected review or consent mechanisms, making secret access and outbound requests less transparent to the host system and user. In a research agent that performs autonomous multi-step search, this increases risk because network and credential use are central to operation.

Missing User Warnings

Medium
Confidence
92% confidence
Finding
The README explicitly describes persistent file writing (e.g. read/write_file and /final_report.md output) but does not warn users that invoking the skill may modify local files. In an autonomous agent context, undocumented filesystem writes can surprise users, overwrite existing data, or create sensitive artifacts on disk, especially when sub-agents operate automatically.

Missing User Warnings

Medium
Confidence
95% confidence
Finding
The README advertises deep web research, Tavily-powered search, and multiple LLM backends, which implies that user prompts and fetched web content are sent to third-party services, but it does not disclose this data flow or associated privacy risks. For a research agent handling arbitrary user topics, omission of this warning can cause users to unintentionally transmit sensitive queries, proprietary text, or personal data to external providers.

Vague Triggers

Medium
Confidence
83% confidence
Finding
The trigger conditions are broad enough to match ordinary user requests such as generic 'research [X]' or comprehensive analysis prompts, which can cause the agent to invoke a powerful autonomous workflow unexpectedly. Because this skill performs multi-step web access and potentially delegated research, accidental triggering can lead to unnecessary external calls, cost, and execution of a more capable workflow than the user intended.

Missing User Warnings

Medium
Confidence
88% confidence
Finding
The markdown describes running a backend script that writes a report file to `/final_report.md`, but it does not clearly warn the user about local file creation or where outputs will be stored. Hidden or poorly disclosed file-writing behavior reduces user awareness and can cause unintended persistence of sensitive research topics or collected data on disk. In an autonomous research context, this is more concerning because generated reports may contain proprietary or sensitive information aggregated from multiple sources.

Missing User Warnings

Medium
Confidence
78% confidence
Finding
The function fetches arbitrary URLs returned from search and converts full page contents to markdown, which are then supplied to the research workflow and potentially onward to an external LLM. In a deep-research agent, this creates a real data-sharing and prompt-injection exposure surface because untrusted remote content is ingested and processed without any user-facing consent, content isolation, or filtering.

Missing User Warnings

Medium
Confidence
84% confidence
Finding
The system prompt instructs the agent to write files to fixed paths such as /research_request.md and /final_report.md without clear prior disclosure or path controls. In an autonomous agent context, silent filesystem modification is risky because users may not expect writes, the fixed absolute-style paths can overwrite existing files in some runtimes, and prompt-injected workflows may abuse file-writing behavior.

Unpinned Dependencies

Low
Category
Supply Chain
Content
deepagents>=0.1.0
langchain>=0.3.0
langchain-core>=0.3.0
langchain-anthropic>=0.3.0
Confidence
90% confidence
Finding
deepagents>=0.1.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
deepagents>=0.1.0
langchain>=0.3.0
langchain-core>=0.3.0
langchain-anthropic>=0.3.0
tavily-python>=0.5.0
Confidence
96% confidence
Finding
langchain>=0.3.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
deepagents>=0.1.0
langchain>=0.3.0
langchain-core>=0.3.0
langchain-anthropic>=0.3.0
tavily-python>=0.5.0
httpx>=0.27.0
Confidence
96% confidence
Finding
langchain-core>=0.3.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
deepagents>=0.1.0
langchain>=0.3.0
langchain-core>=0.3.0
langchain-anthropic>=0.3.0
tavily-python>=0.5.0
httpx>=0.27.0
markdownify>=0.13.0
Confidence
88% confidence
Finding
langchain-anthropic>=0.3.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
langchain>=0.3.0
langchain-core>=0.3.0
langchain-anthropic>=0.3.0
tavily-python>=0.5.0
httpx>=0.27.0
markdownify>=0.13.0
Confidence
83% confidence
Finding
tavily-python>=0.5.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
langchain-core>=0.3.0
langchain-anthropic>=0.3.0
tavily-python>=0.5.0
httpx>=0.27.0
markdownify>=0.13.0
Confidence
93% confidence
Finding
httpx>=0.27.0

Unpinned Dependencies

Low
Category
Supply Chain
Content
langchain-anthropic>=0.3.0
tavily-python>=0.5.0
httpx>=0.27.0
markdownify>=0.13.0
Confidence
80% confidence
Finding
markdownify>=0.13.0

Known Vulnerable Dependency: langchain — 10 advisory(ies): CVE-2023-36258 (langchain arbitrary code execution vulnerability); CVE-2026-45134 (LangSmith SDK: Public prompt pull deserializes untrusted manifests without trust); CVE-2024-2965 (Denial of service in langchain-community) +7 more

Critical
Category
Supply Chain
Confidence
94% confidence
Finding
langchain

Known Vulnerable Dependency: langchain-core — 10 advisory(ies): CVE-2026-26013 (LangChain affected by SSRF via image_url token counting in ChatOpenAI.get_num_to); CVE-2024-10940 (langchain-core allows unauthorized users to read arbitrary files from the host f); CVE-2025-65106 (LangChain Vulnerable to Template Injection via Attribute Access in Prompt Templa) +7 more

Critical
Category
Supply Chain
Confidence
95% confidence
Finding
langchain-core

Known Vulnerable Dependency: httpx — 2 advisory(ies): CVE-2021-41945 (Improper Input Validation in httpx); CVE-2021-41945 (Encode OSS httpx <=1.0.0.beta0 is affected by improper input validation in `http)

Critical
Category
Supply Chain
Confidence
78% confidence
Finding
httpx

Known Vulnerable Dependency: markdownify — 1 advisory(ies): CVE-2025-46656 (markdownify allows large headline prefixes such as <h9999999>, which causes memo)

Low
Category
Supply Chain
Confidence
71% confidence
Finding
markdownify

VirusTotal

67/67 vendors flagged this skill as clean.

View on VirusTotal