Back to skill

Security audit

Deep Research Pipeline

Security checks across malware telemetry and agentic risk

Overview

This skill is a coherent deep-research tool, but users should understand that it can send prompts and retrieved content to external services and save research artifacts locally.

Install only if you are comfortable with research questions and retrieved source text being sent to configured search, GitHub, documentation, and LLM providers. Use mock/offline modes or a trusted local provider for sensitive work, run it in a sandboxed workspace, pick explicit output/checkpoint paths, and verify citations before relying on generated reports.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Rogue AgentSelf-Modification, Session Persistence
  • MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Findings (14)

Lp3

Medium
Category
MCP Least Privilege
Confidence
95% confidence
Finding
The skill documentation describes concrete capabilities to read environment variables, write files, and perform network access, but it declares no permissions. That mismatch can prevent informed consent and policy enforcement by the host, especially because the pipeline can transmit prompts and retrieved content to external LLM/search providers and persist outputs locally.

Tp4

High
Category
MCP Tool Poisoning
Confidence
91% confidence
Finding
The declared purpose frames the skill as a research workflow, but the described behavior includes broader capabilities such as arbitrary remote fetching, GitHub/code search, parallel execution, checkpointing, and report writing to disk. This increases risk because users may invoke it expecting bounded research behavior while it actually performs wider data collection, persistence, and outbound transmission than the description suggests.

Intent-Code Divergence

Medium
Confidence
97% confidence
Finding
The methodology section asserts that all findings are directly sourced and cited, but several code paths such as summary, brief, and some template outputs omit citations entirely. In a research-writing skill, this is a security-relevant integrity issue because users may trust unsupported claims as verified, enabling misleading or fabricated content to be presented with false provenance.

Intent-Code Divergence

Medium
Confidence
95% confidence
Finding
The code claims report generation includes inline citations, but LLM-produced theme sections are trusted verbatim without post-generation verification that citations are present, valid, and tied to supplied sources. This creates an integrity vulnerability where the model can hallucinate citations or produce unsupported factual assertions, especially risky in a deep-research skill whose purpose is authoritative synthesis.

Missing User Warnings

Low
Confidence
87% confidence
Finding
The documentation explicitly instructs the orchestrator to save plans under memory/research and later describes saving reports and checkpoints, but it does not clearly warn users that running the skill modifies the local filesystem. In an agent setting, undocumented write behavior can surprise operators, overwrite files, or persist sensitive research material in unintended locations.

Missing User Warnings

Medium
Confidence
94% confidence
Finding
The skill is a research pipeline whose normal operation includes web search, fetching documentation URLs, GitHub queries, and calls to configured LLM providers, yet the examples and CLI guidance do not prominently warn that user questions and retrieved content may be transmitted to external services. In agent environments this can expose sensitive prompts, internal topics, tokens, or proprietary context to third parties without informed user consent.

Missing User Warnings

Low
Confidence
84% confidence
Finding
The markdown instructs users to write reports and other artifacts to local paths but does not warn about overwriting existing files or uncontrolled output locations. In practice this can cause accidental data loss or unsafe writes when users supply sensitive or unexpected paths.

Missing User Warnings

Medium
Confidence
92% confidence
Finding
The skill prominently supports external LLM providers, web search, and remote fetching, yet it does not clearly warn that user questions, retrieved content, and possibly local context may be transmitted to third parties. That omission is dangerous in research workflows because prompts often contain proprietary, personal, or otherwise sensitive material.

Missing User Warnings

Medium
Confidence
93% confidence
Finding
The code sends chunk titles, URLs, and up to 2000 characters of chunk content to an external LLM service via `_call_glm(...)` without any explicit consent gate, redaction step, or user-visible warning at the transmission point. In a deep-research skill, retrieved content can include proprietary, sensitive, or regulated data, so silent exfiltration to a third-party model provider creates a real confidentiality and compliance risk even if the feature is intended for relevance scoring.

Missing User Warnings

Medium
Confidence
90% confidence
Finding
The code sends the research question and selected chunk contents to an external LLM via `_call_glm(...)` without any consent gate, minimization, or disclosure in this component. In a deep-research skill, chunks can easily contain proprietary, personal, or otherwise sensitive text gathered from prior steps, so silent transmission to a third-party model provider creates a real data exposure risk.

Missing User Warnings

Medium
Confidence
90% confidence
Finding
The helper fetches arbitrary URLs supplied indirectly by search results or directly by user input without any allowlist, scheme restriction beyond implicit urllib behavior, or user-visible disclosure. In an agent context, this can trigger unintended outbound requests to attacker-controlled hosts, enabling SSRF-style network access, metadata service probing, or silent data egress via request context and traffic patterns.

Missing User Warnings

Medium
Confidence
92% confidence
Finding
The executive-summary path sends the research question and derived findings to an external LLM service without any explicit user-facing notice or consent flow. In a deep-research skill, analyst output may contain sensitive internal research, proprietary data, or regulated content, so silent transmission creates a meaningful confidentiality and compliance risk.

Missing User Warnings

Medium
Confidence
94% confidence
Finding
Theme generation transmits detailed findings, including claims and source context, to an external LLM with large token budgets and no user warning. Because this skill is specifically designed for deep research and multi-source analysis, the payload is likely to be rich, sensitive, and potentially proprietary, making unauthorized disclosure more dangerous in context than in a generic writing tool.

Session Persistence

Medium
Category
Rogue Agent
Content
## Workflow (Orchestrated Mode)

### Phase 1: Planning
- Analyze question, create slug, make `memory/research/<slug>/` directory
- Generate research plan with dimensions and questions
- Save to `plan.md`
Confidence
88% confidence
Finding
create slug, make `memory/research/<slug>/` directory - Generate research plan with dimensions and questions - Save to `plan.md` ### Phase 2: Research Cycle (repeat up to 8 times) #### Step A: Spawn

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal