Video Downloader Online

Security checks across malware telemetry and agentic risk

Overview

This skill is a cloud video downloader/editor that sends user media or URLs to NemoVideo, but its downloader framing understates broader remote editing and broad catch-all processing behavior.

Install only if you are comfortable treating this as a NemoVideo cloud video editor/downloader, not a local-only downloader. Avoid private or sensitive media unless you trust the provider, and expect URLs, uploaded files, tokens, session IDs, project state, and render jobs to be handled by the remote service.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (6)

Description-Behavior Mismatch

Medium
Confidence
90% confidence
Finding
The skill is presented as a simple video downloader, but the documented API flows include broader editing, generation, state inspection, and export behavior. This mismatch can mislead users and host agents about what the skill will actually do, increasing the risk of unintended data transfer, unexpected processing, and over-broad permissions or invocation.

Description-Behavior Mismatch

Medium
Confidence
88% confidence
Finding
The manifest frames the skill as taking user video URLs/files for download conversion, but the routing rules allow general generate/edit requests under a catch-all path. That broadens the operational scope beyond the declared purpose, which can cause users or orchestration systems to invoke capabilities they did not expect or authorize.

Vague Triggers

Medium
Confidence
86% confidence
Finding
The invocation examples are broad and overlap with ordinary user language, making accidental triggering more likely. In a skill that transmits URLs/files to a remote backend and can initiate processing workflows, ambiguous trigger phrases raise the chance of unintended activation and data exposure.

Vague Triggers

Medium
Confidence
92% confidence
Finding
The catch-all routing rule sends 'everything else' to the SSE editing/generation pathway, which creates an overly broad trigger surface. This can cause unrelated or unclear prompts to invoke remote processing actions unexpectedly, especially since the skill also maintains sessions and state server-side.

Missing User Warnings

Medium
Confidence
94% confidence
Finding
The skill instructs the agent to send video URLs/files, session identifiers, and related metadata to a remote backend, but it does not provide a clear user-facing warning or consent flow. This is dangerous because users may unknowingly transmit sensitive media, URLs, or metadata off-platform to a third party.

Natural-Language Policy Violations

Medium
Confidence
81% confidence
Finding
Forcing session creation with language set to English without user choice is a policy and consent issue that can lead to misprocessing of user content or inaccurate handling of prompts. In a media-processing workflow, this can cause user confusion and incorrect backend behavior, though the direct security impact is limited.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal