Text To Speech Video

Security checks across malware telemetry and agentic risk

Overview

This is a disclosed cloud video-generation connector, but users should only use it with scripts and media they are comfortable sending to NemoVideo.

Install only if you trust NemoVideo with the files and prompts you choose to process. Use a limited-purpose NEMO_TOKEN when available, monitor credit usage, and avoid uploading confidential, regulated, or proprietary scripts or media unless you have reviewed the provider’s data-handling terms.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access

Findings (5)

Description-Behavior Mismatch

Medium

Confidence: 91% confidence
Finding: The skill is presented as a text-to-speech video converter, but the documented capabilities include broad media upload, editing, state inspection, and rendering workflows. This scope expansion can cause users or host systems to grant trust and data access under narrower expectations than the skill actually requires, increasing the risk of unintended data handling and misuse.

Description-Behavior Mismatch

Medium

Confidence: 94% confidence
Finding: The routing table allows generic editing behaviors such as adding BGM, checking timeline state, uploading files, and exporting, which goes beyond the advertised task of converting written scripts into narrated videos. That mismatch weakens informed consent and can let the skill operate on broader user content or existing sessions than users reasonably expect from the manifest.

Vague Triggers

Medium

Confidence: 96% confidence
Finding: The catch-all rule routes virtually any unmatched request into generation/editing via SSE, creating an overly broad execution surface. This makes accidental or adversarial prompt phrasing more likely to trigger remote actions, including edits or processing of uploaded/session content, without clear user intent or narrowly scoped confirmation.

Missing User Warnings

Medium

Confidence: 95% confidence
Finding: The skill says it connects to a cloud backend and encourages users to send scripts, but it does not clearly warn users up front that their uploaded files and prompts are transmitted to an external service for processing. Because this skill handles potentially sensitive documents and media, the missing disclosure materially increases privacy and data-governance risk.

Missing User Warnings

Medium

Confidence: 97% confidence
Finding: The skill silently checks for environment credentials and, if absent, auto-provisions an anonymous token tied to a generated client identifier, without a clear user warning. Hidden credential use and automatic account/token creation can lead to unanticipated external authentication, consumption of credits, and opaque linkage of user activity to backend identifiers.

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal