Add Subtitle To Video Online

Security checks across malware telemetry and agentic risk

Overview

This appears to be a real cloud video captioning skill, but it is broader than its subtitle-focused name suggests and sends media to a third-party backend.

Install only if you are comfortable sending selected videos, prompts, and edit instructions to NemoVideo's cloud API. Avoid confidential, regulated, or third-party-sensitive footage unless you trust the provider's privacy and retention practices. Keep NEMO_TOKEN private, and confirm before using non-subtitle editing features such as BGM, overlays, timeline edits, or exports.
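As a minimal sketch of the "keep NEMO_TOKEN private" advice, the snippet below reads the token from the environment instead of hard-coding it, and produces a masked form that is safe to log. The helper names `load_token` and `redact` are hypothetical and are not part of the skill or of any NemoVideo API.

```python
import os


def load_token(env=None):
    """Read NEMO_TOKEN from the environment (hypothetical helper)."""
    env = os.environ if env is None else env
    token = env.get("NEMO_TOKEN")
    if not token:
        raise RuntimeError("NEMO_TOKEN is not set")
    return token


def redact(token, visible=4):
    """Return a log-safe form that shows only the last few characters."""
    if len(token) <= visible:
        return "*" * len(token)
    return "*" * (len(token) - visible) + token[-visible:]
```

Logging `redact(load_token())` rather than the raw value keeps the credential out of transcripts and shell history while still letting you tell tokens apart.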

SkillSpector

By NVIDIA

Vulnerability Patterns

Data Exfiltration: External Transmission, Env Variable Harvesting, File System Enumeration
Excessive Agency: Unrestricted Tool Access, Autonomous Decision Making, Scope Creep
Trigger Abuse: Overly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Tool Poisoning: Hidden Instructions, Unicode Deception, Parameter Description Injection
Prompt Injection: Instruction Override, Hidden Instructions, Exfiltration Commands

Findings (4)

Description-Behavior Mismatch

Medium

Confidence: 94%
Finding: The skill is marketed as a subtitle-generation utility, but its instructions expose a much broader remote video-editing pipeline, including timeline manipulation, uploads, audio changes, state inspection, and export. This capability mismatch increases the chance of over-collection, unintended processing, and misuse because users and orchestrators may invoke a more powerful cloud editor than the declared purpose suggests.

Context-Inappropriate Capability

Low

Confidence: 84%
Finding: The skill performs token acquisition, session creation, and credit/balance handling that go beyond a simple local subtitle tool and are only partially aligned with the declared purpose. While these functions may be operationally necessary for the backend, exposing them without clear necessity or boundaries expands the attack surface around authentication state and billing-related interactions.

Vague Triggers

Medium

Confidence: 92%
Finding: The invocation rules route broad, generic video-editing language such as export, edit, add BGM, overlays, and audio-track actions to this skill, despite its declared subtitle-focused purpose. Overbroad matching can cause unintended activation for unrelated prompts, leading users to send media or trigger remote processing they did not intend for this specific skill.
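One way to narrow such triggers is sketched below, as an illustration only: prompts that mention subtitles are accepted, prompts that mention the broader editing actions named in this finding are routed to an explicit confirmation step, and everything else is ignored. The function name `classify_request` and both keyword lists are assumptions made for this example; they are not taken from the skill's actual invocation rules.

```python
import re

# Keywords that match the skill's declared subtitle purpose (assumed list).
SUBTITLE_PATTERN = re.compile(r"\b(subtitles?|captions?|srt)\b", re.IGNORECASE)

# Broader editing actions that should not activate the skill silently (assumed list).
BROAD_PATTERN = re.compile(
    r"\b(bgm|overlays?|timeline|exports?|audio[- ]track)\b", re.IGNORECASE
)


def classify_request(prompt):
    """Return 'confirm', 'subtitle', or 'ignore' for a user prompt."""
    if BROAD_PATTERN.search(prompt):
        return "confirm"   # out-of-scope action: ask the user first
    if SUBTITLE_PATTERN.search(prompt):
        return "subtitle"  # in scope for a subtitle-focused skill
    return "ignore"        # generic video-editing language: do not activate
```

Checking the broad pattern first means a mixed prompt such as "add subtitles and export" still requires confirmation instead of being treated as a pure subtitle request.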

Missing User Warnings

Medium

Confidence: 96%
Finding: The skill instructs the agent to upload user video files to a third-party cloud backend and export processed results, but it does not prominently warn users that their media leaves the local environment. For potentially sensitive videos, this omission creates privacy and compliance risk because users may not understand that remote storage and processing occur.
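The missing warning could take the shape of a consent gate that runs before any upload. The sketch below is hypothetical: `build_upload_notice` and `confirm_upload` are illustrative names, the wording of the notice is an assumption, and no real upload call is shown.

```python
def build_upload_notice(filename, provider="NemoVideo"):
    """Build a prompt telling the user that the file leaves the local machine."""
    return (
        f"'{filename}' will be uploaded to {provider}'s cloud backend for "
        "processing and will leave your local environment. Continue? [y/N] "
    )


def confirm_upload(filename, ask=input):
    """Return True only if the user explicitly agrees; default is to refuse."""
    answer = ask(build_upload_notice(filename))
    return answer.strip().lower() in {"y", "yes"}
```

Passing `ask` as a parameter keeps the gate testable, and treating an empty answer as a refusal means the media stays local unless the user opts in.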

VirusTotal

65/65 vendors flagged this skill as clean.
