Tom Video Understanding

Security checks across malware telemetry and agentic risk

Overview

This is a small instruction-only video analysis skill whose local processing steps are coherent, with an optional cloud summary step users should treat carefully.

Before installing, use the local ffmpeg/FunASR/Ollama workflow for private videos. Only use the optional cloud summary step after choosing and trusting the provider, because transcripts, frame descriptions, or summaries could leave your machine.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (3)

Description-Behavior Mismatch

Medium
Confidence
92% confidence
Finding
The skill is presented as a local video-understanding workflow, but it explicitly allows a cloud LLM API for summary and analysis. That can cause users to send transcriptions, frame descriptions, or other potentially sensitive video-derived content off-device under a misleading 'local' trust assumption.

Vague Triggers

Medium
Confidence
79% confidence
Finding
The invocation guidance, 'use this skill when you need to understand the content of a video,' is broad enough that an agent may invoke it in many loosely related situations. Over-broad triggering can lead to unnecessary processing of user videos and extracted media, increasing privacy exposure and accidental use without clear user intent.

Missing User Warnings

Medium
Confidence
95% confidence
Finding
The skill mentions cloud LLM usage for summary/analysis but does not provide a clear user-facing warning that video-derived data may be transmitted to an external service. This creates a privacy and compliance risk because users may reasonably expect all processing to remain local based on the skill description.

VirusTotal

67/67 vendors flagged this skill as clean.

View on VirusTotal