Grok Imagine Video Generation

Security checks across malware telemetry and agentic risk

Overview

This is a coherent xAI media-generation skill that uses an API key, sends prompts and media URLs to xAI, and downloads generated files as part of its stated purpose.

Install only if you intend to use xAI for media generation. Use your own preferably dedicated xAI API key, avoid submitting private or sensitive prompts/media without consent, monitor usage costs, and save downloads only to an intended workspace output directory.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • Taint TrackingDirect Taint Flow, Variable-Mediated Taint Flow, Credential Exfiltration Chain
  • MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Findings (4)

Tainted flow: 'video_url' from os.getenv (line 337, credential/environment) → requests.get (network output)

Critical
Category
Data Flow
Content
if not video_url:
            raise ValueError("No video URL in response")

        response = requests.get(video_url, stream=True)
        response.raise_for_status()

        os.makedirs(os.path.dirname(output_path), exist_ok=True)
Confidence
84% confidence
Finding
response = requests.get(video_url, stream=True)

Lp3

Medium
Category
MCP Least Privilege
Confidence
91% confidence
Finding
The skill metadata declares environment requirements but does not explicitly declare operational permissions even though the documented usage clearly requires reading an API key from the environment and making outbound network requests to xAI and remote media URLs. This permission mismatch can weaken platform governance, prevent accurate risk review, and cause the skill to run with capabilities users or operators did not clearly approve.

Vague Triggers

Medium
Confidence
83% confidence
Finding
The invocation description is broad enough to match many ordinary image/video generation or editing requests, which increases the chance the skill is auto-selected in situations where users did not specifically intend to send prompts or media to xAI. In this context, over-broad triggering matters because the skill can upload user-provided media and prompts to an external service and download generated content into the workspace.

Missing User Warnings

Medium
Confidence
96% confidence
Finding
The skill description does not clearly warn that prompts, image URLs, and video URLs may be transmitted to an external third-party API for processing. Because this skill is specifically designed to upload and transform user media, the lack of an upfront privacy/data-handling notice raises a real risk of unintentionally exposing sensitive personal, proprietary, or regulated content.

VirusTotal

48/48 vendors flagged this skill as clean.

View on VirusTotal