Free Text Image

Security checks across malware telemetry and agentic risk

Overview

This appears to be a real cloud media-generation skill, but it is broader than its text-to-image branding and can automatically send prompts or uploaded media to a third-party rendering service.

Review before installing. Use this only for prompts and media you are comfortable sending to the NEMO cloud service, require explicit confirmation before uploads or exports, and prefer a dedicated low-privilege token. This is not evidence of malware, but the scope and data flow are broader than the text-to-image presentation suggests.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (6)

Description-Behavior Mismatch

High
Confidence
96% confidence
Finding
The skill is marketed as simple text-to-image generation, but the implementation exposes a much broader remote video-editing and export workflow with session management, uploads, timeline state, and rendering. This mismatch can mislead users and host systems about the real capabilities and data flows, increasing the chance of unintended file uploads, remote processing, and over-privileged activation.

Intent-Code Divergence

Medium
Confidence
93% confidence
Finding
The internal documentation repeatedly describes video rendering, GPU jobs, MP4 generation, and timeline-oriented processing despite branding the skill as image generation. This deceptive or inaccurate framing weakens informed consent and can cause users to provide inputs under false assumptions about what will be processed and returned.

Vague Triggers

Medium
Confidence
84% confidence
Finding
The invocation phrase is generic enough to match ordinary conversation, making accidental activation more likely. In a skill that can automatically connect to a remote service and later process or upload content, ambiguous triggering increases the risk of unintended network actions and user confusion.

Vague Triggers

Medium
Confidence
90% confidence
Finding
The catch-all routing rule sends 'everything else' to the SSE action, which substantially widens the activation surface and makes behavior hard to predict. Because SSE drives backend operations, vague prompts could be transmitted to the remote service without sufficient user intent verification.

Missing User Warnings

Medium
Confidence
95% confidence
Finding
The skill instructs automatic cloud connection and anonymous token acquisition on first use without a clear upfront warning that data and identifiers will be sent to an external service. Silent remote authentication and session creation undermine transparency and informed consent, especially when a generated client identifier is used for backend access.

Missing User Warnings

Medium
Confidence
92% confidence
Finding
The upload and export flows describe sending files and retrieving rendered outputs but do not clearly warn users about privacy, integrity, or remote handling risks. Users may upload sensitive media or trust returned files without understanding that third-party cloud processing and downloadable artifacts are involved.

VirusTotal

62/62 vendors flagged this skill as clean.

View on VirusTotal