openai-tts-python

Security checks across malware telemetry and agentic risk

Overview

This skill is a straightforward OpenAI text-to-speech tool, with expected privacy and cost considerations because it sends requested text to OpenAI and uses an API key.

Install only if you are comfortable sending the text you convert to OpenAI and using your OpenAI API key, which may incur charges. Avoid using it for secrets, regulated data, or confidential documents unless your OpenAI account and policies allow that, and consider invoking it only with explicit TTS requests because its activation keywords are broad.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
  • Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Findings (6)

Missing User Warnings

Medium
Confidence
94% confidence
Finding
The README describes text-to-speech functionality but does not clearly disclose that user-provided text is transmitted to OpenAI's external API for processing. This can mislead operators into sending sensitive, regulated, or confidential content off-host without informed consent, creating privacy, compliance, and data-handling risk.

Vague Triggers

Medium
Confidence
93% confidence
Finding
The activation triggers are broad and include common words like "voice," "podcast," "speak," and "tts," which can cause the skill to activate in unintended contexts. This can lead to accidental invocation, surprise API usage, unintended processing of user text, and increased cost or privacy exposure if sensitive text is converted without a clear explicit request.

Vague Triggers

Medium
Confidence
85% confidence
Finding
The summary describes activation through vague phrase categories rather than strict invocation rules, reinforcing ambiguous routing behavior. While this line alone does not execute anything, it encourages implementations that may over-trigger the skill and makes accidental activation more likely.

Missing User Warnings

Medium
Confidence
94% confidence
Finding
The script transmits user-supplied text to OpenAI's external TTS service, but the CLI flow does not provide an explicit privacy warning or consent checkpoint before sending potentially sensitive file or stdin contents off-host. In a security context this is a real data-exposure risk, especially because the tool accepts arbitrary file input and piped content that users may assume is processed locally.

Vague Triggers

Medium
Confidence
91% confidence
Finding
The activation terms are broad enough to match ordinary conversation words like 'voice', 'audio', or 'speak', which can cause the skill to trigger outside narrowly intended TTS requests. In an agent system, this increases the chance of inappropriate routing, unexpected API usage, and accidental transmission of user content to an external service.

Vague Triggers

Medium
Confidence
88% confidence
Finding
The manifest description defines activation with ambiguous, loosely bounded conditions, including broad contextual cues and keyword matching, which can make the skill eligible in situations not specifically requesting speech synthesis. That creates a prompt-selection weakness where unrelated or sensitive text may be sent to the TTS API unintentionally.

VirusTotal

63/63 vendors flagged this skill as clean.

View on VirusTotal