openai-tts-python

v1.0.1

Text-to-speech conversion using OpenAI's TTS API for generating high-quality, natural-sounding audio. Supports 6 voices (alloy, echo, fable, onyx, nova, shimmer), speed control (0.25x-4.0x), HD quality model, multiple output formats (mp3, opus, aac, flac), and automatic text chunking for long content (4096 char limit per request). Use when: (1) User requests audio/voice output with triggers like "read this to me", "convert to audio", "generate speech", "text to speech", "tts", "narrate", "speak", or when keywords "openai tts", "voice", "podcast" appear. (2) Content needs to be spoken rather than read (multitasking, accessibility). (3) User wants specific voice preferences like "alloy", "echo", "fable", "onyx", "nova", "shimmer" or speed adjustments.

1· 2k·8 current·8 all-time
MIT-0
Download zip
LicenseMIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Suspicious
high confidence
Purpose & Capability
The code, SKILL.md, README, and manifest all implement a TTS client that calls OpenAI's audio.speech.create API, supports chunking, multiple voices, formats and pydub for combining — this matches the stated purpose. However the top-level registry summary in the evaluation header incorrectly claimed 'Required env vars: none' while SKILL.md, manifest.json, and the script explicitly require OPENAI_API_KEY.
Instruction Scope
Runtime instructions and the script only read user-provided text (args, files, or stdin), call the OpenAI API, and optionally use pydub/ffmpeg to combine audio. There are no instructions to read unrelated system files or exfiltrate other data. Network access to OpenAI is required (expected for this purpose).
Install Mechanism
No custom install spec is provided (instruction-only install), and required dependencies (openai, pydub, ffmpeg) are standard packages/tools available via pip and system package managers. No downloads from untrusted URLs or archives are present in the manifest or README.
!
Credentials
The manifest, SKILL.md, README, and code all require OPENAI_API_KEY (reasonable for this TTS skill), but the top-level registry metadata reported 'Required env vars: none' and 'Primary credential: none' — an explicit mismatch. This metadata omission reduces transparency about credential needs. No other unrelated secrets are requested, so the single required credential is proportionate, but the metadata inconsistency is a concern.
Persistence & Privilege
The skill does not request persistent/always-on installation and does not modify other skills or system-wide agent settings. Default autonomous invocation is allowed by platform policy (normal) and the skill does not escalate privileges.
What to consider before installing
This package implements a normal OpenAI text-to-speech client and legitimately requires your OPENAI_API_KEY. Before installing or enabling it: (1) confirm the publisher/source (the package lists no homepage and 'Source: unknown'); (2) correct or ask the publisher to correct the registry metadata so the required OPENAI_API_KEY is declared; (3) ensure you are comfortable granting network access to OpenAI and that the key has appropriate billing/usage controls; (4) review the included openai/pydub/ffmpeg install instructions and run in a sandbox if you have doubts; (5) if you plan to deploy broadly, prefer an official or well-audited mirror/repository and verify package versions. The inconsistencies are likely sloppy metadata, not malicious code, but verify source trustworthiness before providing your API key.

Like a lobster shell, security has layers — review code before you run it.

latestvk97czawqmdde5fgqcjyzh0anvh7zxhed

License

MIT-0
Free to use, modify, and redistribute. No attribution required.

Comments