video-audio-replace

v1.0.0

Replace video audio with TTS voice while preserving original timing. Includes subtitle generation from video using Whisper. Uses ElevenLabs or Edge TTS, alig...

0· 389·2 current·2 all-time
bymarc@synthere
MIT-0
Download zip
LicenseMIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
medium confidence
Purpose & Capability
Name/description match the included scripts: generating subtitles (Whisper), creating TTS segments (ElevenLabs or Edge), aligning and replacing audio with ffmpeg. Required libraries (requests, faster-whisper, edge-tts) and ffmpeg usage are proportionate to the stated purpose.
Instruction Scope
Runtime instructions and scripts operate on local video/audio files, call ffmpeg/sox, and send text to the declared TTS APIs (api.elevenlabs.io / Edge TTS). They do not read unrelated system files or attempt broad environment discovery, but SKILL.md and code assume presence of ELEVENLABS_API_KEY when using the ElevenLabs engine (the registry metadata listed no required env vars — this is inconsistent).
Install Mechanism
This is an instruction-only skill with bundled Python scripts and no installer that downloads arbitrary code. _meta.json lists pip packages; all dependencies are standard public packages. No downloads from untrusted URLs or extracted archives are present.
Credentials
The only credential used is ELEVENLABS_API_KEY (optional if you use Edge TTS). That is appropriate for the ElevenLabs integration. However, the registry metadata reported no required env vars while the code clearly checks ELEVENLABS_API_KEY and will exit if ElevenLabs is selected — metadata should be corrected. Also the default ElevenLabs voice constant is a long alphanumeric string (likely a voice ID) which could be confusing; ensure it is not a misplaced secret.
Persistence & Privilege
always is false, skill does not request persistent presence or modify other skills or system-wide settings. It runs as a local tool operating on user-supplied files.
Assessment
This skill appears to do what it says: it uses Whisper for subtitles, and ElevenLabs or Edge for TTS, then aligns/merges audio using ffmpeg. Before installing or running: 1) If you plan to use ElevenLabs set ELEVENLABS_API_KEY in your environment (or run with --engine edge to avoid sending text to an external API). 2) Review and install the listed Python packages in a virtualenv; run in an isolated environment if you’re concerned about dependencies. 3) Verify the default ElevenLabs voice constant is not a leaked secret (it looks like a voice ID but confirm). 4) Be aware that using ElevenLabs will send your subtitle text to their API — do not upload sensitive content. 5) Update the skill metadata if you maintain it so required env vars are declared accurately.

Like a lobster shell, security has layers — review code before you run it.

latestvk974hsesw6qsgwqhbfv4pqdys581yt0r

License

MIT-0
Free to use, modify, and redistribute. No attribution required.

Comments