Install
openclaw skills install clonevClawHub Security found sensitive or high-impact capabilities. Review the scan results before using.
Clone any voice and generate speech using Coqui XTTS v2. SUPER SIMPLE - provide a voice sample (6-30 sec WAV) and text, get cloned voice audio. Supports 14+ languages. Use when the user wants to (1) Clone their voice or someone else's voice, (2) Generate speech that sounds like a specific person, (3) Create personalized voice messages, (4) Multi-lingual voice cloning (speak any language with cloned voice).
openclaw skills install clonevDO NOT try to use Docker containers directly.
DO NOT try to interact with coqui-xtts container - it is broken and restarting.
DO NOT try to use APIs or servers.
ONLY USE THE SCRIPT: scripts/clonev.sh
The script handles everything automatically. Just call it with text, voice sample, and language.
Clones any voice from a short audio sample and generates new speech in that voice.
Input:
Output: OGG voice file (cloned voice speaking the text)
Works with: Any voice! Yours, a celebrity, a character, etc.
$(scripts/clonev.sh "Your text here" /path/to/voice_sample.wav language)
That's it! Nothing else needed.
en)VOICE_FILE=$(scripts/clonev.sh "TEXT_HERE" "/path/to/sample.wav" LANGUAGE)
The variable $VOICE_FILE now contains the path to the generated OGG file.
# Generate cloned voice
VOICE=$(/home/bernie/clawd/skills/clonev/scripts/clonev.sh "Hello, this is my cloned voice!" "/mnt/c/TEMP/Recording 25.wav" en)
# Send to Telegram (as voice message)
message action=send channel=telegram asVoice=true filePath="$VOICE"
# Generate Czech voice
VOICE=$(/home/bernie/clawd/skills/clonev/scripts/clonev.sh "Ahoj, tohle je můj hlas" "/mnt/c/TEMP/Recording 25.wav" cs)
# Send
message action=send channel=telegram asVoice=true filePath="$VOICE"
#!/bin/bash
# Generate voice
VOICE=$(/home/bernie/clawd/skills/clonev/scripts/clonev.sh "Task completed!" "/path/to/sample.wav" en)
# Verify file was created
if [ -f "$VOICE" ]; then
echo "Success! Voice file: $VOICE"
ls -lh "$VOICE"
else
echo "Error: Voice file not created"
fi
| Code | Language | Example Usage |
|---|---|---|
en | English | scripts/clonev.sh "Hello" sample.wav en |
cs | Czech | scripts/clonev.sh "Ahoj" sample.wav cs |
de | German | scripts/clonev.sh "Hallo" sample.wav de |
fr | French | scripts/clonev.sh "Bonjour" sample.wav fr |
es | Spanish | scripts/clonev.sh "Hola" sample.wav es |
Full list: en, cs, de, fr, es, it, pl, pt, tr, ru, nl, ar, zh, ja, hu, ko
Good samples:
Bad samples:
/mnt/c/TEMP/Docker-containers/coqui-tts/models-xtts/Make sure you're in the skill directory or use full path:
/home/bernie/clawd/skills/clonev/scripts/clonev.sh "text" sample.wav en
/)ls -la /path/to/sample.wavThe model should auto-download. If not:
cd /mnt/c/TEMP/Docker-containers/coqui-tts
docker run --rm --entrypoint "" \
-v $(pwd)/models-xtts:/root/.local/share/tts \
ghcr.io/coqui-ai/tts:latest \
python3 -c "from TTS.api import TTS; TTS('tts_models/multilingual/multi-dataset/xtts_v2')"
USER: "Clone my voice and say 'hello'"
→ Get: sample path, text="hello", language="en"
→ Run: VOICE=$(/home/bernie/clawd/skills/clonev/scripts/clonev.sh "hello" "/path/to/sample.wav" en)
→ Result: $VOICE contains path to OGG file
→ Send: message action=send channel=telegram asVoice=true filePath="$VOICE"
USER: "Make me speak Czech"
→ Get: sample path, text="Ahoj", language="cs"
→ Run: VOICE=$(/home/bernie/clawd/skills/clonev/scripts/clonev.sh "Ahoj" "/path/to/sample.wav" cs)
→ Send: message action=send channel=telegram asVoice=true filePath="$VOICE"
Generated files are saved to:
/mnt/c/TEMP/Docker-containers/coqui-tts/output/clonev_output.ogg
The script returns this path, so you can use it directly.
scripts/clonev.shcoqui-xtts containerSimple. Just use the script.
Clone any voice. Speak any language. Just use the script.