Install
openclaw skills install smart-speak-vutran
Multilingual Text-to-Speech (TTS) with intelligent Pinyin-to-Hanzi conversion. Use when the user asks to generate audio for text that contains a mix of Vietn...
This skill provides a high-quality, multilingual text-to-speech workflow that handles Vietnamese, Chinese (including Pinyin), and English seamlessly.
vi-VN-HoaiMyNeural (Hoài Mỹ)
zh-CN-XiaoxiaoNeural (Xiaoxiao)
en-US-AvaNeural (Ava)
ffmpeg.
Before generating audio, the agent must:
Convert any Pinyin in the text to Hanzi (e.g., Nǐ hǎo ma? → 你好吗?). This ensures the zh-CN-XiaoxiaoNeural voice reads it with the correct tones.
Divide the processed text into blocks and assign the appropriate voice.
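The splitting step can be sketched roughly as below. This is an illustration only, not the bundled script's logic: the function name `split_segments` is hypothetical, and it assumes that Chinese text can be detected by the CJK Unified Ideographs range and that everything else goes to a single default voice (it does not try to tell Vietnamese from English).

```python
import re

# Runs of Chinese characters, plus any punctuation that directly follows them.
CJK_RUN = re.compile(r"[\u4e00-\u9fff]+[?!。?!]*")

def split_segments(text,
                   default_voice="vi-VN-HoaiMyNeural",
                   zh_voice="zh-CN-XiaoxiaoNeural"):
    """Split text into {"text", "voice"} blocks (hypothetical helper)."""
    segments = []
    pos = 0
    for match in CJK_RUN.finditer(text):
        # Anything before the Chinese run is read by the default voice.
        if match.start() > pos:
            segments.append({"text": text[pos:match.start()], "voice": default_voice})
        segments.append({"text": match.group(), "voice": zh_voice})
        pos = match.end()
    # Trailing non-Chinese text, if any.
    if pos < len(text):
        segments.append({"text": text[pos:], "voice": default_voice})
    return segments

# Input with the Pinyin already converted to Hanzi.
segments = split_segments("Chào anh Vũ, 你好吗? (你好吗?) là câu chào.")
```

Joining the `text` fields of the result reproduces the input, so no characters are dropped between blocks.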
Example Input: "Chào anh Vũ, 你好吗? (Nǐ hǎo ma?) là câu chào."
Example Segments:
{"text": "Chào anh Vũ, ", "voice": "vi-VN-HoaiMyNeural"}
{"text": "你好吗?", "voice": "zh-CN-XiaoxiaoNeural"}
{"text": " ( ", "voice": "vi-VN-HoaiMyNeural"}
{"text": "你好吗?", "voice": "zh-CN-XiaoxiaoNeural"}
{"text": " ) là câu chào.", "voice": "vi-VN-HoaiMyNeural"}
Use the bundled Python script to generate and merge the audio.
python3 skills/public/smart-speak/scripts/smart_speak.py \
--segments-json '[{"text": "Chào anh Vũ, ", "voice": "vi-VN-HoaiMyNeural"}, ...]' \
--output /home/jackie_chen_phong/.openclaw/workspace/output_name.mp3
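Hand-quoting a JSON string for the shell is easy to get wrong, so one option is to build the argument list in Python. The sketch below is a minimal illustration: it uses the script path and the `--segments-json` and `--output` flags shown above, while the helper name `build_command` and the output file name are hypothetical.

```python
import json
import shlex

SCRIPT = "skills/public/smart-speak/scripts/smart_speak.py"

def build_command(segments, output_path):
    """Return an argv list for the bundled script (hypothetical helper)."""
    # ensure_ascii=False keeps Vietnamese and Chinese characters readable.
    payload = json.dumps(segments, ensure_ascii=False)
    return ["python3", SCRIPT, "--segments-json", payload, "--output", output_path]

segments = [
    {"text": "Chào anh Vũ, ", "voice": "vi-VN-HoaiMyNeural"},
    {"text": "你好吗?", "voice": "zh-CN-XiaoxiaoNeural"},
]
argv = build_command(segments, "output_name.mp3")

# shlex.join gives a single shell-safe string, e.g. for logging or pasting.
print(shlex.join(argv))
```

Passing `argv` to `subprocess.run(argv, check=True)` skips the shell entirely, so the JSON needs no extra quoting in that case.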
Send the resulting MP3 file to the user using the message tool (action=send, filePath).
Save output files under /home/jackie_chen_phong/.openclaw/workspace/.
Ensure the --segments-json string is properly escaped when passed to the shell.
edge-tts is located at /home/jackie_chen_phong/.local/bin/edge-tts.
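A quick pre-flight check for the external tools might look like the sketch below. It is an assumption-laden illustration: the helper `find_tool` is hypothetical, the edge-tts fallback is the path given in the notes above, and since the source lists ffmpeg without saying how it is used, the sketch only checks that it is present.

```python
import shutil

def find_tool(name, fallback=None):
    """Return the path to an executable: PATH first, then the fallback."""
    return shutil.which(name) or fallback

# Fallback path taken from the notes; adjust for other machines.
edge_tts = find_tool("edge-tts", fallback="/home/jackie_chen_phong/.local/bin/edge-tts")
ffmpeg = find_tool("ffmpeg")

if ffmpeg is None:
    print("ffmpeg not found on PATH")
```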