Install
openclaw skills install zhipu-ttsText-to-speech conversion using Zhipu AI (BigModel) GLM-TTS model. Use when you need to convert text to audio files with various voice options. Supports Chinese text synthesis with multiple voice personas, speed control, and output formats.
openclaw skills install zhipu-ttsConvert Chinese text to natural-sounding speech using Zhipu AI's GLM-TTS model.
1. Get your API Key: Get a key from Zhipu AI Console
2. Set it in your environment:
export ZHIPU_API_KEY="your-key-here"
Convert text to speech with default settings (tongtong voice, normal speed, WAV format):
bash scripts/text_to_speech.sh "你好,今天天气怎么样"
Specify voice, speed, format, and output filename:
bash scripts/text_to_speech.sh "欢迎使用智能语音服务" xiaochen 1.2 wav greeting.wav
Parameters:
text (required): Chinese text to convert (max 1024 characters)voice (optional): tongtong (default), chuichui, xiaochen, jam, kazi, douji, luodospeed (optional): Speech speed from 0.5 to 2.0 (default: 1.0)output_format (optional): wav (default), pcmoutput_file (optional): Output filename (default: output.{format})Choose tongtong (default) for:
Choose chuichui for:
Choose xiaochen for:
Choose jam/kazi/douji/luodo for:
Recommended speeds:
WAV (recommended):
PCM:
Create a professional greeting:
bash scripts/text_to_speech.sh "您好,感谢致电智能客服,请按1选择中文服务" tongtong 1.0 wav greeting.wav
Generate an energetic announcement:
bash scripts/text_to_speech.sh "热烈欢迎各位嘉宾参加今天的活动!" xiaochen 1.3 wav announcement.wav
Create a calm narration:
bash scripts/text_to_speech.sh "在这个宁静的夜晚,让我们一起欣赏美丽的星空" chuichui 0.9 wav narration.wav
Best practices:
Sample rate: Generated audio uses 24000 Hz sampling rate for optimal quality.
Text Length Issues:
Audio Quality Issues:
File Playback Issues: