Install
openclaw skills install comfyui-ttsGenerate speech audio using ComfyUI Qwen-TTS service. Invoke when user needs text-to-speech conversion or voice generation through ComfyUI.
openclaw skills install comfyui-ttsGenerate speech audio using ComfyUI's Qwen-TTS service. This skill allows you to convert text to speech through ComfyUI's API.
Set these environment variables to configure the ComfyUI connection:
export COMFYUI_HOST="localhost" # ComfyUI server host
export COMFYUI_PORT="8188" # ComfyUI server port
export COMFYUI_OUTPUT_DIR="" # Optional: Custom output directory
Generate audio from text using default settings:
scripts/tts.sh "你好,世界"
Customize voice characteristics:
# Specify character and style
scripts/tts.sh "你好" --character "Girl" --style "Emotional"
# Change model size
scripts/tts.sh "你好" --model "3B"
# Specify output file
scripts/tts.sh "你好" --output "/path/to/output.wav"
# Combine options
scripts/tts.sh "你好,这是测试" \
--character "Girl" \
--style "Emotional" \
--model "1.7B" \
--output "~/audio/test.wav"
| Option | Description | Default |
|---|---|---|
--character | Voice character (Girl/Boy/etc.) | "Girl" |
--style | Speaking style (Emotional/Neutral/etc.) | "Emotional" |
--model | Model size (0.5B/1.7B/3B) | "1.7B" |
--output | Output file path | Auto-generated |
--temperature | Generation temperature (0-1) | 0.9 |
--top-p | Top-p sampling | 0.9 |
--top-k | Top-k sampling | 50 |
The skill performs these steps:
/prompt endpoint/history endpointcurl http://$COMFYUI_HOST:$COMFYUI_PORT/system_statsThe skill uses ComfyUI's native API endpoints:
POST /prompt - Submit workflowGET /history - Check job status