Speaches
Speaches realtime STT and Kokoro TTS providers for OpenClaw
Local Speaches speech providers for OpenClaw.
This plugin registers one speaches provider for both:
- realtime speech-to-text for @openclaw/voice-call streaming
- text-to-speech / telephony TTS using Speaches' OpenAI-compatible /audio/speech endpoint, tested with Kokoro
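To make the OpenAI-compatible /audio/speech call concrete, here is a minimal sketch of the kind of request a client could send to a local Speaches server. It is not the plugin's own code. The helper names build_speech_request and synthesize are hypothetical, the payload fields (model, input, voice) follow the OpenAI speech API shape, and the default values are the ones listed in this README.

```python
import json
import urllib.request


def build_speech_request(
    text,
    base_url="http://127.0.0.1:8000/v1",
    api_key="cant-be-empty",
    model="speaches-ai/Kokoro-82M-v1.0-ONNX",
    voice="af_sky",
):
    """Build (but do not send) a POST request for the /audio/speech endpoint.

    Hypothetical helper: field names follow the OpenAI speech API shape.
    """
    payload = {"model": model, "input": text, "voice": voice}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def synthesize(text, timeout=30):
    """Send the request and return the raw audio bytes.

    Needs a running Speaches server at the configured base URL.
    """
    with urllib.request.urlopen(build_speech_request(text), timeout=timeout) as resp:
        return resp.read()
```

Splitting request construction from sending keeps the first half inspectable without a server; synthesize("Hello") only works once Speaches is listening on the base URL.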
Install
openclaw plugins install clawhub:openclaw-speaches
Example voice-call config
{
  "streaming": {
    "enabled": true,
    "provider": "speaches",
    "providers": {
      "speaches": {
        "baseUrl": "http://127.0.0.1:8000/v1",
        "model": "Systran/faster-distil-whisper-small.en",
        "apiKey": "cant-be-empty",
        "silenceDurationMs": 500,
        "vadThreshold": 0.5,
        "prefixPaddingMs": 0,
        "convertTwilioMulaw": true
      }
    }
  },
  "tts": {
    "provider": "speaches",
    "providers": {
      "speaches": {
        "baseUrl": "http://127.0.0.1:8000/v1",
        "apiKey": "cant-be-empty",
        "model": "speaches-ai/Kokoro-82M-v1.0-ONNX",
        "voice": "af_sky"
      }
    }
  }
}
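The convertTwilioMulaw option suggests that Twilio's 8 kHz mu-law call audio is converted before it is handed to Speaches. This README does not describe how the plugin does that, so the following is only a generic G.711 mu-law to 16-bit PCM decoder, shown to illustrate what such a conversion involves. The function names are hypothetical and the target format (little-endian signed 16-bit) is an assumption.

```python
import struct


def mulaw_byte_to_pcm16(byte):
    """Decode one G.711 mu-law byte to a signed 16-bit PCM sample."""
    u = ~byte & 0xFF                 # mu-law bytes are stored bit-inverted
    sign = u & 0x80                  # top bit: sign
    exponent = (u >> 4) & 0x07       # next three bits: segment number
    mantissa = u & 0x0F              # low four bits: step within the segment
    magnitude = (((mantissa << 3) + 0x84) << exponent) - 0x84
    return -magnitude if sign else magnitude


def mulaw_to_pcm16(data):
    """Decode a mu-law byte string to little-endian 16-bit PCM bytes.

    Assumption: the consumer wants raw signed 16-bit samples; resampling
    from 8 kHz, if needed, is out of scope for this sketch.
    """
    samples = [mulaw_byte_to_pcm16(b) for b in data]
    return struct.pack(f"<{len(samples)}h", *samples)
```

Each input byte becomes two output bytes, so a 20 ms frame of 160 mu-law bytes at 8 kHz decodes to 320 bytes of PCM.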
Global TTS config
{
  "messages": {
    "tts": {
      "provider": "speaches",
      "providers": {
        "speaches": {
          "baseUrl": "http://127.0.0.1:8000/v1",
          "apiKey": "cant-be-empty",
          "model": "speaches-ai/Kokoro-82M-v1.0-ONNX",
          "voice": "af_sky"
        }
      }
    }
  }
}
Environment variables
- SPEACHES_BASE_URL (default: http://127.0.0.1:8000/v1)
- SPEACHES_API_KEY (default: cant-be-empty)
- SPEACHES_STT_MODEL (default: Systran/faster-distil-whisper-small.en)
- SPEACHES_TTS_MODEL (default: speaches-ai/Kokoro-82M-v1.0-ONNX)
- SPEACHES_TTS_VOICE (default: af_sky)
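The fallback behaviour of these variables can be pictured with a small sketch. It only mirrors the variable names and defaults listed here; the helper resolve_speaches_env is hypothetical, and treating an empty value as unset is an assumption, not documented plugin behaviour. How the variables interact with values set in the JSON config is not covered.

```python
import os

# Defaults copied from the list of environment variables in this README.
DEFAULTS = {
    "SPEACHES_BASE_URL": "http://127.0.0.1:8000/v1",
    "SPEACHES_API_KEY": "cant-be-empty",
    "SPEACHES_STT_MODEL": "Systran/faster-distil-whisper-small.en",
    "SPEACHES_TTS_MODEL": "speaches-ai/Kokoro-82M-v1.0-ONNX",
    "SPEACHES_TTS_VOICE": "af_sky",
}


def resolve_speaches_env(env=None):
    """Return the effective settings, falling back to the defaults.

    Assumption: an empty string counts as "not set".
    """
    env = os.environ if env is None else env
    return {name: env.get(name) or default for name, default in DEFAULTS.items()}
```

Calling resolve_speaches_env() with no argument reads the real process environment; passing a plain dict makes the lookup easy to check in isolation.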
Notes
For latency-sensitive phone calls on CPU, smaller faster-whisper models usually keep transcription latency lower than large models. This plugin was tested with OpenClaw voice-call, Speaches, faster-distil-whisper-small.en, and Kokoro ONNX.
