Install
openclaw skills install local-vosk
Local speech-to-text using Vosk. Lightweight, fast, and fully offline once the model is downloaded. Well suited to transcribing Telegram voice messages, audio files, or any other speech-to-text task without cloud APIs.
# Transcribe Telegram voice message
./skills/local-vosk/scripts/transcribe voice_message.ogg
# Transcribe any audio
./skills/local-vosk/scripts/transcribe audio.mp3
# With language (default: en-us)
./skills/local-vosk/scripts/transcribe audio.wav --lang en-us
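To run the commands above over several files at once, a small wrapper along these lines may help. This is a sketch, not part of the skill: `transcribe_all` and the `TRANSCRIBE` variable are hypothetical names, and it assumes the transcribe script prints the transcript to stdout.

```shell
# Sketch: batch wrapper around the skill's transcribe script.
# Assumption: the script writes the transcript to stdout.
TRANSCRIBE="${TRANSCRIBE:-./skills/local-vosk/scripts/transcribe}"

# transcribe_all LANG FILE... (hypothetical helper, not part of the skill)
transcribe_all() {
  lang="$1"
  shift
  for f in "$@"; do
    if [ ! -f "$f" ]; then
      # Report unreadable paths on stderr and keep going
      echo "skip (not a file): $f" >&2
      continue
    fi
    printf '== %s ==\n' "$f"
    "$TRANSCRIBE" "$f" --lang "$lang"
  done
}

# Usage: transcribe_all en-us voice_message.ogg audio.mp3
```

Each transcript is preceded by a `== file ==` header so the output of several files stays readable when redirected into one text file.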
Supported input: any format ffmpeg can decode, including ogg (Telegram voice messages), mp3, wav, m4a, webm, and flac.
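Vosk recognizers are normally fed mono 16-bit PCM audio, commonly at 16 kHz, which is why ffmpeg sits in front of them. The skill's script is not shown here, so the following is only a sketch of how such a conversion could be done by hand; `wav_name` and `to_wav` are hypothetical helpers.

```shell
# Sketch: normalize audio for Vosk. Assumption: the skill's script does
# something similar internally; its source is not shown here.

# wav_name PATH: print PATH's base name with the extension replaced by .wav
wav_name() {
  base="$(basename "$1")"
  printf '%s.wav\n' "${base%.*}"
}

# to_wav SRC [DST]: decode SRC with ffmpeg to 16 kHz mono 16-bit PCM WAV
to_wav() {
  src="$1"
  dst="${2:-$(wav_name "$1")}"
  ffmpeg -nostdin -loglevel error -y -i "$src" -ar 16000 -ac 1 -c:a pcm_s16le "$dst"
}

# Usage: to_wav voice_message.ogg   # writes voice_message.wav in the current directory
```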
Default model: vosk-model-small-en-us-0.15 (~40MB)
Other models available at https://alphacephei.com/vosk/models
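The setup commands that follow, and the transcription itself, rely on a few external tools: ffmpeg for decoding, python3 and pip3 for Vosk, and wget and unzip for the model. A quick check such as this sketch can show what is missing first; `missing_tools` is a hypothetical helper name.

```shell
# missing_tools NAME...: print each NAME that is not found on PATH
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Usage: missing_tools ffmpeg python3 pip3 wget unzip
```

Empty output means every listed tool was found.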
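# Install the Vosk Python package
# (--break-system-packages bypasses pip's externally-managed-environment check)
pip3 install vosk --user --break-system-packages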
# Download model
mkdir -p ~/vosk-models && cd ~/vosk-models
wget https://alphacephei.com/vosk/models/vosk-model-small-en-us-0.15.zip
unzip vosk-model-small-en-us-0.15.zip
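The same download steps can be wrapped so they work for any model name and skip models that are already unpacked. This is a sketch under two assumptions: the archive URL follows the pattern of the one above, and the zip unpacks into a directory named after the model. `MODELS_DIR`, `model_url`, and `fetch_model` are hypothetical names.

```shell
# Sketch: fetch a Vosk model by name into ~/vosk-models.
MODELS_DIR="${MODELS_DIR:-$HOME/vosk-models}"

# model_url NAME: print the download URL for NAME (pattern assumed from the URL above)
model_url() {
  printf 'https://alphacephei.com/vosk/models/%s.zip\n' "$1"
}

# fetch_model NAME: download and unpack NAME unless its directory already exists
fetch_model() {
  name="$1"
  if [ -d "$MODELS_DIR/$name" ]; then
    echo "already present: $MODELS_DIR/$name"
    return 0
  fi
  mkdir -p "$MODELS_DIR" &&
    wget -O "$MODELS_DIR/$name.zip" "$(model_url "$name")" &&
    unzip -q "$MODELS_DIR/$name.zip" -d "$MODELS_DIR" &&
    rm "$MODELS_DIR/$name.zip"
}

# Usage: fetch_model vosk-model-small-en-us-0.15
```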