Install
openclaw skills install asr-funasrAutomatic Speech Recognition using OpenAI Whisper (local GPU). Supports Chinese, English, and 90+ languages. Auto-detects language.
openclaw skills install asr-funasrTwo engines for different scenarios:
| Engine | Best For | Chinese Quality | Speed |
|---|---|---|---|
| FunASR SenseVoice (default) | Chinese, Japanese, Korean | ⭐⭐⭐ 简体 | Fast (0.03 RTF) |
| OpenAI Whisper | Multilingual, translation | ⭐⭐ (繁体) | Slower |
# Default: FunASR SenseVoice (best Chinese)
{baseDir}/scripts/asr.py --input audio.mp3
# Whisper for multilingual / translation
{baseDir}/scripts/asr.py --input audio.mp3 --engine whisper
| Option | Default | Description |
|---|---|---|
--input | (required) | Input audio file (mp3, wav, m4a, etc.) |
--engine | funasr | ASR engine: funasr (SenseVoice) or whisper |
--language | auto | Language code: zh, en, ja, ko, etc. (auto-detect if omitted) |
--model | base | Whisper model size: tiny/base/small/medium/large (whisper only) |
--task | transcribe | transcribe or translate (whisper only) |
--output | Write transcript to file (default: stdout) |
iic/SenseVoiceSmall (893MB, auto-downloaded from ModelScope)| Model | VRAM | Speed | Accuracy |
|---|---|---|---|
| tiny | ~1GB | Fastest | Low |
| base | ~1GB | Fast | OK |
| small | ~2GB | Medium | Good |
| medium | ~5GB | Slow | Better |
| large | ~10GB | Slowest | Best |
# Chinese audio → FunASR (default, best quality)
{baseDir}/scripts/asr.py --input meeting.mp3
# Force Chinese language
{baseDir}/scripts/asr.py --input podcast.wav --language zh
# Multilingual audio → Whisper
{baseDir}/scripts/asr.py --input mixed.wav --engine whisper
# Whisper with better model
{baseDir}/scripts/asr.py --input lecture.mp3 --engine whisper --model small
# Translate Chinese speech to English text
{baseDir}/scripts/asr.py --input speech.mp3 --engine whisper --language zh --task translate
# Save transcript to file
{baseDir}/scripts/asr.py --input audio.wav --output transcript.txt
funasr + modelscope (FunASR engine)openai-whisper (Whisper engine)imageio-ffmpeg (bundled ffmpeg binary)~/.cache/)