Install
openclaw skills install faster-whisper-localLocal speech-to-text using faster-whisper. High-performance transcription with GPU acceleration support. Includes word-level timestamps and distilled models. Use when asked to "transcribe audio", "whisper", or "speech to text".
openclaw skills install faster-whisper-localHigh-performance local speech-to-text using faster-whisper.
Execute the setup script to create a virtual environment and install dependencies. It will automatically detect NVIDIA GPUs for CUDA acceleration.
./setup.sh
Requirements:
Use the transcription script to process audio files.
./scripts/transcribe audio.mp3
./scripts/transcribe audio.mp3 --model large-v3-turbo./scripts/transcribe audio.mp3 --word-timestamps./scripts/transcribe audio.mp3 --json./scripts/transcribe audio.mp3 --vaddistil-large-v3 (default): Best balance of speed and accuracy.large-v3-turbo: Recommended for multilingual or highest accuracy tasks.medium.en, small.en: Faster, English-only versions.small or base) or use --compute-type int8.