Groq Whisper

Transcribe audio files using Groq's Whisper API (whisper-large-v3). Fast cloud-based speech-to-text with no local model required. Use when receiving voice messages, audio files, or any speech that needs transcription. Supports all major audio formats (ogg, mp3, wav, m4a, webm, flac).

Brice Thrower@directorvector

Install

openclaw skills install @directorvector/groq-whisper

Groq Whisper

Cloud speech-to-text via Groq's Whisper API. No local model, no GPU, no fan spin.

Setup

Get a free API key at console.groq.com

Store it:

bash

mkdir -p ~/.config/groq
echo '{"api_key":"gsk_your_key_here"}' > ~/.config/groq/credentials.json
chmod 600 ~/.config/groq/credentials.json

Or set GROQ_API_KEY env var.

Usage

bash

# Transcribe an audio file
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg

# Specify language (default: en)
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg es

When to use

Call this script whenever you receive an audio/voice message attachment. Pass the file path directly — Groq handles ogg, mp3, wav, m4a, webm, and flac natively. No format conversion needed.

Details

Model: whisper-large-v3 (best accuracy)
Speed: Faster than real-time (typically <2s for a 5-minute clip)
Cost: Free tier available, no credit card required
Privacy: Groq does not retain input data or train on it
Requires: curl, jq, Groq API key

Groq Whisper

Install

Groq Whisper

Setup

Usage

When to use

Details

Related skills