Audio Transcribe

v1.0.0

Auto-transcribe voice messages locally using faster-whisper with selectable Whisper models, no API key required.

1· 1.8k· 1 versions· 14 current· 16 all-time· Updated 56m ago· MIT-0
byAlex Knight@aktheknight

Audio Transcription Skill

Auto-transcribe voice messages using faster-whisper (local, no API key needed).

Requirements

pip install faster-whisper

Models download automatically on first use.

Usage

Transcribe a file

python3 /root/clawd/skills/audio-transcribe/scripts/transcribe.py /path/to/audio.ogg

Change model (edit script)

Edit transcribe.py and change:

model = WhisperModel('small', device='cpu', compute_type='int8')  # Options: tiny, base, small, medium, large-v3

Models

ModelSizeVRAM/RAMSpeedUse Case
tiny39 MB~1 GB⚡⚡⚡Quick drafts
base74 MB~1 GB⚡⚡Basic accuracy
small244 MB~2 GBRecommended
medium769 MB~5 GB🐢Better accuracy
large-v31.5 GB~10 GB🐢🐢Best accuracy

Integration

Clawdbot auto-transcribes incoming voice messages when this skill is enabled.

Files

  • scripts/transcribe.py — Main transcription script
  • SKILL.md — This file

Version tags

latestvk97axg2b5fkdcxa60fgtv6nqjs8187re