Whisper Voice Transcription (whisper.cpp)

v1.1.1

Build and use whisper.cpp for local speech-to-text workflows, with optional cloud fallback when local transcription is not practical.

0· 93·0 current·0 all-time

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for xuxuclassmate/whisper-voice-transcription.

Previewing Install & Setup.
Prompt PreviewInstall & Setup
Install the skill "Whisper Voice Transcription (whisper.cpp)" (xuxuclassmate/whisper-voice-transcription) from ClawHub.
Skill page: https://clawhub.ai/xuxuclassmate/whisper-voice-transcription
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install whisper-voice-transcription

ClawHub CLI

Package manager switcher

npx clawhub@latest install whisper-voice-transcription
Security Scan
VirusTotalVirusTotal
Pending
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
Name and description match the SKILL.md: instructions show how to clone, build, and run whisper.cpp locally and describe optional cloud fallback. No unrelated credentials, binaries, or config paths are requested.
Instruction Scope
Overall instructions stay within the stated purpose (build/run local STT, optional cloud fallback). The fallback guidance to "locate the exact cache or upload path used by that tool" is vague and could lead an agent to read user files if implemented broadly; the doc does warn to restrict searches to expected app cache directories, but an implementer should be explicit about which paths are safe to access.
Install Mechanism
Instruction-only skill (no install spec). Build instructions clone the official ggerganov/whisper.cpp repo and build locally with cmake — this is expected and proportionate. No downloads from unknown/shortened/personal URLs or archive extraction steps are included.
Credentials
No environment variables or credentials are requested. Network access is implied for cloning and model download (GitHub/Hugging Face) but that is appropriate for the described functionality. If using Hugging Face private models or certain cloud APIs, additional creds would be reasonable but none are declared here.
Persistence & Privilege
Skill does not request persistent/always-on presence and does not modify other skills or system-wide settings. It is user-invocable and uses normal agent autonomy settings.
Assessment
This is an operator guide for building and running whisper.cpp and is coherent with its stated purpose. Before installing or following the guide: (1) verify you are cloning the correct upstream repository and, when downloading model files, prefer official release artifacts and verify checksums; (2) be cautious implementing any automated "search cache" fallback — restrict searches to known application cache paths and avoid broad home-directory scans; (3) explicitly prompt the user before sending audio to any cloud provider and confirm which API/endpoint will be used; (4) note that building from source requires developer tools (git, cmake, compiler, ffmpeg) and network access to fetch models. If you need the skill to operate autonomously, consider adding strict path whitelists and user consent prompts to reduce accidental data exposure.

Like a lobster shell, security has layers — review code before you run it.

audiovk97d36f4naj4ez8kzy2zc353ws85a3svlatestvk97d36f4naj4ez8kzy2zc353ws85a3svmediavk97d36f4naj4ez8kzy2zc353ws85a3svspeech-to-textvk97d36f4naj4ez8kzy2zc353ws85a3svsttvk97d36f4naj4ez8kzy2zc353ws85a3svtranscriptionvk97d36f4naj4ez8kzy2zc353ws85a3svvoicevk97d36f4naj4ez8kzy2zc353ws85a3svwhispervk97d36f4naj4ez8kzy2zc353ws85a3sv
93downloads
0stars
3versions
Updated 6d ago
v1.1.1
MIT-0

Whisper Voice Transcription with whisper.cpp

When to use

  • You want local speech-to-text without sending audio to a third party.
  • You need a fallback workflow when a built-in transcription tool fails.
  • You want an operator guide for compiling and running whisper.cpp.

Prerequisites

  • git
  • cmake
  • a C or C++ compiler
  • ffmpeg

Build steps

git clone --depth 1 https://github.com/ggerganov/whisper.cpp.git ~/whisper.cpp
cd ~/whisper.cpp
cmake -B build -DCMAKE_BUILD_TYPE=Release
cmake --build build -j4

Download a model from the official ggerganov/whisper.cpp releases or Hugging Face repository and place it under ~/whisper.cpp/models/.

Standard transcription flow

ffmpeg -y -i input_audio.ogg -ar 16000 -ac 1 -f wav /tmp/voice.wav
~/whisper.cpp/build/bin/whisper-cli \
  -m ~/whisper.cpp/models/ggml-large-v3.bin \
  -f /tmp/voice.wav \
  -l auto \
  --no-timestamps

Fallback workflow

If a higher-level tool fails, first locate the exact cache or upload path used by that tool. Search only within the expected application cache directory instead of scanning the entire home directory.

Cloud fallback

If local transcription is too slow or unavailable, use an approved speech API and tell the user that audio will leave the machine.

Guardrails

  • Download binaries and models only from official sources.
  • Verify hashes when possible.
  • Do not search unrelated directories for audio files.
  • Be explicit when using a cloud provider because that changes the privacy model.

Comments

Loading comments...