OpenAI Whisper API

Transcribe audio via the OpenAI Audio Transcriptions API (Whisper).

MIT-0 · Free to use, modify, and redistribute. No attribution required.
by Peter Steinberger (@steipete)
Security Scan
VirusTotal: Suspicious
OpenClaw: Benign (high confidence)
Purpose & Capability
The skill's name and description (Whisper transcriptions) align with the included script, which uses curl to upload an audio file to api.openai.com. Minor documentation inconsistency: the top-level registry metadata lists no required env/bins, but the SKILL.md metadata and the script require curl and OPENAI_API_KEY.
Instruction Scope
SKILL.md and scripts/transcribe.sh limit activity to reading the provided audio file, calling OpenAI's transcription endpoint, and writing the transcript to a local path. The script does not read unrelated files or send data to any third-party endpoints other than api.openai.com.
Install Mechanism
No install spec; the skill is instruction-only with a small included shell script. Nothing is downloaded or written to system locations during install.
Credentials
The only required credential is OPENAI_API_KEY, which is appropriate for this API. Note that SKILL.md suggests storing a key in ~/.clawdbot/clawdbot.json; storing API keys in plaintext files is risky if file permissions are lax. Also note the mismatch between the registry-level 'no required env' and the requirements declared in SKILL.md.
Persistence & Privilege
Skill does not request persistent/system-wide privileges, does not set always:true, and does not modify other skills or global agent settings.
Assessment
This skill is coherent for transcribing audio with OpenAI's Speech-to-Text API: it invokes curl to upload a local audio file to https://api.openai.com/v1/audio/transcriptions and requires an OPENAI_API_KEY. Before installing, ensure you (1) are comfortable providing an OpenAI API key (the script will send the audio to OpenAI), (2) have curl available on the host, and (3) if you choose to store the key in ~/.clawdbot/clawdbot.json, protect that file with strict permissions (chmod 600) or prefer environment variables to avoid plaintext storage. Also be aware of a small metadata mismatch in the registry (the top-level metadata omitted the declared requirement for curl and OPENAI_API_KEY in SKILL.md) — make sure the agent runtime will expose OPENAI_API_KEY when you run this skill.

Current version: v1.0.0

Runtime requirements

☁️ Clawdis
Bins: curl
Env: OPENAI_API_KEY
Primary env: OPENAI_API_KEY
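
Because the registry-level metadata does not declare these requirements, it may be worth checking them by hand before running the skill. The following is a minimal sketch of such a check; `preflight` is a hypothetical helper name and is not part of the skill.

```shell
#!/bin/sh
# Hypothetical preflight check (not shipped with the skill).
# Returns 0 when curl is on PATH and OPENAI_API_KEY is non-empty, 1 otherwise.
preflight() {
  missing=0
  if ! command -v curl >/dev/null 2>&1; then
    echo "curl not found on PATH" >&2
    missing=1
  fi
  if [ -z "${OPENAI_API_KEY:-}" ]; then
    echo "OPENAI_API_KEY is not set" >&2
    missing=1
  fi
  return "$missing"
}
```

Calling `preflight && echo ready` in the agent's shell should show whether the runtime actually exposes the key.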

SKILL.md

OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI’s /v1/audio/transcriptions endpoint.

Quick start

{baseDir}/scripts/transcribe.sh /path/to/audio.m4a

Defaults:

  • Model: whisper-1
  • Output: <input>.txt

Useful flags

{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
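
For orientation, the flags above plausibly correspond to form fields of the /v1/audio/transcriptions endpoint. The sketch below is not the bundled scripts/transcribe.sh, whose contents are not shown here. `transcribe_sketch` and the `CURL` override are hypothetical names, and the mapping of --language, --prompt, and --json onto the API's `language`, `prompt`, and `response_format` fields is an assumption.

```shell
#!/bin/sh
# Hypothetical sketch, NOT the bundled scripts/transcribe.sh.
# Usage: transcribe_sketch AUDIO OUT [LANGUAGE] [PROMPT] [RESPONSE_FORMAT]
transcribe_sketch() {
  audio=$1
  out=$2
  language=${3:-}
  prompt=${4:-}
  response_format=${5:-text}   # "json" is assumed to correspond to --json

  # Fail early if the key is missing.
  : "${OPENAI_API_KEY:?OPENAI_API_KEY is not set}"

  # Build the curl argument list step by step so optional fields can be skipped.
  set -- -sS https://api.openai.com/v1/audio/transcriptions \
    -H "Authorization: Bearer $OPENAI_API_KEY" \
    -F "file=@$audio" \
    -F "model=whisper-1" \
    -F "response_format=$response_format"
  if [ -n "$language" ]; then
    set -- "$@" -F "language=$language"
  fi
  if [ -n "$prompt" ]; then
    # --form-string sends the value literally, so punctuation in the prompt is safe.
    set -- "$@" --form-string "prompt=$prompt"
  fi

  # CURL is a hypothetical override; CURL=echo gives a dry run without network access.
  "${CURL:-curl}" "$@" > "$out"
}
```

With this sketch, `transcribe_sketch /path/to/audio.m4a /tmp/transcript.txt en` would upload the file and write the plain-text transcript to /tmp/transcript.txt. Setting `CURL=echo` first writes the assembled arguments, including the Authorization header, to the output file instead, so use a placeholder key for dry runs.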

API key

Set OPENAI_API_KEY, or configure it in ~/.clawdbot/clawdbot.json:

{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE"
    }
  }
}
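
If the key is kept in the config file rather than in the environment, the security scan's advice is to restrict the file's permissions. A minimal sketch of that step follows; `secure_key_file` is a hypothetical helper name, and the default path is the one given above.

```shell
#!/bin/sh
# Hypothetical helper: restrict the config file to owner read/write (mode 600).
# Usage: secure_key_file [PATH]   (defaults to ~/.clawdbot/clawdbot.json)
secure_key_file() {
  cfg=${1:-$HOME/.clawdbot/clawdbot.json}
  if [ -f "$cfg" ]; then
    chmod 600 "$cfg"
  else
    # Nothing to lock down; the key would have to come from the environment.
    echo "no config file at $cfg; set OPENAI_API_KEY in the environment instead" >&2
  fi
}
```

Running `secure_key_file` with no arguments applies the change to the default location. Exporting OPENAI_API_KEY in the shell that launches the agent avoids the plaintext file altogether.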
