Oatda Transcribe Audio

v1.0.1

Transcribe audio to text using OATDA's unified audio API. Triggers when the user wants speech-to-text, transcription of meetings, podcasts, voice notes, subt...


Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for devcsde/oatda-transcribe-audio.

Install the skill "Oatda Transcribe Audio" (devcsde/oatda-transcribe-audio) from ClawHub.
Skill page: https://clawhub.ai/devcsde/oatda-transcribe-audio
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Required env vars: OATDA_API_KEY
Required binaries: curl, jq
Config paths to check: ~/.oatda/credentials.json
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install oatda-transcribe-audio

ClawHub CLI


npx clawhub@latest install oatda-transcribe-audio

Security Scan

Capability signals: Requires sensitive credentials
These labels describe what authority the skill may exercise. They are separate from suspicious or malicious moderation verdicts.

VirusTotal: Benign
OpenClaw: Benign (high confidence)

Purpose & Capability
Name/description (transcribe audio via OATDA) align with requested resources: curl, jq, OATDA_API_KEY, and ~/.oatda/credentials.json. Those are expected for an instruction-only wrapper around a remote transcription API.
Instruction Scope
SKILL.md only instructs the agent to resolve the OATDA API key (from env or the declared ~/.oatda/credentials.json), call OATDA endpoints, and format/handle transcription responses. It does not direct the agent to read unrelated files, scan system state, or transmit data to destinations other than oatda.com.
Install Mechanism
No install spec — instruction-only. Nothing is downloaded or written to disk by the skill itself, which minimizes install-time risk.
Credentials
Only a single provider credential (OATDA_API_KEY) and a local credentials path are required. This is proportionate for a service that forwards audio to a third-party API. Required binaries (curl, jq) are standard for the described curl/jq examples.
Persistence & Privilege
always is false and the skill does not request persistent or elevated privileges. It only reads a declared per-user config path and an API key; it does not modify other skills or system-wide settings.
Assessment
This skill appears coherent and limited in scope, but before installing:

  1. Confirm you trust oatda.com: audio you send will be transmitted to that third party.
  2. Store and use a dedicated OATDA_API_KEY with minimal privileges, and don't reuse high-privilege keys.
  3. Verify the contents and permissions of ~/.oatda/credentials.json; the skill reads that file to obtain the API key.
  4. Be careful with sensitive audio (personal data, secrets), because the audio is sent to an external service for transcription.
  5. The SKILL.md tries to avoid printing the full API key, but agents can still expose secrets through logs or mistakes. Consider limiting logging, and rotate keys if they may have been exposed.


Runtime requirements

📝 Clawdis
Bins: curl, jq
Env: OATDA_API_KEY
Config: ~/.oatda/credentials.json
Primary env: OATDA_API_KEY

latest: vk972071z1tcn6d604zpgsd9nsx85kmrh
38 downloads
0 stars
2 versions
Updated 1d ago
Version: v1.0.1
License: MIT-0

OATDA Audio Transcription

Transcribe audio files to text through OATDA's unified audio API.

API Key Resolution

All commands need the OATDA API key. Resolve it inline for each exec call:

export OATDA_API_KEY="${OATDA_API_KEY:-$(cat ~/.oatda/credentials.json 2>/dev/null | jq -r '.profiles[.defaultProfile].apiKey' 2>/dev/null)}"

If the key is empty or null, tell the user to get one at https://oatda.com and configure it.

Security: Never print the full API key. Only verify existence or show first 8 chars.
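
The resolution and masking rules above can be wrapped in two small helpers. This is a minimal sketch, not part of the skill itself: the function names and the OATDA_CREDENTIALS_FILE override are hypothetical, introduced only so the logic can be tried without touching a real credentials file.

```shell
# Sketch only: resolve_oatda_key, mask_key and OATDA_CREDENTIALS_FILE are
# hypothetical names, not part of the OATDA skill.

resolve_oatda_key() {
  local creds="${OATDA_CREDENTIALS_FILE:-$HOME/.oatda/credentials.json}"
  local key="${OATDA_API_KEY:-}"
  if [ -z "$key" ] && [ -r "$creds" ]; then
    # "// empty" turns a missing or null apiKey into an empty string
    key="$(jq -r '.profiles[.defaultProfile].apiKey // empty' "$creds" 2>/dev/null)"
  fi
  if [ -z "$key" ] || [ "$key" = "null" ]; then
    echo "No OATDA API key found. Get one at https://oatda.com and configure it." >&2
    return 1
  fi
  printf '%s' "$key"
}

mask_key() {
  # Print at most the first 8 characters, never the full key
  printf '%.8s...\n' "$1"
}
```

A caller would typically run `key="$(resolve_oatda_key)" && mask_key "$key"` to confirm that a key exists without echoing it in full.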

Model Mapping

User says                                     | Provider | Model
whisper, whisper-1, openai whisper (default)  | openai   | whisper-1
transcription, speech to text, stt            | openai   | whisper-1

Default: openai / whisper-1 if no model specified.

If the user provides provider/model format directly (for example openai/whisper-1), split on /.
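
The mapping and the provider/model split can be expressed in a few lines of shell. The helper name resolve_model is hypothetical, and the sketch assumes the split happens on the first slash only.

```shell
# Sketch only: resolve_model is a hypothetical helper name.
# Prints "<provider> <model>" for a user request.
resolve_model() {
  local request="${1:-}" provider model
  case "$request" in
    */*)
      # provider/model given directly: split on the first slash
      provider="${request%%/*}"
      model="${request#*/}"
      ;;
    *)
      # Every alias in the table, and the no-model default, resolve to the same pair
      provider="openai"
      model="whisper-1"
      ;;
  esac
  printf '%s %s\n' "$provider" "$model"
}
```

For example, `resolve_model "openai/whisper-1"` and `resolve_model "speech to text"` both print `openai whisper-1`.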

⚠️ Models change over time. If a model ID fails, query oatda-list-models with ?type=audio first.

Input Preparation

The transcription endpoint supports:

  • multipart/form-data with a local file upload
  • JSON with a base64 data URL in file
  • JSON with file_base64 for providers that support direct base64 payloads

Maximum audio file size is 25MB.

For local files, prefer multipart upload because it is simpler and avoids large JSON bodies.
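
Checking the file size locally before uploading avoids a wasted request. The sketch below assumes "25MB" means 25 × 1024 × 1024 bytes, which the source does not confirm; check_audio_size and MAX_AUDIO_BYTES are hypothetical names.

```shell
# Sketch only: check_audio_size and MAX_AUDIO_BYTES are hypothetical names.
# Assumption: the 25MB limit is read as 25 * 1024 * 1024 = 26214400 bytes.
check_audio_size() {
  local file="$1" limit="${MAX_AUDIO_BYTES:-26214400}" size
  if [ ! -f "$file" ]; then
    echo "Not a file: $file" >&2
    return 2
  fi
  # wc -c pads its output with spaces on some systems, so strip them
  size="$(wc -c < "$file" | tr -d ' ')"
  if [ "$size" -gt "$limit" ]; then
    echo "Audio is $size bytes, over the $limit byte limit; split it first." >&2
    return 1
  fi
  printf '%s\n' "$size"
}
```

On success the function prints the size in bytes, so it can gate the upload: `check_audio_size meeting.mp3 >/dev/null && ...`.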

Discovering Audio Model Parameters

export OATDA_API_KEY="${OATDA_API_KEY:-$(cat ~/.oatda/credentials.json 2>/dev/null | jq -r '.profiles[.defaultProfile].apiKey' 2>/dev/null)}" && \
curl -s -X GET "https://oatda.com/api/v1/llm/models?type=audio" \
  -H "Authorization: Bearer $OATDA_API_KEY" | jq '.audio_models[] | {id, supported_params}'

Look for:

  • audio_modes containing transcription
  • supported response_format values
  • optional timestamp, diarization, or streaming support

API Call (multipart)

export OATDA_API_KEY="${OATDA_API_KEY:-$(cat ~/.oatda/credentials.json 2>/dev/null | jq -r '.profiles[.defaultProfile].apiKey' 2>/dev/null)}" && \
curl -s -X POST "https://oatda.com/api/v1/llm/transcriptions" \
  -H "Authorization: Bearer $OATDA_API_KEY" \
  -F "provider=<PROVIDER>" \
  -F "model=<MODEL>" \
  -F "file=@<AUDIO_FILE>" \
  -F "response_format=json"

Alternative API Call (base64 JSON)

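# base64 -w 0 is the GNU coreutils flag for unwrapped output; on macOS, base64 -i audio.mp3 is the usual equivalent.
# The data URL is passed as a command-line argument, so large files can exceed the OS argument-length limit; prefer the multipart call for anything beyond short clips.
AUDIO_DATA_URL="data:audio/mpeg;base64,$(base64 -w 0 audio.mp3)"

export OATDA_API_KEY="${OATDA_API_KEY:-$(cat ~/.oatda/credentials.json 2>/dev/null | jq -r '.profiles[.defaultProfile].apiKey' 2>/dev/null)}" && \
curl -s -X POST "https://oatda.com/api/v1/llm/transcriptions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OATDA_API_KEY" \
  -d "$(jq -n \
    --arg provider "<PROVIDER>" \
    --arg model "<MODEL>" \
    --arg file "$AUDIO_DATA_URL" \
    '{provider: $provider, model: $model, file: $file, response_format: "json"}')"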
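
When the base64 route is needed for a larger file, one option is to build the JSON body on a pipe instead of in a shell variable. The following is a sketch under stated assumptions: build_transcription_body is a hypothetical helper, and the audio/mpeg MIME type assumes an MP3 input.

```shell
# Sketch only: build_transcription_body is a hypothetical helper name.
# Writes the JSON request body to stdout so it never passes through argv.
build_transcription_body() {
  local file="$1" provider="$2" model="$3"
  # Strip newlines so the output is unwrapped with both GNU and BSD base64;
  # jq -Rs then reads the whole stream as a single string (".").
  base64 < "$file" | tr -d '\n' | jq -Rs \
    --arg provider "$provider" \
    --arg model "$model" \
    '{provider: $provider, model: $model, file: ("data:audio/mpeg;base64," + .), response_format: "json"}'
}

# Usage (not run here): stream the body to curl on stdin.
#   build_transcription_body audio.mp3 openai whisper-1 | \
#     curl -s -X POST "https://oatda.com/api/v1/llm/transcriptions" \
#       -H "Content-Type: application/json" \
#       -H "Authorization: Bearer $OATDA_API_KEY" \
#       --data-binary @-
```

The 25MB limit still applies, and multipart upload remains the simpler choice for local files.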

Common Parameters

  • language: ISO-639-1 language code like en, de, fr
  • prompt: Context for names, acronyms, or domain-specific terms
  • response_format: json, text, srt, verbose_json, vtt, or diarized_json
  • temperature: 0 to 1
  • timestamp_granularities: word and/or segment
  • chunking_strategy: auto
  • hotwords: Provider-specific keyword hints
  • stream: true if supported by the selected model
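
As an illustration of combining these parameters, the sketch below requests SRT subtitles and adds a language hint only when one is supplied. The names transcribe_subtitles and OATDA_CURL are hypothetical; OATDA_CURL exists only so the command line can be dry-run with a placeholder key.

```shell
# Sketch only: transcribe_subtitles and OATDA_CURL are hypothetical names.
# Usage: transcribe_subtitles <audio-file> [language]
transcribe_subtitles() {
  local file="$1" language="${2:-}"
  # Rebuild the positional parameters as the curl argument list
  set -- -s -X POST "https://oatda.com/api/v1/llm/transcriptions" \
    -H "Authorization: Bearer $OATDA_API_KEY" \
    -F "provider=openai" \
    -F "model=whisper-1" \
    -F "file=@$file" \
    -F "response_format=srt"
  # Send the language field only when the source language is known
  if [ -n "$language" ]; then
    set -- "$@" -F "language=$language"
  fi
  "${OATDA_CURL:-curl}" "$@"
}
```

Running `OATDA_API_KEY=dummy OATDA_CURL=echo transcribe_subtitles talk.mp3 de` prints the arguments instead of sending a request, which is a convenient way to inspect the form fields. Use a placeholder key for dry runs so the real one is never printed.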

Response Format

The API returns JSON like:

{
  "text": "The transcribed text...",
  "language": "en",
  "duration": 42.5,
  "segments": [],
  "words": [],
  "costs": {
    "inputCost": 0,
    "outputCost": 0.0001,
    "totalCost": 0.0001,
    "currency": "USD"
  }
}

Present the text field to the user. Include subtitles, segments, or words if the requested format includes them.
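
With jq already required, the fields of the sample response can be pulled out directly. The helper names below are hypothetical, and the field names are taken from the sample response shown above rather than from a published schema.

```shell
# Sketch only: helper names are hypothetical. Each reads a JSON response on stdin.

# Print the transcript, or nothing if the field is absent
transcript_text() {
  jq -r '.text // empty'
}

# Print one compact JSON object per segment; empty output when there are none
transcript_segments() {
  jq -c '.segments[]?'
}

# Print the total cost and currency reported by the API
transcript_cost() {
  jq -r '"\(.costs.totalCost) \(.costs.currency)"'
}
```

A saved response could then be summarized with `transcript_text < response.json`, where response.json is a placeholder file name.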

Error Handling

HTTP Status | Meaning                           | Action
401         | Invalid API key                   | Tell user to check their key
402         | Insufficient credits              | Tell user to check balance
400         | Bad request / model not supported | Check model or file format and query oatda-list-models with type=audio
413         | File too large                    | Keep audio under 25MB or split it
429         | Rate limited or monthly cap       | Wait briefly and retry once
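
The table can be turned into a small lookup that runs after each request. This is a sketch: explain_status is a hypothetical helper, and the usage comment relies on curl's `-o` and `-w '%{http_code}'` options to capture the status separately from the body.

```shell
# Sketch only: explain_status is a hypothetical helper name.
# Maps an HTTP status code to the action from the table.
explain_status() {
  case "$1" in
    2??) echo "OK" ;;
    400) echo "Bad request or model not supported: check the model and file format, then query oatda-list-models with type=audio" ;;
    401) echo "Invalid API key: check your key" ;;
    402) echo "Insufficient credits: check your balance" ;;
    413) echo "File too large: keep audio under 25MB or split it" ;;
    429) echo "Rate limited or monthly cap reached: wait briefly and retry once" ;;
    *)   echo "Unexpected HTTP status: $1" ;;
  esac
}

# Usage (not run here): write the body to a file and capture only the status.
#   status="$(curl -s -o response.json -w '%{http_code}' -X POST \
#     "https://oatda.com/api/v1/llm/transcriptions" \
#     -H "Authorization: Bearer $OATDA_API_KEY" \
#     -F "provider=openai" -F "model=whisper-1" -F "file=@meeting.mp3")"
#   explain_status "$status"
```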

Example

export OATDA_API_KEY="${OATDA_API_KEY:-$(cat ~/.oatda/credentials.json 2>/dev/null | jq -r '.profiles[.defaultProfile].apiKey' 2>/dev/null)}" && \
curl -s -X POST "https://oatda.com/api/v1/llm/transcriptions" \
  -H "Authorization: Bearer $OATDA_API_KEY" \
  -F "provider=openai" \
  -F "model=whisper-1" \
  -F "file=@meeting.mp3" \
  -F "response_format=json"

Notes

  • Endpoint: /api/v1/llm/transcriptions
  • Prefer multipart upload for local files
  • Use response_format=srt or vtt for subtitles
  • Use language to improve recognition when source language is known
  • Equivalent capability name: transcribe_audio
  • Related skills: oatda-generate-speech, oatda-translate-audio, oatda-list-models
