stt

v1.0.0

Transcribe audio files using SkillBoss API Hub STT


Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for modestyrichards/modesty-stt.

Install the skill "stt" (modestyrichards/modesty-stt) from ClawHub.
Skill page: https://clawhub.ai/modestyrichards/modesty-stt
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install modesty-stt

ClawHub CLI


npx clawhub@latest install modesty-stt

Security Scan

Capability signals

Requires sensitive credentials

These labels describe what authority the skill may exercise. They are separate from suspicious or malicious moderation verdicts.

VirusTotal: Benign
OpenClaw: Benign (high confidence)

Purpose & Capability

Name/description, SKILL.md, and the included transcribe.py all describe sending audio to SkillBoss API Hub STT and require an API key. The single required credential (SKILLBOSS_API_KEY) is appropriate for this purpose. Minor naming inconsistencies (skill slug 'modesty-stt' and SKILL.md name 'gemini-stt') are cosmetic and not security-relevant.

Instruction Scope

Runtime instructions and the script only read the provided audio file, base64-encode it, and POST JSON to the external API. They do not attempt to read unrelated files, other environment variables, or system configuration. The script does transmit the full audio contents to the external service, which is consistent with its stated function.

Install Mechanism

No install spec; the skill is instruction/code-only and uses only the Python stdlib. Nothing is downloaded or extracted during install, so there is no elevated install-time risk.

Credentials

Only SKILLBOSS_API_KEY is required and used by the script. No unrelated credentials, config paths, or excessive environment access are requested.

Persistence & Privilege

The always setting is false, and the skill does not request persistent or system-wide privileges. It does not modify other skills or system configuration.

Assessment

This skill will upload the entire audio file (base64) to api.heybossai.com using SKILLBOSS_API_KEY. Before installing, confirm you trust the SkillBoss/HeyBossAI service and its privacy policy, and avoid transcribing sensitive audio unless you're comfortable with third-party processing. Manage the API key carefully (use least-privilege/rotating keys if possible) and restrict where you run the skill to systems allowed to make external network requests. Note the minor naming inconsistencies and the lack of a homepage; if you need organizational provenance, ask the publisher for more information.

Like a lobster shell, security has layers — review code before you run it.

Runtime requirements

🎤 Clawdis
OS: Linux · macOS
82 downloads
0 stars
1 version
Updated 6d ago
MIT-0

Speech-to-Text Skill

Transcribe audio files using SkillBoss API Hub. Automatically routes to the best available STT model.

Authentication

Set SKILLBOSS_API_KEY in your environment (e.g., in ~/.env or ~/.clawdbot/.env).

Requirements

  • Python 3.10+ (no external dependencies)
  • SKILLBOSS_API_KEY environment variable

requires.env

SKILLBOSS_API_KEY

Supported Formats

  • .ogg / .opus (Telegram voice messages)
  • .mp3
  • .wav
  • .m4a

Usage

# Transcribe an audio file
python ~/.claude/skills/gemini-stt/transcribe.py /path/to/audio.ogg

# With Clawdbot media
python ~/.claude/skills/gemini-stt/transcribe.py ~/.clawdbot/media/inbound/voice-message.ogg

Options

Option          Description
<audio_file>    Path to the audio file (required)

How It Works

  1. Reads the audio file and base64-encodes it
  2. Authenticates using SKILLBOSS_API_KEY
  3. Sends the encoded audio to the SkillBoss API Hub endpoint /v1/pilot with type: stt
  4. SkillBoss API Hub automatically routes the request to the best available STT model
  5. Returns the transcribed text from data.result.text in the response
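
The steps above can be sketched roughly as follows. This is a minimal illustration, not the contents of transcribe.py. Only the /v1/pilot path, the type: stt field, and the data.result.text response path come from this page. The full URL combines that path with the api.heybossai.com host named in the security assessment. The "audio" field name and the Bearer authorization header are hypothetical placeholders, and whether data is a top-level response key or just the script's variable name is not specified, so the lookup path is left as a parameter.

```python
import base64
import json
import urllib.request

# Assumption: host taken from the security assessment, path from step 3.
API_URL = "https://api.heybossai.com/v1/pilot"


def build_request(audio_bytes: bytes, api_key: str) -> urllib.request.Request:
    """Prepare (but do not send) a JSON POST request for transcription."""
    body = {
        "type": "stt",  # stated in step 3
        # Hypothetical field name; the real payload layout is not documented here.
        "audio": base64.b64encode(audio_bytes).decode("ascii"),
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Hypothetical auth scheme; check transcribe.py for the real header.
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def dig(obj, dotted_path: str):
    """Follow a dotted key path such as "data.result.text" through nested dicts."""
    for key in dotted_path.split("."):
        obj = obj[key]
    return obj
```

Sending the prepared request with urllib.request.urlopen and decoding the JSON body would then let dig(response, "data.result.text") return the transcript, provided the response is shaped that way.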

Example Integration

For Clawdbot voice message handling:

# Transcribe incoming voice message
TRANSCRIPT=$(python ~/.claude/skills/gemini-stt/transcribe.py "$AUDIO_PATH")
echo "User said: $TRANSCRIPT"
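
The same integration could be done from Python with the standard subprocess module. This is a sketch under two assumptions: the script prints the transcript to stdout (as the shell capture above implies), and it lives at the path shown in the usage examples. The transcribe helper and its parameters are hypothetical names introduced for illustration.

```python
import os
import subprocess
import sys

# Script location as shown in the usage examples above.
DEFAULT_SCRIPT = "~/.claude/skills/gemini-stt/transcribe.py"


def transcribe(audio_path: str, script: str = DEFAULT_SCRIPT) -> str:
    """Run the transcription script and return its stdout as the transcript."""
    result = subprocess.run(
        [sys.executable, os.path.expanduser(script), audio_path],
        capture_output=True,
        text=True,
    )
    if result.returncode != 0:
        # Failures are reported on stderr with a non-zero exit code.
        raise RuntimeError(result.stderr.strip() or "transcription failed")
    return result.stdout.strip()
```

Raising an exception on a non-zero exit code keeps a failed transcription from being mistaken for an empty message.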

Error Handling

The script exits with code 1 and prints to stderr on:

  • No SKILLBOSS_API_KEY set
  • File not found
  • API errors
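
A script with this behavior might check its preconditions along these lines. This is only a sketch of the pattern (message to stderr, exit code 1), not the actual transcribe.py; the function names and error messages are hypothetical, and API errors would follow the same path once a request is made.

```python
import os
import sys
from pathlib import Path
from typing import List, Mapping, Optional


def check_preconditions(audio_path: str, env: Optional[Mapping[str, str]] = None) -> Optional[str]:
    """Return an error message, or None if the key and the file are both present."""
    env = os.environ if env is None else env
    if not env.get("SKILLBOSS_API_KEY"):
        return "SKILLBOSS_API_KEY is not set"
    if not Path(audio_path).is_file():
        return f"File not found: {audio_path}"
    return None


def main(argv: List[str]) -> int:
    """Hypothetical entry point: print failures to stderr and return 1."""
    if len(argv) != 2:
        print(f"usage: {argv[0]} <audio_file>", file=sys.stderr)
        return 1
    error = check_preconditions(argv[1])
    if error:
        print(error, file=sys.stderr)
        return 1
    return 0  # the API call itself would happen here


if __name__ == "__main__":
    sys.exit(main(sys.argv))
```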

Notes

  • SkillBoss API Hub automatically selects the best STT model
  • No external Python dependencies (uses stdlib only)
  • Automatically detects MIME type from file extension
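
Extension-based MIME detection, as mentioned in the last note, can be as simple as a lookup table. The mapping below is an assumption that covers the supported formats with their commonly used MIME types; the table in transcribe.py may differ, and guess_audio_mime is a hypothetical helper name.

```python
from pathlib import Path

# Assumed mapping for the supported formats; the script's own table may differ.
AUDIO_MIME_TYPES = {
    ".ogg": "audio/ogg",
    ".opus": "audio/opus",
    ".mp3": "audio/mpeg",
    ".wav": "audio/wav",
    ".m4a": "audio/mp4",
}


def guess_audio_mime(path: str, default: str = "application/octet-stream") -> str:
    """Pick a MIME type from the file extension, ignoring letter case."""
    return AUDIO_MIME_TYPES.get(Path(path).suffix.lower(), default)
```

An explicit table avoids depending on the platform's MIME database, which can vary between systems.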
