Telegram Voice Transcribe
v1.3.1
Transcribe Telegram voice messages and audio notes into text using the OpenAI Whisper API. Use when (1) a user sends a voice message or audio note via Telegr...
⭐ 0 · 479 · 2 current · 3 all-time
by Alfredo Bárcena (@dreadterror)
MIT-0
License: MIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
OpenClaw
Benign
medium confidence
Purpose & Capability
Name, description, SKILL.md and the script align: the script supports Telegram file_id, local files, and URL modes and implements both local Whisper and OpenAI API transcription. Requested secrets (OPENAI_API_KEY, TELEGRAM_BOT_TOKEN) are appropriate to the stated purpose. However, the registry metadata claims no required env vars, while SKILL.md and the code require OPENAI_API_KEY (for API mode) and TELEGRAM_BOT_TOKEN (for --file-id). This mismatch is a documentation inconsistency that should be resolved.
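Because the registry metadata does not declare these variables, it can help to check them yourself before enabling the skill. The following is a minimal sketch, not part of the skill: the mode labels ("api", "file_id", "local") and the function name are hypothetical, and only the variable names come from the scan text above.

```python
import os

# Mapping of mode -> env vars, as described in the scan text.
# The mode labels are hypothetical names used only in this sketch.
REQUIRED_ENV = {
    "api": ["OPENAI_API_KEY"],          # OpenAI API transcription
    "file_id": ["TELEGRAM_BOT_TOKEN"],  # download by Telegram file_id
    "local": [],                        # local Whisper needs no secrets
}


def missing_env(modes, environ=None):
    """Return the required variables that are unset or empty for the given modes."""
    environ = os.environ if environ is None else environ
    needed = []
    for mode in modes:
        for name in REQUIRED_ENV[mode]:
            if name not in needed:
                needed.append(name)
    return [name for name in needed if not environ.get(name)]
```

Calling missing_env(["local"]) returns an empty list, which reflects the point that local mode needs no credentials at all.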
Instruction Scope
Instructions limit actions to fetching Telegram files (via Telegram API), optionally downloading a URL or reading a local file, then transcribing locally or via OpenAI. The example hooks run the script as a preprocessor and prepend the transcript to the message text — all within the stated transcription workflow. No unrelated system paths or credentials are referenced by the runtime instructions.
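A pre-processing hook of the kind described here could look roughly like the sketch below. It is an illustration under assumptions, not the skill's actual hook: the function name, the "[Voice transcript]" prefix, the 60-second default timeout, and the list of pass-through variables are all hypothetical. It runs a transcription command with a timeout and a reduced environment, then prepends whatever the command printed to the message text.

```python
import os
import subprocess

# Variables the child process may see (an assumption for this sketch).
PASS_THROUGH = ("PATH", "SYSTEMROOT", "OPENAI_API_KEY", "TELEGRAM_BOT_TOKEN")


def prepend_transcript(command, message_text, timeout_s=60.0):
    """Run `command` and prepend its stdout to `message_text`.

    On timeout, non-zero exit, or empty output the message is returned unchanged.
    """
    env = {k: os.environ[k] for k in PASS_THROUGH if k in os.environ}
    try:
        result = subprocess.run(
            command,
            capture_output=True,
            text=True,
            timeout=timeout_s,
            env=env,
            check=True,
        )
    except (subprocess.SubprocessError, OSError):
        # Covers TimeoutExpired, CalledProcessError, and a missing executable.
        return message_text
    transcript = result.stdout.strip()
    if not transcript:
        return message_text
    return f"[Voice transcript] {transcript}\n{message_text}"
```

Passing an explicit env keeps unrelated secrets out of the child process, and the timeout stops a stalled transcription from blocking message handling.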
Install Mechanism
There is no automated install spec (instruction-only plus a script file). Setup docs recommend pip installing openai-whisper / static-ffmpeg / torch which is typical. CLAWHUB.md claims an 'auto-fallback — downloads static ffmpeg', but the actual script only tries to import static_ffmpeg if ffmpeg is absent and does not itself download a binary from an arbitrary host; this is a documentation/claim mismatch to verify.
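To verify the ffmpeg claim on your own host, you can check what is actually available before running the skill. This is a small sketch with a hypothetical function name; it only reports whether an ffmpeg binary is on PATH or the static_ffmpeg package is importable, and it neither imports that package nor downloads anything.

```python
import importlib.util
import shutil


def ffmpeg_source():
    """Report where ffmpeg would come from: 'system', 'static_ffmpeg', or None."""
    if shutil.which("ffmpeg"):
        return "system"
    # find_spec only looks the package up; it does not import or run it.
    if importlib.util.find_spec("static_ffmpeg") is not None:
        return "static_ffmpeg"
    return None
```

A result of None means neither path is available, so transcription that depends on ffmpeg is likely to fail until one of them is installed.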
Credentials
The only sensitive env vars used by the code are OPENAI_API_KEY (for API mode) and TELEGRAM_BOT_TOKEN (for downloading files by file_id), which are proportionate to the functionality. Again, the registry metadata incorrectly lists no required env vars; make sure your environment configuration and permission scoping (bot token scope) are deliberate.
Persistence & Privilege
The skill does not request always:true and does not modify other skills or system-wide settings. The recommended hook runs the script as a pre-processor (normal behavior) and the script cleans up temporary files; autonomous invocation is default but not combined with other high privileges here.
Assessment
This skill is generally coherent for transcribing Telegram voice notes, but check these before installing or enabling it:
- Metadata mismatch: the registry metadata claims no required env vars but SKILL.md and the script require OPENAI_API_KEY (for API mode) and TELEGRAM_BOT_TOKEN (for --file-id). Confirm which mode you will use and only provision the minimum credentials.
- Privacy vs API: local (--local) mode keeps audio on your server; API mode sends audio to OpenAI. If privacy is required, use --local and ensure you have the required local models and resources (disk, RAM, possibly GPU).
- Resource & dependencies: installing openai-whisper / torch and downloading models (small/medium) can use hundreds of MBs to GBs and CPU/GPU time. Plan for that on your host.
- Claimed auto-download: CLAWHUB.md states an auto-download of static ffmpeg; the script only tries to import static_ffmpeg. Verify whether your environment will actually provide ffmpeg or static_ffmpeg — do not assume the skill will fetch executables from external servers.
- Network flows: in file-id and API modes the skill calls Telegram API and (optionally) OpenAI API; if you have egress controls, permit only those hosts. The URL mode will fetch arbitrary URLs you pass — avoid untrusted URLs to reduce risk.
- Inspect hooks: if you install the suggested pre-processing hook into OpenClaw, review the hook code to confirm it runs with appropriate timeouts and environment and does not expose extra secrets.
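The network-flow point above can be enforced in code as well as at the firewall. The sketch below assumes the skill talks to the default public endpoints (api.telegram.org for the Telegram Bot API and api.openai.com for the OpenAI API); the function name and the allowlist are illustrative and are not taken from the skill's script.

```python
from urllib.parse import urlsplit

# Default public API hosts (assumed; adjust if you use a proxy or custom endpoint).
ALLOWED_HOSTS = frozenset({"api.telegram.org", "api.openai.com"})


def is_allowed_url(url, allowed_hosts=ALLOWED_HOSTS):
    """Accept only HTTPS URLs whose host exactly matches the allowlist."""
    parts = urlsplit(url)
    return parts.scheme == "https" and parts.hostname in allowed_hosts
```

Matching the full hostname rather than a prefix or substring rejects lookalikes such as api.telegram.org.evil.example. For URL mode, a check like this could gate which URLs are passed to the script at all.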
If these points are acceptable and you only grant the minimal env vars needed for your chosen mode, the skill is consistent with its stated purpose.
Like a lobster shell, security has layers: review code before you run it.
audio · free · latest · speech-to-text · stt · telegram · voice · whisper
