Transcribee 🐝

v1.2.1

Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis.

⭐ 6 · 3.1k · 10 current · 10 all-time
MIT-0
License: MIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
VirusTotal
Benign
OpenClaw
Suspicious
high confidence
!
Purpose & Capability
The name and description (transcribing YouTube and local files with diarization and auto-organization) match the included code. However, the registry metadata claims no required env vars or binaries, while the code clearly requires ELEVEN_LABS_API_KEY and ANTHROPIC_API_KEY and expects yt-dlp and ffmpeg. README/CLAUDE.md mention Instagram and TikTok support, but the shipped wrapper (transcribe.sh) warns only about YouTube, so the stated scope is inconsistent.
ℹ
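The mismatch between declared and actual requirements can be checked mechanically. The following is a minimal Python sketch, not part of the skill: it assumes the code reads its keys through `process.env` (typical for Node projects, but not confirmed by this review), and the `sample` string is a hypothetical excerpt rather than the skill's real index.ts.

```python
import re

# Matches process.env.NAME as well as process.env["NAME"] / process.env['NAME'].
ENV_REF = re.compile(
    r"""process\.env(?:\.([A-Za-z_][A-Za-z0-9_]*)|\[\s*["']([A-Za-z_][A-Za-z0-9_]*)["']\s*\])"""
)


def referenced_env_vars(source: str) -> set[str]:
    """Return the environment variable names read in a JS/TS source string."""
    return {dot or bracket for dot, bracket in ENV_REF.findall(source)}


def undeclared_env_vars(source: str, declared: set[str]) -> set[str]:
    """Return variables the code reads that the registry metadata does not declare."""
    return referenced_env_vars(source) - declared


# Hypothetical excerpt for illustration only; not copied from the skill.
sample = '''
const eleven = new ElevenLabsClient({ apiKey: process.env.ELEVEN_LABS_API_KEY });
const anthropic = new Anthropic({ apiKey: process.env["ANTHROPIC_API_KEY"] });
'''

# The registry listing declared no env vars, so the declared set is empty.
print(sorted(undeclared_env_vars(sample, declared=set())))
```

A non-empty result means the listing under-reports what the code needs, which is the situation described above.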
Instruction Scope
Runtime instructions and scripts run yt-dlp/ffmpeg (to download and extract media), call the ElevenLabs and Anthropic SDKs, and read/write the user's library at ~/Documents/transcripts/. The code reads existing transcripts to decide categories. It does not appear to access unrelated system credentials, but it will transmit user audio and transcripts to external services (ElevenLabs and Anthropic). This privacy/telemetry consideration is expected for the stated purpose but worth noting.
ℹ
Install Mechanism
There is no install spec; the package includes a package.json and a pnpm lockfile (it uses the npm packages 'elevenlabs' and '@anthropic-ai/sdk'). This is a moderate-risk, standard npm dependency surface: no arbitrary download URLs or extract-from-remote artifacts were found. Running pnpm install will pull dependencies from public registries.
!
Credentials
The code requires ELEVEN_LABS_API_KEY and ANTHROPIC_API_KEY (and expects a local .env in the skill directory), and expects the system binaries yt-dlp and ffmpeg. The registry metadata reported no required env vars or binaries, which is a clear mismatch. Requesting API keys for the transcription and classification services is reasonable for the stated purpose, but the omission in the metadata is a red flag because it hides required credentials.
✓
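A preflight check along these lines would surface the undeclared requirements before a first run. This is a sketch under stated assumptions: the variable and binary names come from the review above, `missing_requirements` is a hypothetical helper, and it inspects only the current process environment, so keys kept solely in the skill's .env file would be reported as missing.

```python
import os
import shutil

# Names taken from the review text above.
REQUIRED_ENV = ("ELEVEN_LABS_API_KEY", "ANTHROPIC_API_KEY")
REQUIRED_BINARIES = ("yt-dlp", "ffmpeg")


def missing_requirements(env=None, which=shutil.which):
    """Return (missing_env_vars, missing_binaries) as two lists.

    `env` and `which` are injectable so the check can be exercised
    without touching the real environment or PATH.
    """
    env = os.environ if env is None else env
    missing_env = [name for name in REQUIRED_ENV if not env.get(name)]
    missing_bins = [name for name in REQUIRED_BINARIES if which(name) is None]
    return missing_env, missing_bins


missing_env, missing_bins = missing_requirements()
print("missing env vars:", missing_env)
print("missing binaries:", missing_bins)
```

Both lists being empty only shows that the prerequisites are present; it says nothing about whether the keys are valid or appropriately scoped.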
Persistence & Privilege
The skill sets always:false and does not request elevated system privileges. It reads/writes a folder under the user's home (~/Documents/transcripts) and creates temporary audio in the OS tmpdir. Autonomous invocation is allowed (platform default); weigh that together with the external API access if you intend to allow agent-initiated runs.
What to consider before installing
Before installing or enabling this skill:
1) Expect it to require two API keys (ELEVEN_LABS_API_KEY and ANTHROPIC_API_KEY) and system binaries (yt-dlp, ffmpeg) even though the registry listing omitted them. Verify this and supply keys only if you trust those services.
2) Be aware that audio and transcripts are uploaded to external services; if privacy-sensitive audio will be transcribed, consider running a local-only alternative.
3) The skill will read your ~/Documents/transcripts/ library and write new transcript folders there. Review or sandbox it if you don't want that folder modified.
4) Verify the skill's source (there is no homepage listed). Prefer installing from a trusted repo, and inspect the .env.example and index.ts for any extra endpoints or hardcoded secrets.
5) If you allow autonomous agent invocation, consider restricting its access or running the skill manually the first few times to confirm behavior.

If you want to go ahead, run it in an isolated environment, provide least-privilege API keys, and review the code for any hidden network calls not documented in README/CLAUDE.md.
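The advice to look for extra endpoints can be partly automated. The sketch below flags literal URLs in a source string whose host is not on an expected list. The host allowlist is an assumption about which API hosts a reviewer would expect, and `sample` is a hypothetical snippet, not code from the skill. It only catches URLs written out in the source; calls made inside SDK dependencies or built from string fragments will not appear.

```python
import re
from urllib.parse import urlparse

# Literal http(s) URLs, stopping at whitespace, quotes, backticks, or brackets.
URL_PATTERN = re.compile(r"https?://[^\s\"'`<>)]+")

# Assumption: the API hosts a reviewer would expect this skill to contact.
EXPECTED_HOSTS = frozenset({"api.elevenlabs.io", "api.anthropic.com"})


def unexpected_hosts(source: str, expected=EXPECTED_HOSTS) -> set[str]:
    """Return hostnames from literal URLs in `source` that are not in `expected`."""
    hosts = {urlparse(url).hostname for url in URL_PATTERN.findall(source)}
    hosts.discard(None)  # urlparse yields None when no host can be parsed
    return {host for host in hosts if host not in expected}


# Hypothetical snippet for illustration only.
sample = '''
await fetch("https://api.anthropic.com/v1/messages", opts);
await fetch("https://collector.example.net/upload", opts);
'''

print(sorted(unexpected_hosts(sample)))
```

Any host it reports is a prompt for manual review rather than proof of wrongdoing, and an empty result does not replace reading the code.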

Like a lobster shell, security has layers: review code before you run it.

latestvk974k56qr417ag626ewag8yp5n805rfy
