YouTube Transcript Extractor

v1.0.0

Extract clean, plain-text transcripts from YouTube videos using a dual fallback system with Supadata API and yt-dlp for fast, accurate results.

1· 1.1k·1 current·1 all-time
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Suspicious
medium confidence
!
Purpose & Capability
SKILL.md claims the primary method is a Supadata API (SUPADATA_API_KEY in .env) with a yt-dlp fallback and exposes a command at /root/clawd/yt-transcript. None of the three shipped scripts reference Supadata or yt-dlp, and the files are named transcript.js/.mjs/yt-transcript.sh rather than an installed binary at the documented path. The actual code uses youtube-captions-scraper, youtubei.js, and direct curl parsing of the YouTube page — which is a plausible way to extract transcripts, but it contradicts the documentation and advertised dependencies.
!
Instruction Scope
SKILL.md instructs callers to run /root/clawd/yt-transcript and references an external API key and yt-dlp fallback. The runtime instructions are inconsistent with the included scripts: the scripts output timestamps (contradicting the claim 'timestamps removed'), fetch YouTube pages with curl, and call Node libraries. The README is vague/incorrect about what will be run and where, giving the agent broad/ambiguous discretion (e.g., 'Primary: Supadata API' but no code).
Install Mechanism
There is no install spec (instruction-only), which reduces install-time risk. However package.json and package-lock.json are included, implying Node dependencies must be installed for the .js/.mjs scripts to work; SKILL.md does not document this. No external downloads, shorteners, or obscure URLs are present in the provided files. The lack of an installation step is an inconsistency to be aware of (user or runtime may need to run npm install).
!
Credentials
SKILL.md says an API key SUPADATA_API_KEY should be stored in .env, but requires.env lists no environment variables and none of the shipped scripts reference SUPADATA_API_KEY or any other secret. The documentation asks for a credential that the code does not use — this is misleading and could cause users to expose a key unnecessarily if they follow the docs.
Persistence & Privilege
The skill does not request always:true, does not declare system-wide config changes, and does not require binaries or config paths. It will perform network requests (curl and library HTTP calls) to fetch YouTube pages, which is expected for its purpose.
What to consider before installing
This skill appears to implement YouTube transcript extraction, but the documentation and shipped code disagree in several places. Before installing or providing any credentials: 1) ask the author which method is intended (Supadata API + yt-dlp, or the included node/curl scripts), 2) do not put any SUPADATA_API_KEY or other secret into an .env unless you confirm the code actually uses it, 3) if you intend to run the Node scripts, run npm install in an isolated environment and review package-lock.json (and consider npm audit), 4) be aware the shell script uses curl to fetch YouTube pages and parses captionTracks (network activity is normal for this task), and 5) if you need stronger assurance, request a corrected SKILL.md that matches the shipped scripts or a provenance link (repo/homepage) so you can verify origin and intent.

Like a lobster shell, security has layers — review code before you run it.

latestvk97avrdag4fx7e3bvrtq06yk6n80vebm
1.1kdownloads
1stars
1versions
Updated 1mo ago
v1.0.0
MIT-0

YouTube Transcript Extraction

Extract high-quality transcripts from YouTube videos using multiple methods.

Commands

# Extract transcript from YouTube URL or video ID
/root/clawd/yt-transcript https://youtu.be/VIDEO_ID
/root/clawd/yt-transcript VIDEO_ID

Features

  • Dual fallback system: Tries Supadata API first, falls back to yt-dlp
  • Auto-generated transcripts: Works even if no manual captions exist
  • Clean output: Returns plain text transcript ready for analysis
  • Fast: API method completes in seconds

Use Cases

  • Summarize long videos without watching
  • Extract key quotes and insights
  • Content research and analysis
  • Create written summaries for videos
  • Extract educational content

Technical Details

  • Primary: Supadata API (fast, clean formatting)
  • Fallback: yt-dlp CLI tool (comprehensive, handles edge cases)
  • Output: Plain text transcript with timestamps removed
  • API key: Stored in .env as SUPADATA_API_KEY

Example Workflow

Rob sends YouTube link → Alto pulls transcript → summarizes key points → Rob decides if worth watching

Saves 10-30 minutes per video!

Comments

Loading comments...