Install
openclaw skills install youtube-transcript-pipelineGenerate, clean, correct, translate, and package YouTube interview transcripts with speaker-attributed timestamps into reusable deliverables. Use for workflows involving Deepgram transcription, diarization correction, bilingual output, and structured folder packaging for handoff.
openclaw skills install youtube-transcript-pipelineThis skill turns a YouTube link into a complete transcript package:
Use this when user asks for: transcript production, diarization fixes, interview cleanup, Chinese translation of transcript, or reusable handoff folders.
Place all outputs in a structured folder:
transcripts/ (raw / cleaned / translated)artifacts/ (audio + thumbnails)scripts/ (workflow helpers)snippets/ (small JSONs, helper files)MANIFEST.txtnova-3, word-level output if possibleSuggested command (adapt paths):
python3 resilient_dg_transcribe_resume.py \
--audio-dir /tmp/dg_chunks_work2 \
--max 0 --chunk-sec 180 --model nova-3 --timeout 240
transcript_*.json, transcript_speakers_*.txt)[$TS] Speaker: textsummary_30s_zh.txt, summary_1min_zh.txt)create_youtube_transcript_workbench.sh: creates standardized output folders and manifest, optionally copies key filestranslate_transcript_google.py: batch-translates timestamped transcript while preserving formatrebuild_from_words.py: optional word-level rebuild/reconciliation utility for diarization consistencyworkflow.md: detailed decision points for conservative speaker fixes and translation fallback handling