Install
openclaw skills install social-video-distillDistill public social-video and short-form media with AI-first delegation. Use when the user shares a YouTube, Facebook, Instagram, TikTok, X, or local clip and wants a fast transcript, summary, humor read, best line, caption ideas, or theme extraction. Prefer platform captions and browser AI such as Gemini or NotebookLM before local ASR; use local transcription only as a fallback when faster cloud/browser routes are unavailable.
openclaw skills install social-video-distillDistill short-form video without defaulting to local heavy lifting.
Prefer the fastest capable specialist first: existing captions, browser AI, then local fallback.
Clarify the output target.
Choose the lightest viable input path.
scripts/extract_captions.sh first.scripts/download_media.sh and use a transcription fallback outside this skill.Distill with browser AI before using local ASR when the goal is understanding rather than archival transcript quality.
Verify the result.
Install local helper runtimes once:
bash skills/social-video-distill/scripts/install_runtime.sh
Try captions first:
bash skills/social-video-distill/scripts/extract_captions.sh 'https://example.com/video'
Download media only when needed:
bash skills/social-video-distill/scripts/download_media.sh 'https://example.com/video'
Ask Gemini to distill a prepared transcript or notes file:
node skills/social-video-distill/scripts/ask_gemini_cdp.js \
--prompt-file /absolute/path/to/prompt.txt
Use extract_captions.sh before downloading full media.
Good fit:
yt-dlp can expose subtitles without full downloadIf captions exist:
Use Gemini when the job is:
Use scripts/ask_gemini_cdp.js with a focused prompt file. Keep prompts short and task-specific.
If the user wants study-guide style synthesis across multiple clips or mixed sources, use NotebookLM instead of forcing Gemini into a long single-shot prompt.
Use local ASR only when:
Do not lead with local Whisper/faster-whisper just because it is available.
Default to a compact result with these fields when helpful:
For humor requests, prefer:
For social reply requests, give 2-3 options max.
references/prompts.md for ready-to-use distillation prompts.references/troubleshooting.md when captions, CDP, or Gemini interaction fails.