YouTube ASR Summarize (Local)
Summarize YouTube videos with NO subtitles by doing local ASR (yt-dlp + faster-whisper) and extracting a few screenshot frames via ffmpeg. Use when the user...
MIT-0 · Free to use, modify, and redistribute. No attribution required.
⭐ 0 · 196 · 0 current installs · 0 all-time installs
MIT-0
Security Scan
OpenClaw
Suspicious
medium confidencePurpose & Capability
The SKILL.md and the included Python script clearly implement local ASR using yt-dlp, ffmpeg, and faster-whisper which is coherent with the skill name/description. However, the registry metadata claims 'Required binaries: none' and 'Required env vars: none' while the script and README explicitly require yt-dlp and ffmpeg and reference environment variables (WHISPER_MODEL, WHISPER_LANG, WHISPER_COMPUTE, TIMELINE_EVERY). That mismatch between declared requirements and what the skill actually needs is an incoherence.
Instruction Scope
The SKILL.md instructs the user to install yt-dlp, ffmpeg and to pip install faster-whisper and to run the provided script; the script runs yt-dlp/ffmpeg via subprocess, writes transcript/summary files and extracts frames into the specified output directory. These actions are within the stated scope (local download/transcribe/frames). Concern: the instructions and script read environment variables (WHISPER_*, TIMELINE_EVERY) that are not declared in the skill metadata, and the metadata does not warn that the script will download video/audio to disk—so the runtime behavior is broader than the declared requirements.
Install Mechanism
There is no automated install spec (user is instructed to manually install brew packages and pip dependencies). This is low-risk compared to arbitrary remote installers. The script itself is included in the package (so code will run locally), but nothing in the repo pulls code from unknown URLs or runs hidden installers.
Credentials
The skill does not request credentials or secrets and does not contact external non-YouTube endpoints. It does read optional environment variables (WHISPER_MODEL, WHISPER_LANG, WHISPER_COMPUTE, TIMELINE_EVERY) as configuration defaults; these are benign but are not listed in the skill's declared env requirements (metadata says none). No sensitive credentials are requested.
Persistence & Privilege
The skill is not always-enabled and is user-invocable. It does not request persistent privileges or modify other skills or system-wide settings. It writes output into the user-specified --out directory only.
What to consider before installing
This skill appears to implement what it promises (local YouTube audio download + local ASR + summary + optional frames) and does not request credentials, but there are a few mismatches to be aware of: the registry metadata claims no required binaries/env vars but the script requires yt-dlp and ffmpeg and reads WHISPER_* and TIMELINE_EVERY environment variables. Before running: (1) inspect the full script yourself (it's included) to confirm no unexpected network calls; (2) run it in a controlled directory (not as root) because it will download media and write files to the specified --out path; (3) ensure you install yt-dlp, ffmpeg, and faster-whisper from reputable sources; (4) be aware of copyright/privacy implications of downloading videos; and (5) confirm that any environment variables you set are only benign configuration (the script does not require secrets). If you want stronger assurance, ask the author to correct the metadata to list required binaries and environment variables.Like a lobster shell, security has layers — review code before you run it.
Current versionv0.1.0
Download ziplatest
License
MIT-0
Free to use, modify, and redistribute. No attribution required.
SKILL.md
YouTube ASR Summarize (local, no tokens)
Use this skill to summarize a YouTube video even when subtitles are missing by downloading audio and running local speech-to-text.
Quick start
- One-time deps
brew install yt-dlp ffmpeg
- Create venv + install ASR
python3 -m venv .venv
source .venv/bin/activate
pip install faster-whisper
- Run
python3 scripts/youtube_asr_summarize.py \
--url "https://www.youtube.com/watch?v=<id>" \
--out "/tmp/youtube-asr/<id>" \
--model small \
--lang zh \
--frames 1 \
--timeline-every 180
Outputs in --out:
summary.md(含:链接 + 摘要 + 时间轴)transcript.txttranscript.srtframes/frame_01.jpg… (if--frames > 0)
Notes
- Default model
small(CPU/int8) is fast; use--model mediumfor better accuracy. - If you need more control, see
references/workflow.md.
Files
3 totalSelect a file
Select a file to preview.
Comments
Loading comments…
