Install
openclaw skills install @c-narcissus/web-video-transcribe-docxOffline-first workflow for turning Chinese web page video or audio into text and Word deliverables. Use when Codex needs to (1) extract playable media streams from arbitrary web pages, including pages that expose MP4, M3U8, MPD, or separate audio streams, (2) download direct media URLs with common headers such as Referer or Origin, (3) run local Chinese ASR with SenseVoice via sherpa-onnx, (4) clean or chapterize the transcript conservatively, or (5) produce TXT and DOCX files from the result. Toutiao pages are a supported special case, not the only target.
openclaw skills install @c-narcissus/web-video-transcribe-docxUse the bundled scripts to extract media, download it, transcribe it offline, and render DOCX output.
Prefer the deterministic scripts before hand-rolling new code.
Use {baseDir} when constructing file paths inside this skill so the instructions stay portable across agents and marketplaces.
python {baseDir}/scripts/bootstrap_env.py once in the target environment.python {baseDir}/scripts/pipeline_web_to_docx.py <url> --output-dir <dir>.python {baseDir}/scripts/download_url.py <url> <output> and then python {baseDir}/scripts/transcribe_sensevoice.py --input <file> --output-txt <txt> --output-docx <docx>.python {baseDir}/scripts/transcribe_sensevoice.py --input <file> --output-txt <txt> --output-docx <docx>..txt, and then render it with python {baseDir}/scripts/transcript_to_docx.py.Referer header for media download. Extract the media stream and convert it to DOCX."python {baseDir}/scripts/pipeline_web_to_docx.py first. If the page is especially stubborn and it is a Toutiao page, python {baseDir}/scripts/pipeline_toutiao_to_docx.py and python {baseDir}/scripts/extract_toutiao_media.py remain available as site-specific fallbacks.python {baseDir}/scripts/download_url.py, then transcribe..txt.python {baseDir}/scripts/transcript_to_docx.py for generic TXT-to-DOCX rendering.scripts/bootstrap_env.py
Install or verify the Python packages used by this skill.scripts/extract_web_media.py
Open a generic web page in a real browser, capture likely media URLs plus common download headers, and emit a JSON manifest.scripts/extract_toutiao_media.py
Open a Toutiao page in a real browser, capture audio/video URLs, and emit a JSON manifest with the same schema as the generic extractor.scripts/download_url.py
Download a direct media URL to disk with a stable user agent, optional headers, and HLS/DASH handling.scripts/transcribe_sensevoice.py
Download the SenseVoice model on demand, segment media, run offline ASR, and emit TXT and optional DOCX.scripts/transcript_to_docx.py
Render timestamped transcripts or chapterized notes into a Word document.scripts/pipeline_web_to_docx.py
Run the generic end-to-end pipeline: extract, download, transcribe, and render.scripts/pipeline_toutiao_to_docx.py
Run the Toutiao-specialized end-to-end pipeline for cases where the generic extractor is not preferred.python {baseDir}/scripts/bootstrap_env.py before first use in a fresh environment.skill-creator/scripts/quick_validate.py.--help and one representative happy path after changing the scripts.