Install
openclaw skills install bilibili-audio-transcribe
Download audio from Bilibili or b23.tv links and transcribe it into txt, srt, and segment JSON with yt-dlp, ffmpeg, and faster-whisper. Use when a user asks to turn a Bilibili link into text, subtitles, a transcript, or speech-to-text output, especially for Chinese-language videos.
Convert a Bilibili link into local transcript files. Prefer this skill for Bilibili and b23.tv URLs only; do not use it for YouTube or generic web pages.
1. Ensure ffmpeg and ffprobe are on PATH.
2. Run scripts/bootstrap_env.sh or install yt-dlp and faster-whisper manually.
3. Run scripts/transcribe_bilibili.py with the target URL.

Example:
python scripts/transcribe_bilibili.py "https://b23.tv/SSx810h" \
--out-dir ./downloads/bilibili-audio \
--model-size base \
--beam-size 3
Accept only bilibili.com or b23.tv URLs. If the URL points elsewhere, stop and say this skill is the wrong tool.
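The URL gate above can be sketched in a few lines of Python. This is a minimal illustration, not the check used inside scripts/transcribe_bilibili.py; the function name `is_bilibili_url` and the decision to allow subdomains such as www.bilibili.com are assumptions.

```python
from urllib.parse import urlparse

# Domains this skill accepts; subdomains (e.g. www.bilibili.com) are allowed too.
ALLOWED_HOSTS = ("bilibili.com", "b23.tv")


def is_bilibili_url(url: str) -> bool:
    """Return True only for http(s) URLs hosted on bilibili.com or b23.tv."""
    parsed = urlparse(url.strip())
    if parsed.scheme not in ("http", "https"):
        return False
    host = (parsed.hostname or "").lower()
    # Match the domain itself or a true subdomain, so "evilbilibili.com" is rejected.
    return any(host == d or host.endswith("." + d) for d in ALLOWED_HOSTS)


if __name__ == "__main__":
    for candidate in ("https://b23.tv/SSx810h", "https://www.youtube.com/watch?v=abc"):
        verdict = "ok" if is_bilibili_url(candidate) else "wrong tool for this URL"
        print(f"{candidate}: {verdict}")
```

Comparing the parsed hostname rather than searching the raw string avoids accepting URLs that merely contain "bilibili.com" somewhere in the path or query.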
Use scripts/transcribe_bilibili.py instead of rewriting yt-dlp / whisper glue code in the session. The script:
- probes the audio with ffprobe
- transcribes it with faster-whisper
- writes .txt, .srt, and .segments.json

Default to:
- --model-size base
- --beam-size 3
- --language zh

These defaults are optimized for Chinese Bilibili speech. If the video is clearly non-Chinese or mixed-language, rerun with --language auto or a specific language code.
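As a rough illustration of how these settings could reach faster-whisper, the sketch below turns them into keyword arguments for `WhisperModel.transcribe`. The helper name `build_transcribe_options` is hypothetical, and the mappings of `--language auto` to `language=None` (which lets faster-whisper detect the language) and of `--no-vad` to `vad_filter=False` are assumptions about the script, not confirmed behavior.

```python
def build_transcribe_options(language: str = "zh", beam_size: int = 3, no_vad: bool = False) -> dict:
    """Translate CLI-style settings into keyword arguments for WhisperModel.transcribe."""
    return {
        # Assumption: "auto" means no fixed language, so faster-whisper detects it.
        "language": None if language == "auto" else language,
        "beam_size": beam_size,
        # Assumption: voice activity detection is on unless --no-vad is passed.
        "vad_filter": not no_vad,
    }


if __name__ == "__main__":
    print(build_transcribe_options())                 # defaults for Chinese speech
    print(build_transcribe_options(language="auto"))  # mixed-language rerun
    # With faster-whisper installed, the options could be used like this:
    #   from faster_whisper import WhisperModel
    #   model = WhisperModel("base")
    #   segments, info = model.transcribe("audio.m4a", **build_transcribe_options())
```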
By default the script writes into downloads/bilibili-audio/ relative to the current working directory. Keep all three artifacts unless the user explicitly asks for fewer outputs:
- .txt
- .srt
- .segments.json

If a run fails:
- Missing ffmpeg / ffprobe → install the system dependency first
- Missing Python packages → run scripts/bootstrap_env.sh
- Poor transcript quality → try --language auto, a larger model, or --no-vad when speech is clipped

Read references/troubleshooting.md when dependency setup or extraction fails.
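Missing binaries or Python packages can be detected before a run starts. The following is a minimal preflight sketch, assuming the Python packages are importable as `yt_dlp` and `faster_whisper`; the function name `missing_dependencies` is hypothetical and is not part of the bundled scripts.

```python
import importlib.util
import shutil


def missing_dependencies(binaries=("ffmpeg", "ffprobe"), modules=("yt_dlp", "faster_whisper")):
    """Return (missing_binaries, missing_modules) without importing or running anything."""
    # shutil.which returns None when an executable is not found on PATH.
    missing_bins = [b for b in binaries if shutil.which(b) is None]
    # find_spec returns None when a top-level module cannot be located.
    missing_mods = [m for m in modules if importlib.util.find_spec(m) is None]
    return missing_bins, missing_mods


if __name__ == "__main__":
    bins, mods = missing_dependencies()
    if bins:
        print("Install system dependencies first:", ", ".join(bins))
    if mods:
        print("Run scripts/bootstrap_env.sh to install:", ", ".join(mods))
    if not bins and not mods:
        print("All dependencies found.")
```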
- transcribe_bilibili.py: main downloader + ASR pipeline
- bootstrap_env.sh: creates a virtualenv and installs Python dependencies
- requirements.txt: Python package list for the bootstrap script
- troubleshooting.md: dependency and extraction failure playbook