Install
openclaw skills install video-transcript-extractor

Extract text from videos and audio across 20+ platforms including TikTok, YouTube, Bilibili, and Douyin. Uses a multi-level extraction strategy (subtitle track → speech recognition → AI correction) to achieve 95%+ accuracy. Supports batch processing and multi-language content.

🎬 Extract text from videos and audio with multi-platform support
This skill helps AI Agents extract text content from video and audio platforms. It supports 20+ platforms, including Douyin, Bilibili, Xiaohongshu, Weibo, WeChat Video, TikTok, YouTube, and Instagram. With a multi-level extraction strategy (subtitle track → speech recognition → AI correction), it ensures 95%+ accuracy.
- /extract-transcript
- /video-to-text
- /transcript-extraction
- /video-subtitle-extract
- /audio-to-text
- /extract-video-text

Provide a video link, and the Agent automatically identifies the platform and extracts the transcript:
User: Extract transcript from: https://v.douyin.com/xxxxx
Agent: Extracting video transcript...
Video Title: xxx
Duration: 3m 25s
【Transcript Content】
(Full extracted transcript)
【Background Music】
Song: xxx
Artist: xxx
| Parameter | Required | Description |
|---|---|---|
| video_url | Yes | Video share link |
| extract_audio | No | Whether to extract audio, default false |
| auto_correct | No | Whether to enable AI correction, default true |
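
As a minimal sketch, the parameters in the table could be modeled as shown here. The `ExtractRequest` class name is hypothetical and not part of the skill; only the three field names and their defaults come from the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExtractRequest:
    """Hypothetical container for the skill's documented parameters."""

    video_url: str               # required: video share link
    extract_audio: bool = False  # optional, defaults to false per the table
    auto_correct: bool = True    # optional, defaults to true per the table

    def __post_init__(self) -> None:
        # video_url is the only required parameter, so reject empty values.
        if not self.video_url.strip():
            raise ValueError("video_url is required")


request = ExtractRequest(video_url="https://v.douyin.com/xxxxx")
print(request)
```

Passing only `video_url` leaves both optional flags at their documented defaults.
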
| Platform | Share Link Format | Subtitle | Audio |
|---|---|---|---|
| Douyin | https://v.douyin.com/xxx | ✅ | ✅ |
| Kuaishou | https://v.kuaishou.com/xxx | ✅ | ✅ |
| Bilibili | https://b23.tv/xxx or BV number | ✅ | ✅ |
| Xiaohongshu | https://www.xiaohongshu.com/xxx | ✅ | ✅ |
| Weibo | https://weibo.com/xxx | ✅ | ✅ |
| WeChat Video | Video account link | ✅ | ✅ |
| TikTok | https://tiktok.com/@xxx/video/xxx | ✅ | ✅ |
| YouTube | https://youtube.com/xxx | ✅ | ✅ |
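
Platform identification from a share link could look roughly like the sketch below. The `detect_platform` function and the `PLATFORM_HOSTS` mapping are hypothetical names; the host suffixes are copied from the link formats in the table, and WeChat Video is left out because the table gives no URL pattern for it.

```python
from typing import Optional
from urllib.parse import urlparse

# Host suffixes taken from the share-link formats listed in the table.
PLATFORM_HOSTS = {
    "douyin.com": "Douyin",
    "kuaishou.com": "Kuaishou",
    "b23.tv": "Bilibili",
    "xiaohongshu.com": "Xiaohongshu",
    "weibo.com": "Weibo",
    "tiktok.com": "TikTok",
    "youtube.com": "YouTube",
}


def detect_platform(link: str) -> Optional[str]:
    """Return the platform name for a share link, or None if unrecognized."""
    link = link.strip()
    # The table also allows a bare Bilibili BV number instead of a URL.
    if link.startswith("BV"):
        return "Bilibili"
    host = (urlparse(link).hostname or "").lower()
    for suffix, platform in PLATFORM_HOSTS.items():
        # Match the bare domain or any subdomain such as v.douyin.com.
        if host == suffix or host.endswith("." + suffix):
            return platform
    return None


print(detect_platform("https://v.douyin.com/xxx"))
print(detect_platform("https://tiktok.com/@xxx/video/xxx"))
```
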
1. Subtitle track: prioritize the video's native subtitles.
2. Speech recognition: use ASR when no subtitles are available.
3. AI correction: apply intelligent correction after recognition.
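
The three levels above form a fallback chain, sketched here under stated assumptions. The function name and the three injected callables are hypothetical stand-ins for real subtitle, ASR, and correction backends, and applying correction only to ASR output is one reading of "after recognition", not confirmed behavior.

```python
from typing import Callable, Optional, Tuple


def extract_with_fallback(
    video_url: str,
    fetch_subtitles: Callable[[str], Optional[str]],
    transcribe_audio: Callable[[str], str],
    correct_text: Callable[[str], str],
    auto_correct: bool = True,
) -> Tuple[str, str]:
    """Return (transcript, source), where source is 'subtitle' or 'asr'."""
    subtitles = fetch_subtitles(video_url)
    if subtitles:
        # Level 1: native subtitles exist, so use them as-is.
        return subtitles, "subtitle"
    # Level 2: no subtitles, so fall back to speech recognition.
    text = transcribe_audio(video_url)
    # Level 3: optional correction pass over the ASR output.
    if auto_correct:
        text = correct_text(text)
    return text, "asr"


# Stub callables stand in for real backends in this example.
result = extract_with_fallback(
    "https://youtube.com/xxx",
    fetch_subtitles=lambda url: None,
    transcribe_audio=lambda url: "helo everyone",
    correct_text=lambda text: text.replace("helo", "hello"),
)
print(result)  # ('hello everyone', 'asr')
```

Injecting the backends as callables keeps the ordering logic testable without network access.
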
## Video Information
- Title: xxx
- Platform: YouTube
- Duration: 3m 25s
- Published: 2024-01-15
- Author: @xxx
## Transcript Content
(Complete transcript, ready for copy-paste)
## Timeline
- 00:00 - Opening Introduction
- 00:15 - Core Content Begins
- 01:30 - Case Analysis
- 03:00 - Summary
## Background Music
- Song: xxx
- Artist: xxx
- Music Link: https://...
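
A result in the layout above could be assembled with a small renderer like the one sketched here. The `render_result` function and its argument shapes are hypothetical; only the section headings and bullet style come from the template.

```python
from typing import Dict, List, Optional, Tuple


def render_result(
    info: Dict[str, str],
    transcript: str,
    timeline: List[Tuple[str, str]],
    music: Optional[Dict[str, str]] = None,
) -> str:
    """Render an extraction result using the documented section layout."""
    lines = ["## Video Information"]
    lines += [f"- {key}: {value}" for key, value in info.items()]
    lines += ["## Transcript Content", transcript]
    lines.append("## Timeline")
    lines += [f"- {stamp} - {label}" for stamp, label in timeline]
    if music:
        # Background music is only emitted when metadata is available.
        lines.append("## Background Music")
        lines += [f"- {key}: {value}" for key, value in music.items()]
    return "\n".join(lines)


print(render_result(
    {"Title": "xxx", "Platform": "YouTube", "Duration": "3m 25s"},
    "(Complete transcript)",
    [("00:00", "Opening Introduction"), ("00:15", "Core Content Begins")],
    {"Song": "xxx", "Artist": "xxx"},
))
```
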
User: Extract this video transcript: https://youtube.com/watch?v=xxx
Agent: Extracting YouTube video transcript...
✅ Extraction successful!
Video Title: Future Trends in AI Agents
Duration: 5m 12s
【Transcript Content】
Hello everyone, today we're going to talk about the future of AI agents...
(Full transcript ~2000 words)
User: Extract this TikTok transcript: https://tiktok.com/@user/video/xxx
Agent: Extracting TikTok video transcript...
✅ Extraction successful!
Video Title: Quick makeup tutorial
Subtitle Type: Auto-generated subtitles
Accuracy: 98%
【Transcript Content】
(Full transcript content)
User: Batch extract these video transcripts:
1. https://youtube.com/xxx1
2. https://tiktok.com/xxx2
3. https://instagram.com/xxx3
Agent: Batch extracting...
✅ Batch extraction complete!
| # | Platform | Title | Status | Words |
|---|----------|-------|--------|-------|
| 1 | YouTube | AI Tutorial | ✅ Success | 1234 |
| 2 | TikTok | Coding Tips | ✅ Success | 856 |
| 3 | Instagram | Travel Vlog | ✅ Success | 542 |
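
Batch extraction with a summary table like the one above might be structured as in this sketch. `batch_extract` and the `extract_one` callable are hypothetical; the failure row format and the whitespace-based word count are assumptions, and whitespace splitting would undercount languages such as Chinese that do not separate words with spaces.

```python
from typing import Callable, List, Tuple


def batch_extract(
    urls: List[str],
    extract_one: Callable[[str], Tuple[str, str, str]],
) -> str:
    """Run extract_one per URL and return a Markdown summary table.

    extract_one is a hypothetical callable returning
    (platform, title, transcript) for a single link.
    """
    rows = [
        "| # | Platform | Title | Status | Words |",
        "|---|----------|-------|--------|-------|",
    ]
    for index, url in enumerate(urls, start=1):
        try:
            platform, title, transcript = extract_one(url)
            words = len(transcript.split())
            rows.append(f"| {index} | {platform} | {title} | ✅ Success | {words} |")
        except Exception as error:
            # One failed link should not abort the rest of the batch.
            rows.append(f"| {index} | - | - | ❌ Failed: {error} | 0 |")
    return "\n".join(rows)


def fake_extract(url: str) -> Tuple[str, str, str]:
    """Stub extractor used only for this example."""
    if "bad" in url:
        raise ValueError("unsupported link")
    return "YouTube", "AI Tutorial", "hello everyone today"


print(batch_extract(["https://youtube.com/xxx1", "https://bad.example/xxx"], fake_extract))
```
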
1. Parse video link → Get real URL and platform type
2. Call platform API → Get video info and subtitles/audio
3. Format conversion → Output as standard Markdown
4. AI correction (optional) → Improve accuracy
5. Return result → Display to user
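
The five steps can be read as an ordered pipeline over a shared context, sketched below. `run_workflow`, the step names, and the context keys are all hypothetical, and the lambdas are stubs rather than real platform calls.

```python
from typing import Callable, Dict, List, Tuple

Step = Tuple[str, Callable[[Dict], Dict]]


def run_workflow(video_url: str, steps: List[Step], auto_correct: bool = True) -> Dict:
    """Run named steps in order, merging each step's output into the context."""
    context: Dict = {"video_url": video_url}
    for name, step in steps:
        # Step 4 (AI correction) is optional and skipped when disabled.
        if name == "correct" and not auto_correct:
            continue
        context.update(step(context))
    return context


steps: List[Step] = [
    # 1. Parse video link: resolve the real URL and platform type.
    ("parse", lambda ctx: {"real_url": ctx["video_url"], "platform": "YouTube"}),
    # 2. Call platform API: fetch video info and subtitles or audio.
    ("fetch", lambda ctx: {"title": "xxx", "raw_text": "helo everyone"}),
    # 3. Format conversion: produce standard Markdown.
    ("format", lambda ctx: {"markdown": f"## Transcript Content\n{ctx['raw_text']}"}),
    # 4. AI correction (optional).
    ("correct", lambda ctx: {"markdown": ctx["markdown"].replace("helo", "hello")}),
]

# 5. Return result: the caller displays the final Markdown.
result = run_workflow("https://youtube.com/xxx", steps)
print(result["markdown"])
```
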
| Platform | API Endpoint | Notes |
|---|---|---|
| YouTube | /api/v1/youtube/web/fetch_one_video | TikHub API |
| TikTok | /api/v1/tiktok/web/fetch_one_video | TikHub API |
| Douyin | /api/v1/douyin/web/fetch_one_video_by_share_url | TikHub API |
| Bilibili | /api/v1/bilibili/web/fetch_one_video | Requires BV number |
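
Choosing the endpoint for a detected platform amounts to a table lookup, as in this sketch. The paths are copied from the table; the `endpoint_url` helper is hypothetical, the base URL is a placeholder, and request parameters and authentication are deliberately left out because the table does not specify them.

```python
from urllib.parse import urljoin

# Endpoint paths copied from the table above.
ENDPOINTS = {
    "YouTube": "/api/v1/youtube/web/fetch_one_video",
    "TikTok": "/api/v1/tiktok/web/fetch_one_video",
    "Douyin": "/api/v1/douyin/web/fetch_one_video_by_share_url",
    "Bilibili": "/api/v1/bilibili/web/fetch_one_video",
}


def endpoint_url(base_url: str, platform: str) -> str:
    """Join a caller-supplied base URL with the platform's endpoint path."""
    try:
        path = ENDPOINTS[platform]
    except KeyError:
        raise ValueError(f"no endpoint listed for platform: {platform}") from None
    return urljoin(base_url, path)


# The base URL here is a placeholder, not the real service address.
print(endpoint_url("https://api.example.invalid", "Douyin"))
```
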
- music.play_url field for pure audio
- video.play_addr field
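
Reading the two fields named above from an API response could be sketched as follows. The nested-dictionary response shape, the `audio_only` flag, and both helper names are assumptions for illustration; real responses may wrap these fields in additional structure.

```python
from typing import Any, Dict, Optional


def get_path(data: Dict[str, Any], dotted: str) -> Optional[Any]:
    """Follow a dotted key path through nested dicts, or return None."""
    current: Any = data
    for key in dotted.split("."):
        if not isinstance(current, dict) or key not in current:
            return None
        current = current[key]
    return current


def pick_media_source(response: Dict[str, Any], audio_only: bool) -> Optional[Any]:
    """Select music.play_url for pure audio, otherwise video.play_addr."""
    field = "music.play_url" if audio_only else "video.play_addr"
    return get_path(response, field)


# Hypothetical response fragment with placeholder URLs.
response = {
    "music": {"play_url": "https://example.invalid/audio"},
    "video": {"play_addr": "https://example.invalid/video"},
}
print(pick_media_source(response, audio_only=True))
```
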