Install
openclaw skills install video-intelDownload videos and get transcripts, summaries, or metadata from YouTube, TikTok, Instagram, and X (Twitter). Use when the user shares a video URL and wants...
openclaw skills install video-intelDownloads videos and extracts transcripts using yt-dlp (captions) with OpenAI Whisper fallback.
| Dependency | Purpose | Required? |
|---|---|---|
yt-dlp | Fetch captions and download audio/video | ✅ Always |
python3 | Parse VTT/SRT caption files | ✅ Always |
curl | Call OpenAI Whisper API | ✅ For Whisper fallback |
ffmpeg | Extract audio from TikTok/Instagram/X | ⚠️ Non-YouTube only |
OPENAI_API_KEY | Authenticate with OpenAI Whisper API | ⚠️ Only if captions unavailable |
Install binaries:
# yt-dlp
curl -L https://github.com/yt-dlp/yt-dlp/releases/latest/download/yt-dlp -o ~/bin/yt-dlp && chmod +x ~/bin/yt-dlp
# ffmpeg (Debian/Ubuntu)
sudo apt install ffmpeg
When captions are unavailable (common for TikTok, Instagram, X), the script downloads the audio and uploads it to OpenAI's transcription API (https://api.openai.com/v1/audio/transcriptions). This means:
OPENAI_API_KEY to be setOPENAI_API_KEY, or use a local transcription modelYouTube videos almost always have captions and will not trigger an upload.
~/.openclaw/skills/video-intel/scripts/video-intel.sh
~/.openclaw/skills/video-intel/scripts/video-intel.sh transcript <url>
--lang pt for Portuguese~/.openclaw/skills/video-intel/scripts/video-intel.sh info <url>
~/.openclaw/skills/video-intel/scripts/video-intel.sh captions <url>
~/.openclaw/skills/video-intel/scripts/video-intel.sh download <url> [--format audio|720p|best]
--format audio to get a smaller mp3/tmp/video-intel/ by default