Youtube Podcast summarizer via Elevenlabs

v1.0.0

Transform YouTube videos into podcast-style voice summaries using ElevenLabs TTS

1· 1.9k· 1 versions· 4 current· 5 all-time· Updated 3h ago· MIT-0

Security Scans

VirusTotalBenign ClawScanBenign Static analysisBenign

Install

openclaw skills install youtube-voice-summarizer-elevenlabs

YouTube Voice Summarizer

Transform any YouTube video into a professional voice summary delivered in under 60 seconds.

What It Does

When a user sends a YouTube URL, this skill:

Extracts the video transcript via Supadata
Generates a concise AI summary via OpenRouter/Cerebras
Converts the summary to natural speech via ElevenLabs
Returns an audio file the user can listen to

Requirements

This skill requires a running backend server. Deploy the summarizer service:

git clone https://github.com/Franciscomoney/elevenlabs-moltbot.git
cd elevenlabs-moltbot
npm install
cp .env.example .env
# Add your API keys to .env
npm start

Required API Keys

Service	Purpose	Get Key
ElevenLabs	Text-to-speech	https://elevenlabs.io
Supadata	YouTube transcripts	https://supadata.ai
OpenRouter	AI summarization	https://openrouter.ai

How to Use

When user sends a YouTube URL:

Step 1: Start the voice summary job

curl -s -X POST http://127.0.0.1:3050/api/summarize \
  -H "Content-Type: application/json" \
  -d '{"url":"YOUTUBE_URL","length":"short","voice":"podcast"}'

Returns: {"jobId": "job_xxx", "status": "processing"}

Step 2: Poll for completion (wait 3-5 seconds between checks)

curl -s http://127.0.0.1:3050/api/status/JOB_ID

Keep polling until status is "completed".

Step 3: Return the audio to user

When complete, the response includes:

result.audioUrl - The MP3 audio URL (send this to the user!)
result.teaser - Short hook text about the content
result.summary - Full text summary
result.keyPoints - Array of key takeaways

Send the user:

The teaser text as a message
The audio URL so they can listen

Voice Options

Voice	Style
`podcast`	Deep male narrator (default)
`news`	British authoritative
`casual`	Friendly conversational
`female_warm`	Warm female voice

Summary Lengths

Length	Duration	Best For
`short`	1-2 min	Quick overview
`medium`	3-5 min	Balanced detail
`detailed`	5-10 min	Comprehensive

Example Flow

User: "Summarize this: https://www.youtube.com/watch?v=dQw4w9WgXcQ"

Start job:

curl -s -X POST http://127.0.0.1:3050/api/summarize \
  -H "Content-Type: application/json" \
  -d '{"url":"https://www.youtube.com/watch?v=dQw4w9WgXcQ","length":"short","voice":"podcast"}'

Poll status with the returned jobId
When complete, send the audioUrl to the user

Text-Only Summary (No Audio)

For faster, cheaper text-only summaries:

curl -s -X POST http://127.0.0.1:3050/api/quick-summary \
  -H "Content-Type: application/json" \
  -d '{"url":"YOUTUBE_URL","length":"short"}'

Troubleshooting

"Video may not have captions"

The video needs subtitles enabled on YouTube
Auto-generated captions may take time on new videos

Audio URL not working

Ensure BASE_URL in .env is publicly accessible
Check firewall allows traffic on port 3050

Cost Per Summary

Service	Cost
Supadata	~$0.001
OpenRouter	~$0.005-0.02
ElevenLabs	~$0.05-0.15
Total	~$0.06-0.17

Version tags

elevenlabsvk9777b6wwpjt6qejke6swqees980aks3latestvk9777b6wwpjt6qejke6swqees980aks3samvk9777b6wwpjt6qejke6swqees980aks3

Runtime requirements

🎙️ Clawdis