Install
openclaw skills install listenhub-aiTurn ideas into podcasts, explainer videos, voice narration, and AI images via ListenHub. Use when the user wants to "make a podcast", "create an explainer video", "read this aloud", "generate an image", or share knowledge in audio/visual form. Supports: topic descriptions, YouTube links, article URLs, plain text, and image prompts. Requires LISTENHUB_API_KEY environment variable (get from https://listenhub.ai/settings/api-keys).
openclaw skills install listenhub-aiGenerate podcasts, explainer videos, TTS audio, and AI images through shell scripts that wrap the ListenHub API.
Set LISTENHUB_API_KEY before first use. Two options:
Option A — OpenClaw env config (recommended):
Add to ~/.openclaw/openclaw.json under env:
{ "env": { "LISTENHUB_API_KEY": "lh_sk_..." } }
Option B — Shell export:
export LISTENHUB_API_KEY="lh_sk_..."
Get your key: https://listenhub.ai/settings/api-keys
For image generation, also set LISTENHUB_OUTPUT_DIR (defaults to ~/Downloads).
All scripts live at scripts/ relative to this SKILL.md. Resolve the path:
SCRIPTS="$(cd "$(dirname "<path-to-this-SKILL.md>")" && pwd)/scripts"
Dependencies: curl, jq (install if missing).
| Mode | Script | Use Case |
|---|---|---|
| Podcast | create-podcast.sh | 1-2 speaker discussion |
| Explainer | create-explainer.sh + generate-video.sh | Narration + AI visuals |
| TTS | create-tts.sh | Pure voice reading |
| Speech | create-speech.sh | Multi-speaker scripted audio |
| Image | generate-image.sh | AI image generation |
Helper scripts: get-speakers.sh (list voices), check-status.sh (poll progress).
get-speakers.sh to discover them.Auto-detect from user input:
create-podcast.shcreate-explainer.shcreate-tts.shgenerate-image.shIf ambiguous, ask user.
$SCRIPTS/get-speakers.sh --language zh # or en
Returns JSON with data.items[].speakerId. If user doesn't specify a voice, pick the first match for the language.
$SCRIPTS/create-podcast.sh --query "topic" --language zh|en --mode quick|deep|debate --speakers <id1[,id2]> [--source-url URL] [--source-text TEXT]
quick is default mode. debate requires 2 speakers.--source-url / --source-text allowed.Use only when user wants to review/edit the script before audio generation.
Stage 1: $SCRIPTS/create-podcast-text.sh (same args as one-stage)
Review: Poll with check-status.sh --wait, save draft, STOP and wait for user approval.
Stage 2: $SCRIPTS/create-podcast-audio.sh --episode <id> [--scripts modified.json]
$SCRIPTS/create-explainer.sh --content "text" --language zh|en --mode info|story --speakers <id>
$SCRIPTS/generate-video.sh --episode <id>
$SCRIPTS/create-tts.sh --type text|url --content "text or URL" --language zh|en --mode smart|direct --speakers <id>
direct (no content modification). smart fixes grammar/punctuation.$SCRIPTS/create-speech.sh --scripts scripts.json
JSON format: {"scripts": [{"content": "...", "speakerId": "..."}]}
$SCRIPTS/generate-image.sh --prompt "description" [--size 1K|2K|4K] [--ratio 16:9|1:1|9:16|...] [--reference-images "url1,url2"]
$LISTENHUB_OUTPUT_DIR (default ~/Downloads).$SCRIPTS/check-status.sh --episode <id> --type podcast|flow-speech|explainer [--wait] [--timeout 300]
Exit codes: 0=done, 1=failed, 2=timeout (retry safe).
Use --wait for automated polling. Run generation in background for long tasks.
get-speakers.sh, pick first matchcheck-status.sh --waitMatch response language to user input language. Chinese input → Chinese responses. English → English.