Audiomind

Turn any idea into a finished podcast in one command. AudioMind handles ElevenLabs voice narration (29+ voices), AI background music, and server-side audio m...

MIT-0 · Free to use, modify, and redistribute. No attribution required.
5 · 499 · 0 current installs · 0 all-time installs
byWells Wu@wells1137
MIT-0
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
Name/description (produce podcasts with ElevenLabs TTS, AI music, mixing) align with the SKILL.md steps: LLM script -> POST to backend TTS/music endpoints -> upload files -> call backend mix endpoint. No unexpected credentials or unrelated system access are requested.
Instruction Scope
Instructions are scoped to generating scripts and sending them and generated audio to the backend, then uploading MP3s via the agent's manus-upload-file tool to obtain public URLs. This is coherent for the stated purpose, but it explicitly sends user-provided scripts and audio to a third-party backend (audiomind-backend-nine.vercel.app) which is a privacy consideration.
Install Mechanism
Instruction-only skill with no install steps and no code files. No downloads or packages are installed by the skill itself.
Credentials
No required environment variables or secrets. A few optional vars are declared (AUDIOMIND_BACKEND_URL, AUDIOMIND_API_KEY, FAL_KEY) which are reasonable for configuring a self-hosted or pro backend; FAL_KEY's purpose is not explained in detail but is optional, not mandatory.
Persistence & Privilege
Does not request always:true or system-wide changes. It is user-invocable and can be invoked autonomously per platform defaults, which is expected behavior for a skill like this.
Assessment
AudioMind appears internally consistent but defaults to sending your scripts and generated audio to a third-party backend (audiomind-backend-nine.vercel.app operated by @wells1137). If the content you create is sensitive, do not use the public backend. Instead self-host the backend (the SKILL.md points to github.com/wells1137/audiomind-backend) and set AUDIOMIND_BACKEND_URL, or only use it for non-sensitive material. Be cautious about supplying any API keys to the public operator; prefer using your own backend so ElevenLabs keys remain under your control. If you want more assurance, review the referenced GitHub backend source and verify the operator before granting or entering any credentials.

Like a lobster shell, security has layers — review code before you run it.

Current versionv3.3.0
Download zip
audiovk97cmpdtxsc88svae3t0xa4d2h8273n2elevenlabsvk97cmpdtxsc88svae3t0xa4d2h8273n2falvk97cmpdtxsc88svae3t0xa4d2h8273n2latestvk97apfb1zsdjgdzzftxemp35cx8278armusicvk97cmpdtxsc88svae3t0xa4d2h8273n2podcastvk97cmpdtxsc88svae3t0xa4d2h8273n2sfxvk97cmpdtxsc88svae3t0xa4d2h8273n2speechvk97cmpdtxsc88svae3t0xa4d2h8273n2ttsvk97cmpdtxsc88svae3t0xa4d2h8273n2voice-clonevk97cmpdtxsc88svae3t0xa4d2h8273n2

License

MIT-0
Free to use, modify, and redistribute. No attribution required.

SKILL.md

AudioMind v3: The AI Podcast Studio

AudioMind turns a single sentence into a fully-produced podcast. It handles scripting, ElevenLabs voice narration, AI background music, and server-side audio mixing — all from one Manus command.

No setup required. The public shared backend works out of the box. Just install and start creating.


Quick Start

Install:

clawhub install audiomind

Use immediately (no configuration needed):

"Use AudioMind to create a 3-minute podcast about the future of AI agents."

That's it. AudioMind uses the public shared backend by default — 20 free generations per month, no API key required.


Configuration

VariableRequiredDescription
AUDIOMIND_BACKEND_URLOptionalYour own Vercel backend URL. Defaults to the public shared backend.
AUDIOMIND_API_KEYOptionalPro API key for unlimited generations. Get one at the landing page.

Free Tier (default): 20 generations/month tracked by IP. No configuration needed.

Pro Tier: Set AUDIOMIND_API_KEY with your Pro key for unlimited access.

Self-hosted: Deploy your own backend from github.com/wells1137/audiomind-backend and set AUDIOMIND_BACKEND_URL to your instance.


How It Works

When you ask Manus to create a podcast, the agent performs these steps automatically:

  1. Write Script — The agent uses its built-in LLM to write a structured podcast script based on your topic and desired length.

  2. Generate NarrationPOST {BACKEND_URL}/api/workflow/generate_tts with the script. Returns MP3 audio narrated by an ElevenLabs voice.

  3. Generate MusicPOST {BACKEND_URL}/api/workflow/generate_music with a mood/style prompt. Returns a background music MP3.

  4. Upload Audio — The agent uploads both MP3 files using manus-upload-file to obtain public URLs for the mixing step.

  5. Mix Final AudioPOST {BACKEND_URL}/api/workflow/mix_audio with { narration_url, music_url }. The backend mixes them with proper levels using ffmpeg and returns the final podcast MP3.

  6. Deliver — The agent saves and presents the finished podcast to you.


Example Prompts

  • "Create a 5-minute podcast about the history of jazz with a smooth jazz background."
  • "Make a daily news briefing about AI developments, formal tone, upbeat intro music."
  • "Generate a meditation podcast, 10 minutes, calm narration, ambient soundscape."
  • "Produce a tech explainer on quantum computing for a general audience."

Security

All API keys (ElevenLabs) are stored server-side. The skill file contains zero credentials. This architecture passes VirusTotal and ClawHub security scans. See the GitHub repo for the full backend source code.


Changelog

v3.3.0 — Removed local tools/start_server.sh entirely (not needed in v3 architecture). Declared FAL_KEY as optional env. Resolves all OpenClaw metadata inconsistency warnings.

v3.1.0 — Zero-config install. Public shared backend is now the default. No AUDIOMIND_BACKEND_URL setup required for free tier users.

v3.0.1 — Added openclaw.requires metadata to declare env vars and trusted network endpoints. Resolves OpenClaw security scanner warning.

v3.0.0 — Full architecture rewrite. All commercial logic moved to Vercel backend. ElevenLabs API keys are now server-side only. Passes VirusTotal security scan.

Files

1 total
Select a file
Select a file to preview.

Comments

Loading comments…