Install
openclaw skills install @jsinbupt/senseaudio-voice-broadcast
Turn plain text into character-style TTS scripts for narration, companion-style voice messages, emotional comfort audio, and expressive spoken responses. Use when Codex needs to prepare or call TTS for human-like voice broadcast, role-based narration, emotional companionship, bedtime or greeting voice messages, soothing or encouraging speech, or any task where speech should sound like a specific persona rather than neutral text-to-speech.
Use this skill to convert plain text into TTS-ready, persona-driven spoken content. Prioritize natural spoken rhythm, stable persona traits, and emotional control over literal text fidelity.
Apply this skill for requests like:
Follow this order unless the user already specifies some of it.
Decide who is speaking:
If the user names a persona, keep that identity stable through the whole output. If the user does not provide a persona, infer one from the task and state it clearly in the output.
Select one primary effect:
- 播报型 (broadcast): clear, structured, easy to follow
- 陪伴型 (companion): warm, soft, intimate, less formal
- 角色型 (character): strong personality markers and stylized wording
- 安慰型 (comfort): gentle, slower, emotionally safe
- 激励型 (motivational): energetic, uplifting, rhythmic
Do not mix conflicting effects unless the user explicitly asks for it.
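The single-effect rule can be sketched as a lookup with a guard. This is only an illustration: the effect names and descriptions come from the list above, while `EFFECTS`, `pick_effect`, and the `allow_mixing` flag are hypothetical names that are not part of the skill. The sketch conservatively treats any combination of effects as a possible conflict.

```python
# Hypothetical helper: names below are illustrative, not part of the skill.
EFFECTS = {
    "播报型": "clear, structured, easy to follow",
    "陪伴型": "warm, soft, intimate, less formal",
    "角色型": "strong personality markers and stylized wording",
    "安慰型": "gentle, slower, emotionally safe",
    "激励型": "energetic, uplifting, rhythmic",
}


def pick_effect(requested, allow_mixing=False):
    """Return the effects to apply, enforcing one primary effect by default."""
    if not requested:
        raise ValueError("select one primary effect")
    unknown = [name for name in requested if name not in EFFECTS]
    if unknown:
        raise ValueError(f"unknown effect(s): {unknown}")
    # Mixing is allowed only when the user explicitly asked for it.
    if len(requested) > 1 and not allow_mixing:
        raise ValueError("mixing effects requires an explicit user request")
    return list(requested)
```

Calling `pick_effect(["安慰型"])` returns the single effect unchanged, while passing two effects raises unless `allow_mixing=True` is set.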
Transform the source text into spoken language:
TTS scripts should sound like someone speaking to a listener, not like a document being read aloud.
When helpful, output both:
- 口播文本 (spoken script): the final text to send to TTS
- 语音控制说明 (voice control notes): style notes for the TTS backend, such as tone, pace, pause density, warmth, smile, tenderness, or firmness
If the user asks for direct TTS calling instructions, also output backend-facing hints in short bullet form.
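One way to keep the two output parts together is a small container that renders them as plain text. This is a sketch under assumptions: the class name `BroadcastOutput`, its field names, and the note keys (`tone`, `pace`) are hypothetical and not defined by the skill; only the two section labels come from the text above.

```python
from dataclasses import dataclass, field


@dataclass
class BroadcastOutput:
    # 口播文本: the final text to send to TTS
    spoken_text: str
    # 语音控制说明: style notes for the TTS backend (tone, pace, pauses, ...)
    voice_notes: dict = field(default_factory=dict)

    def render(self):
        """Format both parts as plain text, with notes as short bullets."""
        lines = ["口播文本:", self.spoken_text]
        if self.voice_notes:
            lines.append("语音控制说明:")
            lines.extend(f"- {key}: {value}" for key, value in self.voice_notes.items())
        return "\n".join(lines)


# Example with hypothetical note keys.
example = BroadcastOutput("晚安,今天辛苦了。", {"tone": "gentle", "pace": "slow"})
print(example.render())
```

When no style notes are given, `render()` emits only the spoken script, which keeps the output to the smallest useful set.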
If the task is not just script writing but actual audio generation, call the bundled script:
python3 scripts/senseaudio_tts.py \
  --text "你好,欢迎体验 SenseAudio 带来的极致语音服务。" \
  --voice-id male_0004_a \
  --output output.wav
Authentication:
- SENSEAUDIO_API_KEY
- --api-key
Default endpoint and model:
- https://api.senseaudio.cn/v1/t2a_v2
- SenseAudio-TTS-1.0
Use --text-file when the spoken script is long or already saved as a file.
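If the bundled script is driven from Python rather than a shell, the call might be wrapped as below. This is a sketch, not the skill's own code. It uses only the flags named above (`--text`, `--text-file`, `--voice-id`, `--output`, `--api-key`) and assumes that `--text` and `--text-file` are mutually exclusive, that `--api-key` takes the key as its value, and that the script exits non-zero on failure. The helper names `build_tts_command` and `run_tts` are hypothetical.

```python
import os
import subprocess


def build_tts_command(output, voice_id, text=None, text_file=None, api_key=None):
    """Build the argv list for scripts/senseaudio_tts.py."""
    # Assumption: the script expects exactly one text source.
    if (text is None) == (text_file is None):
        raise ValueError("pass exactly one of text or text_file")
    cmd = ["python3", "scripts/senseaudio_tts.py"]
    if text is not None:
        cmd += ["--text", text]
    else:
        cmd += ["--text-file", str(text_file)]
    cmd += ["--voice-id", voice_id, "--output", str(output)]
    if api_key is not None:
        # Assumption: --api-key takes the key as its value.
        cmd += ["--api-key", api_key]
    return cmd


def run_tts(**kwargs):
    """Run the script; raises CalledProcessError on a non-zero exit."""
    if kwargs.get("api_key") is None and "SENSEAUDIO_API_KEY" not in os.environ:
        raise RuntimeError("set SENSEAUDIO_API_KEY or pass api_key")
    subprocess.run(build_tts_command(**kwargs), check=True)
```

Passing arguments as a list avoids shell quoting problems with Chinese punctuation or quotation marks inside the spoken script. For a long script, write it to a file first and pass `text_file` instead of `text`.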
Prefer generating the spoken script first, then sending that cleaned script to TTS instead of sending raw written prose.
Use emotion deliberately:
Avoid exaggerated emotional markers that would make TTS sound unnatural or theatrical unless explicitly requested.
Default to Chinese output unless the user requests another language.
Prefer short spoken units over long written paragraphs. Preserve concrete facts, names, numbers, and instructions unless the user asks for a looser adaptation.
If the task is informational broadcast, keep it understandable first and expressive second. If the task is emotional companionship, keep it emotionally coherent first and factually safe second.
When preparing a TTS-friendly script:
Produce the smallest useful set:
Use the templates in voice-broadcast-templates.md when the user does not specify a format.
Use this skill for requests like: