Install
openclaw skills install audioclaw-skills-voice-intakeUse when AudioClaw Skills needs to understand a user voice message with AudioClaw ASR, including speech-to-text, model routing for deepthink or pro features, optional timestamps or sentiment, and packaging the result into a ready-to-use AudioClaw user turn payload.
openclaw skills install audioclaw-skills-voice-intakeUse this skill when the user sends a voice message and AudioClaw should understand the content before replying.
Common triggers:
.ogg or .opus, and AudioClaw still needs one stable ASR path.Do not use this skill for speech output. Use $audioclaw-skills-voice-reply for TTS.
scripts/openclaw_voice_intake.py with the audio path.sense-asr-deepthink for normal single-speaker voice understandingsense-asr when a language hint is providedsense-asr-pro when timestamps, sentiment, speaker diarization, or punctuation are requestedsense-asr-lite when hotwords are requestedtranscript.normalized_textopenclaw.turn_payloadrouting.selected_modelunderstanding.clarification_needed is true, ask the user to repeat or resend the audio.Official HTTP ASR API:
https://api.senseaudio.cn/v1/audio/transcriptionsmultipart/form-data<=10MB.wav, .mp3, .ogg, .opus, .flac, .aac, .m4a, .mp4Supported response goals:
The skill keeps two layers separate:
This skill now treats SENSEAUDIO_API_KEY as the default API key source again.
Runtime rules:
SENSEAUDIO_API_KEY as an AudioClaw login token such as v2.public..., the shared bootstrap will replace it with the real sk-... value from ~/.audioclaw/workspace/state/senseaudio_credentials.json before ASR starts.--api-key-env still works, but the default runtime path is SENSEAUDIO_API_KEY.Basic voice intake:
python3 scripts/openclaw_voice_intake.py \
--input /path/to/user_audio.mp3
Voice intake with richer AudioClaw structure:
python3 scripts/openclaw_voice_intake.py \
--input /path/to/meeting_clip.m4a \
--enable-punctuation \
--timestamp-granularity segment \
--enable-sentiment \
--out-json /tmp/openclaw_voice_turn.json
Force a specific model:
python3 scripts/openclaw_voice_intake.py \
--input /path/to/user_audio.mp3 \
--model sense-asr-deepthink
Recommended handoff:
scripts/openclaw_voice_intake.py.openclaw.turn_payload.roleopenclaw.turn_payload.contentopenclaw.turn_payload.metadataOperational rules:
language only when you are confident; otherwise let ASR auto-detect.sense-asr-pro.scripts/senseaudio_asr_client.py
scripts/openclaw_voice_intake.py
references/openclaw_voice_intake.md