Install
openclaw skills install aliyun-speech-transcriberTranscribe publicly accessible audio or video URLs with Aliyun speech services. Use when the user wants speech-to-text via Aliyun DashScope, needs transcript JSON or extracted plain text, or wants to process a cloud-accessible media URL (including signed Qiniu URLs) into transcription results.
openclaw skills install aliyun-speech-transcriberUse this skill to turn externally accessible media URLs into transcript results.
Current implementation focuses on DashScope file transcription using the paraformer-v2 model, aligned with the existing Java service pattern.
ASR_DASHSCOPE_API_KEYFallback supported:
DASHSCOPE_API_KEYOptional:
ALIYUN_SPEECH_MODEL - defaults to paraformer-v2ALIYUN_SPEECH_LANG_HINTS - defaults to zh,enALIYUN_SPEECH_POLL_SECONDS - defaults to 5ALIYUN_SPEECH_TIMEOUT_SECONDS - defaults to 1800Pass one or more externally accessible URLs:
node scripts/transcribe.js --file-url "https://example.com/audio.mp3"
Multiple files:
node scripts/transcribe.js --file-url "https://a.com/1.mp3" --file-url "https://a.com/2.mp3"
The script returns JSON with:
successproviderenginetaskIdrequestIdresultstexttext is a best-effort plain-text extraction from the final JSON result.
Typical workflow:
qiniu-upload to upload a local file.DASHSCOPE_API_KEY is missing.