Install
openclaw skills install alicloud-ai-entry-modelstudioRoute Alibaba Cloud Model Studio requests to the right local skill (Qwen Image, Qwen Image Edit, Wan Video, Wan R2V, Qwen TTS, Qwen ASR and advanced TTS variants). Use when the user asks for Model Studio without specifying a capability.
openclaw skills install alicloud-ai-entry-modelstudioCategory: task
Route requests to existing local skills to avoid duplicating model/parameter details.
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
DASHSCOPE_API_KEY (environment variable preferred; or dashscope_api_key in ~/.alibabacloud/credentials).| Need | Target skill |
|---|---|
| Text-to-image / image generation | skills/ai/image/alicloud-ai-image-qwen-image/ |
| Image editing | skills/ai/image/alicloud-ai-image-qwen-image-edit/ |
| Text-to-video / image-to-video (i2v) | skills/ai/video/alicloud-ai-video-wan-video/ |
| Reference-to-video (r2v) | skills/ai/video/alicloud-ai-video-wan-r2v/ |
| Text-to-speech (TTS) | skills/ai/audio/alicloud-ai-audio-tts/ |
| Speech recognition/transcription (ASR) | skills/ai/audio/alicloud-ai-audio-asr/ |
| Realtime speech recognition | skills/ai/audio/alicloud-ai-audio-asr-realtime/ |
| Realtime TTS | skills/ai/audio/alicloud-ai-audio-tts-realtime/ |
| Live speech translation | skills/ai/audio/alicloud-ai-audio-livetranslate/ |
| CosyVoice voice clone | skills/ai/audio/alicloud-ai-audio-cosyvoice-voice-clone/ |
| CosyVoice voice design | skills/ai/audio/alicloud-ai-audio-cosyvoice-voice-design/ |
| Voice clone | skills/ai/audio/alicloud-ai-audio-tts-voice-clone/ |
| Voice design | skills/ai/audio/alicloud-ai-audio-tts-voice-design/ |
| Omni multimodal interaction | skills/ai/multimodal/alicloud-ai-multimodal-qwen-omni/ |
| Visual reasoning | skills/ai/multimodal/alicloud-ai-multimodal-qvq/ |
| Text embeddings | skills/ai/search/alicloud-ai-search-text-embedding/ |
| Rerank | skills/ai/search/alicloud-ai-search-rerank/ |
| Vector retrieval | skills/ai/search/alicloud-ai-search-dashvector/ or skills/ai/search/alicloud-ai-search-opensearch/ or skills/ai/search/alicloud-ai-search-milvus/ |
| Document understanding | skills/ai/text/alicloud-ai-text-document-mind/ |
| Video editing | skills/ai/video/alicloud-ai-video-wan-edit/ |
| Model list crawl/update | skills/ai/misc/alicloud-ai-misc-crawl-and-skill/ |
text generation/chat (LLM)
multimodal embeddings
OCR-specialized extraction and image translation
virtual try-on / digital human / advanced video personas
For multimodal/ASR download failures, prefer public URLs listed above.
For ASR parameter errors, use data URI in input_audio.data.
For multimodal embedding 400, ensure input.contents is an array.
When X-DashScope-Async: enable returns task_id, poll as follows:
GET https://dashscope.aliyuncs.com/api/v1/tasks/<task_id>
Authorization: Bearer $DASHSCOPE_API_KEY
Example result fields (success):
{
"output": {
"task_status": "SUCCEEDED",
"video_url": "https://..."
}
}
Notes:
output.video_url.Model list and links:output/alicloud-model-studio-models-summary.md
API/parameters/examples: see target sub-skill SKILL.md and references/*.md
Official source list:references/sources.md
mkdir -p output/alicloud-ai-entry-modelstudio
echo "validation_placeholder" > output/alicloud-ai-entry-modelstudio/validate.txt
Pass criteria: command exits 0 and output/alicloud-ai-entry-modelstudio/validate.txt is generated.
output/alicloud-ai-entry-modelstudio/.