Qwen Vision

Analyze images and videos using Qwen Vision API (Alibaba Cloud DashScope). Supports image understanding, OCR, visual reasoning.

openclaw skills install @perchouli/qwen-vision

Analyze images and videos using Alibaba Cloud's Qwen Vision API (通义千问视觉模型).

Usage

Analyze an image:

bash

uv run {baseDir}/scripts/analyze_image.py --image "/path/to/image.jpg" --prompt "请描述这张图片" --api-key sk-xxx

With custom model:

bash

uv run {baseDir}/scripts/analyze_image.py --image "/path/to/image.jpg" --model qwen-vl-max-latest --api-key sk-xxx

Get your API key from:

Model	Description
`qwen-vl-max-latest`	Latest max model (default)
`qwen-vl-plus-latest`	Faster, cost-effective