Install
openclaw skills install image-readerImage recognition and understanding tool. Uses a multimodal model (e.g. doubao-seed-2.0-pro, kimi-k2.5) to analyze image content and supports OCR text extrac...
openclaw skills install image-readerImage recognition and understanding tool that leverages Doubao multimodal models to analyze image content.
| Item | Value |
|---|---|
| API Endpoint | https://ark.cn-beijing.volces.com/api/coding/v3 |
| Model | doubao-seed-2.0-pro |
| Authentication | API Key (configured in config.yaml) |
# General analysis
python image_reader.py /path/to/image.png
# Extract text (OCR)
python image_reader.py /path/to/image.png -p "Extract all text from the image"
# Describe the image
python image_reader.py /path/to/image.png -p "Describe this image in detail"
Once installed, you can invoke it using natural language:
Analyze this image
Extract the text from the image
Describe this screenshot