AB Agents Vision (MiniMax) ๐๏ธ
Image analysis via MiniMax VL API โ simple, fast, reliable.
โ ๏ธ Requires MiniMax Token Plan API key โ get free key
What It Does
- ๐ธ Describe images โ Get detailed scene descriptions
- ๐ Extract text โ Read from screenshots, photos, documents
- ๐ Analyze photos โ Identify objects, people, settings
- ๐ URL support โ Analyze images from the web
Requirements
- MiniMax Token Plan API key โ Subscribe free
- Linux/macOS
uvx (auto-installed)
Quick Start
# 1. Install uvx
curl -LsSf https://astral.sh/uv/install.sh | sh
# 2. Get free MiniMax API key
# https://platform.minimax.io โ Subscribe โ Token Plan (free tier)
# 3. Use
export MINIMAX_API_KEY="sk-cp-your-key"
./vision.sh image.jpg "Describe this image"
Usage
# Basic description
./vision.sh photo.jpg
# With custom prompt
./vision.sh screenshot.png "What text do you see?"
# URL support
./vision.sh "https://example.com/image.jpg" "Describe this"
Examples
Screenshot analysis:
Input: screenshot.png + "What text is in the image?"
Output: "The screenshot shows a code editor with Python code..."
Photo description:
Input: photo.jpg + "Describe in detail"
Output: "A person's bare foot and lower leg resting on a brown
textured waffle-weave blanket. The skin is light-toned..."
Installation
git clone https://github.com/alexburrstudio/ab-agents-vision.git
cd ab-agents-vision/skills/vision
chmod +x vision.sh
Or via ClaWHub:
clawhub install AB-Agents-Vision-MiniMax
Troubleshooting
| Error | Solution |
|---|
| API Error: 1033 | Retry โ MiniMax system error |
| No response | Check MINIMAX_API_KEY is set correctly |
| Slow | Use smaller images (<10MB) |
AB-Agents ๐ฆ
Related Skills
๐ AB Agents Meter Reader โ Read meter readings from photos (uses this skill for vision)
AB-Agents ๐ฆ