Install
openclaw skills install nvidia-kimi-visionAnalyze images using NVIDIA Kimi K2.5 vision model via NVIDIA NIM API. Perfect for adding vision to non-vision models like MiniMax M2.5, GLM-5, or any model without native image support. Supports png, jpg, jpeg, webp.
openclaw skills install nvidia-kimi-visionFast image analysis using Kimi K2.5 multimodal model from NVIDIA NIM.
When using this skill, if no API key is found, it will automatically guide the user through setup:
# Option A: Save to file (recommended)
mkdir -p ~/.config
echo 'your-api-key-here' > ~/.config/nvidia-kimi-api-key
# Option B: Pass directly when running
python3 scripts/analyze_image.py photo.jpg "What's this?" sk-your-key-here
When a user tries to use this skill without an API key, the script will output clear setup instructions. Guide them through:
python3 scripts/analyze_image.py <image_path> "<prompt>" [api_key]
# What's in this image?
python3 scripts/analyze_image.py "/path/to/image.jpg" "Describe what's in this image"
# Extract text from screenshot
python3 scripts/analyze_image.py "/path/screenshot.png" "Extract all text"
# Analyze a meme
python3 scripts/analyze_image.py "/path/meme.jpg" "Explain this meme"
# With API key inline
python3 scripts/analyze_image.py photo.jpg "What's this?" sk-xxxxx
Supports: png, jpg, jpeg, webp
The free tier through NVIDIA NIM has some limits, but they're not clearly documented on the site. Check https://build.nvidia.com for the latest info on your specific key's limits.