Install
openclaw skills install vlm-image-helperVisual inspection helper for VLM and OCR workflows. Use when agent needs to help a vision model see an image more clearly before re-analysis: rotate misaligned or sideways text, crop to a relevant region, zoom small details, enhance readability, or convert an image for re-input. Trigger especially when the model cannot confidently read text, cannot tell similar characters apart such as O/0 or I/l/1, says the image is unclear, needs to inspect only one area of the image, or would benefit from a second pass on a clearer view. Do not use as a general-purpose image editor.
openclaw skills install vlm-image-helperTreat this skill as a visual aid for the model, not as a general image editor.
Use scripts/image_helper.py to create a clearer intermediate image, then re-run analysis on that result.
# Rotate sideways text
python scripts/image_helper.py image.png --rotate 90 -o rotated.png
# Crop a likely area and zoom it
python scripts/image_helper.py image.png --crop-preset bottom-right --scale-preset x3 -o detail.png
# Improve low-contrast text
python scripts/image_helper.py image.png --auto-enhance -o enhanced.png
# Convert an existing file path directly to base64
python scripts/image_helper.py image.png --base64
--rotate.--crop-preset first, then add --scale-preset.--scale-preset x2 or x3.--auto-enhance, or manually tune --contrast and --sharpness.--base64.--input-mode auto or force --input-mode base64 / data-uri.-o or return inline base64 with --base64.references/cli-reference.mdreferences/presets.mdInstall Pillow if it is missing:
pip install Pillow
# or
uv pip install Pillow