Install
openclaw skills install image2textExtract text from images using tesseract OCR, supporting local files, URLs, and base64 inputs for text-only AI models without vision capability.
openclaw skills install image2textExtract text from images without needing a vision-capable AI model.
python3 scripts/ocr.py <image path|URL|base64> [--lang <languages>] [--psm <mode>] [--raw]
--lang: Language codes, comma-separated, default chi_sim+eng
chi_sim Simplified Chinese | chi_tra Traditional | eng English | jpn Japanese | kor Korean | and 30+ morechi_sim+eng--psm: Page segmentation mode, default 6
3 Fully automatic | 6 Block-level | 4 Single line | 11 Sparse text--raw: Output plain text only, no markers/Users/xxx/Downloads/xxx.pnghttps://example.com/image.png — OSS temp links work tooOCR a Chinese receipt:
python3 scripts/ocr.py ~/Downloads/receipt.png --lang chi_sim
English + Chinese mixed:
python3 scripts/ocr.py https://example.com/doc.jpg --lang chi_sim+eng
Plain text only (no markers):
python3 scripts/ocr.py /path/to/image.png --raw
brew install tesseract/opt/homebrew/bin/tesseract--psm 3