image2text

Extract text from images using Tesseract OCR. Accepts local files, URLs, and base64 input, so text-only AI models without vision capability can read image content.

openclaw skills install image2text

Extract text from images without needing a vision-capable AI model.

Usage

python3 scripts/ocr.py <image path|URL|base64> [--lang <languages>] [--psm <mode>] [--raw]

--lang: Language codes, joined with +, default chi_sim+eng
- chi_sim Simplified Chinese | chi_tra Traditional Chinese | eng English | jpn Japanese | kor Korean | and 30+ more
- Combine: chi_sim+eng
--psm: Page segmentation mode, default 6
- 3 Fully automatic | 4 Single column | 6 Single uniform block | 7 Single line | 11 Sparse text
--raw: Output plain text only, no markers
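
The internals of scripts/ocr.py are not shown here, so the following is only a minimal sketch of how the --lang and --psm options could map onto a tesseract command line. The helper name build_tesseract_cmd is hypothetical. The only assumptions about tesseract itself are its standard CLI form (image, output base, -l, --psm), the use of "stdout" as the output base to print text, and the valid PSM range 0 to 13.

```python
import shlex


def build_tesseract_cmd(image_path, lang="chi_sim+eng", psm=6):
    """Return an argv list for a tesseract call that prints text to stdout.

    Hypothetical helper for illustration; not taken from scripts/ocr.py.
    """
    # Tesseract defines page segmentation modes 0 through 13.
    if not 0 <= psm <= 13:
        raise ValueError(f"psm must be between 0 and 13, got {psm}")
    # Languages are joined with "+", so every piece must be non-empty.
    if not all(lang.split("+")):
        raise ValueError(f"invalid language spec: {lang!r}")
    return ["tesseract", image_path, "stdout", "-l", lang, "--psm", str(psm)]


if __name__ == "__main__":
    # Show the shell command the defaults would produce for one image.
    print(shlex.join(build_tesseract_cmd("receipt.png", lang="chi_sim")))
```

Returning an argv list rather than a single string keeps paths with spaces safe when the list is passed to subprocess.run without a shell.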

OCR a Chinese receipt:

python3 scripts/ocr.py ~/Downloads/receipt.png --lang chi_sim

Mixed English and Chinese:

python3 scripts/ocr.py https://example.com/doc.jpg --lang chi_sim+eng

Plain text only (no markers):

python3 scripts/ocr.py /path/to/image.png --raw
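
The examples above pass a local path and a URL, and the script is also described as accepting base64. How scripts/ocr.py tells these apart is not documented, so the sketch below shows one plausible heuristic. The function name classify_input and the order of the checks are assumptions made for illustration.

```python
import base64
import binascii
import os


def classify_input(source):
    """Guess whether `source` is a URL, a local file, or base64 data.

    Hypothetical helper for illustration; not taken from scripts/ocr.py.
    """
    if source.startswith(("http://", "https://")):
        return "url"
    # Expand "~" so paths like ~/Downloads/receipt.png are recognized.
    if os.path.isfile(os.path.expanduser(source)):
        return "file"
    # Drop an optional data-URI prefix such as "data:image/png;base64,".
    payload = source
    if source.startswith("data:") and "," in source:
        payload = source.split(",", 1)[1]
    if not payload:
        raise ValueError("empty input")
    try:
        # validate=True rejects characters outside the base64 alphabet.
        base64.b64decode(payload, validate=True)
    except (binascii.Error, ValueError):
        raise ValueError("not a URL, an existing file, or valid base64") from None
    return "base64"
```

This check is only a heuristic: a short string that is not an existing file but happens to be valid base64 would be classified as base64, so a real implementation might also inspect the decoded bytes for a known image header.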