OCR with python

Extract Chinese and English text from images and scanned PDFs, including documents like invoices and contracts, using PaddleOCR in Python.

openclaw skills install ocr-python

OCR Text Recognition

This skill uses PaddleOCR for text recognition, supporting both Chinese and English.

Perform OCR recognition directly on image or PDF files:

from paddleocr import PaddleOCR

ocr = PaddleOCR(lang='ch')
result = ocr.predict("file_path.jpg")

Install dependencies before first use:

pip3 install paddlepaddle paddleocr

Recognition results return JSON containing:

Common scripts are located in the scripts/ directory.