Install
openclaw skills install pdf-ocr-parse

Extract text from scanned PDFs using Tesseract OCR. Supports multiple languages, page selection, DPI control, and word-level bounding boxes.

Rasterises each selected page of a PDF at the given DPI, then runs Tesseract OCR on each page image. Returns per-page text with confidence scores, and optionally per-word bounding boxes.

Provide one of:
- url: URL to a scanned PDF
- base64_pdf: base64-encoded PDF
- file field

Send your API key in the CLIENT-API-KEY header.

Get your free API key at https://pdfapihub.com. Full API documentation is available at https://pdfapihub.com/docs.
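
The input options and the auth header can be sketched in Python as follows. This is a minimal illustration, not part of the API: the helper names `build_source` and `auth_headers` are hypothetical, and it assumes that `base64_pdf` takes standard base64 of the raw PDF bytes with no data-URI prefix. The `file` upload option is left out because its exact form is not described here.

```python
import base64


def build_source(url=None, pdf_bytes=None):
    """Return the request field that identifies the PDF (hypothetical helper).

    Exactly one of `url` or `pdf_bytes` must be given.
    """
    if (url is None) == (pdf_bytes is None):
        raise ValueError("provide exactly one of url or pdf_bytes")
    if url is not None:
        return {"url": url}
    # Assumption: standard base64 of the raw PDF bytes, no data-URI prefix.
    return {"base64_pdf": base64.b64encode(pdf_bytes).decode("ascii")}


def auth_headers(api_key):
    """Headers for a JSON request; the key goes in CLIENT-API-KEY."""
    return {"CLIENT-API-KEY": api_key, "Content-Type": "application/json"}
```

For a local file, `build_source(pdf_bytes=open("scan.pdf", "rb").read())` would produce the `base64_pdf` variant; `scan.pdf` is a placeholder name.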
| Param | Default | Description |
|---|---|---|
| lang | eng | Language code(s), separated by + |
| psm | 3 | Page segmentation mode (0–13) |
| oem | 3 | OCR engine mode (0=legacy, 1=LSTM, 3=default) |
| dpi | 200 | Rasterisation DPI (72–400) |
| char_whitelist | none | Restrict to specific characters |

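
The defaults and ranges in the table can be checked on the client before sending a request. The sketch below is one way to do that; `validate_ocr_params` is a hypothetical helper, and the server's own validation may differ. It accepts `oem` values 0 to 3 because Tesseract itself also defines mode 2 (legacy plus LSTM), although the table lists only 0, 1 and 3.

```python
def validate_ocr_params(lang="eng", psm=3, oem=3, dpi=200, char_whitelist=None):
    """Check OCR options against the documented ranges and return a dict.

    Hypothetical client-side helper; defaults mirror the parameter table.
    """
    # lang is one or more language codes joined with '+', e.g. "eng+deu".
    if not lang or any(not code for code in lang.split("+")):
        raise ValueError("lang must be one or more codes joined by '+'")
    if not 0 <= psm <= 13:
        raise ValueError("psm must be between 0 and 13")
    # Assumption: 2 is allowed as well, since Tesseract defines OEM 0-3.
    if not 0 <= oem <= 3:
        raise ValueError("oem must be between 0 and 3")
    if not 72 <= dpi <= 400:
        raise ValueError("dpi must be between 72 and 400")

    params = {"lang": lang, "psm": psm, "oem": oem, "dpi": dpi}
    # Only include the whitelist when the caller actually sets one.
    if char_whitelist:
        params["char_whitelist"] = char_whitelist
    return params
```

For example, `validate_ocr_params(lang="eng+deu", dpi=300)` returns a dict that can be merged into the request body, while `validate_ocr_params(dpi=500)` raises `ValueError`.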
curl -X POST https://pdfapihub.com/api/v1/pdf/ocr/parse \
  -H "CLIENT-API-KEY: your_api_key" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://pdfapihub.com/sample-pdfinvoice-with-image.pdf",
    "pages": "1-3",
    "lang": "eng",
    "dpi": 300,
    "detail": "words"
  }'
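
The same request can be made from Python with only the standard library. This is a rough equivalent of the curl call above, not an official client: `build_request` and `parse_pdf` are hypothetical names, and the sketch assumes the endpoint responds with JSON. The response fields are not documented here, so it does not interpret them.

```python
import json
import urllib.request

API_URL = "https://pdfapihub.com/api/v1/pdf/ocr/parse"


def build_request(api_key, payload):
    """Build the POST request without sending it (hypothetical helper)."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "CLIENT-API-KEY": api_key,
            "Content-Type": "application/json",
        },
        method="POST",
    )


def parse_pdf(api_key, payload, timeout=60):
    """Send the request and decode the body, assuming a JSON response."""
    request = build_request(api_key, payload)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `parse_pdf("your_api_key", {"url": "...", "pages": "1-3", "lang": "eng", "dpi": 300, "detail": "words"})` with the PDF URL from the curl example would send the same body. HTTP errors surface as `urllib.error.HTTPError`.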