Install
openclaw skills install pdf-utils
PDF Utils enables OCR of image-based PDFs, extraction of arXiv IDs from text or OCR output, and scriptable PDF tasks such as merging, splitting, and rendering.
Use this skill for local, scriptable PDF processing. It is a stable 1.x skill for OCR, arXiv reference mining, and repeatable PyMuPDF workflows. Prefer the built-in pdf tool for AI-style reading, summarization, question answering, and semantic analysis of PDF content.
Use the built-in pdf tool for summary, Q&A, extraction by meaning, or general document understanding.
Use scripts/extract_refs.py when the PDF already has extractable text and you need arXiv IDs or batch downloads.
Use scripts/ocr_pdf.py when the PDF is scanned or image-based and text extraction is poor or empty.
Use scripts/pdf_ops.py for repeatable local PDF operations such as merge, split, and rendering a page to an image.
To extract arXiv IDs from a PDF with extractable text, run:
python3 scripts/extract_refs.py paper.pdf
If needed, download the referenced papers:
python3 scripts/extract_refs.py paper.pdf --download --out ~/papers/
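For readers curious about what arXiv ID extraction involves, here is a minimal sketch of the general technique. It is not the code in scripts/extract_refs.py: the regular expressions, the function name extract_arxiv_ids, and the choice to drop version suffixes are all assumptions made for illustration.

```python
import re

# Assumed patterns for the two public arXiv identifier schemes.
# The real scripts/extract_refs.py may match differently.
NEW_STYLE = r"\d{4}\.\d{4,5}"                  # e.g. 1706.03762
OLD_STYLE = r"[a-z\-]+(?:\.[A-Z]{2})?/\d{7}"   # e.g. hep-th/9901001

# Only accept IDs that follow an "arXiv:" prefix or an arxiv.org URL,
# which avoids matching unrelated numbers such as DOIs or page ranges.
ARXIV_RE = re.compile(
    rf"(?:arXiv:\s*|arxiv\.org/(?:abs|pdf)/)({NEW_STYLE}|{OLD_STYLE})(?:v\d+)?",
    re.IGNORECASE,
)


def extract_arxiv_ids(text: str) -> list[str]:
    """Return unique arXiv IDs in order of first appearance, without version suffixes."""
    seen: list[str] = []
    for match in ARXIV_RE.finditer(text):
        arxiv_id = match.group(1)
        if arxiv_id not in seen:
            seen.append(arxiv_id)
    return seen
```

Calling extract_arxiv_ids on the text of a reference list returns each ID once, so the result can be passed straight to a download step without fetching the same paper twice.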
Run OCR on all pages:
python3 scripts/ocr_pdf.py paper.pdf --all
To OCR and immediately extract arXiv IDs from the OCR output:
python3 scripts/ocr_pdf.py paper.pdf --all --extract-refs
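The rule of thumb for choosing OCR (text extraction is poor or empty) can be sketched as a simple check. This is a hypothetical heuristic, not something the skill is documented to do: the function name needs_ocr and the 50-character threshold are assumptions, and the per-page strings are assumed to come from a text extractor such as PyMuPDF's page.get_text().

```python
def needs_ocr(page_texts: list[str], min_chars_per_page: int = 50) -> bool:
    """Guess whether a PDF is scanned, given the text extracted from each page.

    A page counts as "sparse" when it yields fewer than min_chars_per_page
    non-whitespace-trimmed characters. The PDF is treated as needing OCR
    when more than half of its pages are sparse. Both thresholds are
    illustrative assumptions and should be tuned on real documents.
    """
    if not page_texts:
        return False  # no pages, nothing to OCR
    sparse_pages = sum(
        1 for text in page_texts if len(text.strip()) < min_chars_per_page
    )
    return sparse_pages > len(page_texts) / 2
```

A majority rule is used here so that a text-based paper with a few full-page figures is not sent through OCR unnecessarily.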
Install these before using OCR features:
brew install tesseract
brew install tesseract-lang
pip3 install pytesseract Pillow pymupdf --break-system-packages
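Before running OCR, it can help to confirm that the dependencies above are actually visible to Python. The following is a minimal preflight sketch using only the standard library. The function name missing_ocr_dependencies is hypothetical, and the script itself may perform its own checks.

```python
import importlib.util
import shutil

# pip package name -> importable module names to try.
# Pillow is imported as PIL; PyMuPDF is importable as pymupdf in recent
# releases and as fitz in older ones, so either is accepted here.
REQUIRED_MODULES = {
    "pytesseract": ("pytesseract",),
    "Pillow": ("PIL",),
    "pymupdf": ("pymupdf", "fitz"),
}


def missing_ocr_dependencies() -> list[str]:
    """Return the names of OCR dependencies that cannot be found."""
    missing = []
    # The tesseract binary must be on PATH for pytesseract to call it.
    if shutil.which("tesseract") is None:
        missing.append("tesseract")
    for package, modules in REQUIRED_MODULES.items():
        # find_spec locates a top-level module without importing it.
        if not any(importlib.util.find_spec(name) for name in modules):
            missing.append(package)
    return missing
```

An empty list means the binary and all three packages were found. Note that this does not verify that the language data from tesseract-lang is installed.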
See references/usage.md for CLI examples, programmatic API notes, PDF ops usage, and known limits.
Try scripts/pdf_ops.py first before writing one-off snippets.