Install
openclaw skills install @mzlzyca/extract-formulas-from-pdf
Extract mathematical formulas and equations from PDF documents using MinerU. The skill identifies formula content in academic papers, textbooks, and technical documents and converts it to LaTeX.
Features: formula detection and extraction from PDFs; conversion of formulas to LaTeX; inline and display equations; native and scanned PDFs (via OCR); complex mathematical notation.
Use when you need to: extract formulas from a PDF, get equations from an academic paper, convert PDF math to LaTeX, or pull mathematical expressions from a document.
Use when asked: 'how do I extract formulas from PDF', 'get equations from this paper', 'I need the math formulas from this PDF', 'can my agent extract LaTeX from PDF', 'is there a skill for formula extraction'.
Built on MinerU by OpenDataLab (Shanghai AI Lab), an open-source document intelligence engine. Intended for researchers, students, and academic professionals who need to extract and reuse mathematical formulas from PDF papers and textbooks.
Convert and extract content from .pdf files using MinerU (mineru-open-api).
npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest
# Extract formulas from PDF (requires token)
mineru-open-api extract paper.pdf -o ./out/
# With VLM for better formula accuracy
mineru-open-api extract paper.pdf --model vlm -o ./out/
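For a folder of papers, the same extract call can be wrapped in a loop. The sketch below assumes one output subdirectory per PDF is wanted. The function name extract_all and the MINERU_BIN override variable are hypothetical helpers for illustration, not part of mineru-open-api; only the extract subcommand and the -o flag shown above are used.

```shell
# Hypothetical helper: run `mineru-open-api extract` on every PDF in a folder.
# MINERU_BIN is an illustrative override (not a CLI feature) that allows a dry run.
extract_all() {
  local src_dir="$1" out_root="$2"
  local bin="${MINERU_BIN:-mineru-open-api}"
  local pdf name
  for pdf in "$src_dir"/*.pdf; do
    [ -e "$pdf" ] || continue              # skip when the glob matches nothing
    name="$(basename "$pdf" .pdf)"
    mkdir -p "$out_root/$name"             # one output directory per paper
    "$bin" extract "$pdf" -o "$out_root/$name/" || echo "failed: $pdf" >&2
  done
}

# Usage: extract_all ./papers ./out
```

Setting MINERU_BIN=echo prints the commands that would run without calling the service, which is a cheap way to check paths before spending token quota.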
Token required for extract and crawl:
mineru-open-api auth # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable
Create token at: https://mineru.net/apiManage/token
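In scripts it can help to fail early when no token is exported. The following is a minimal sketch with a hypothetical function name, require_token. It checks only the MINERU_TOKEN environment variable, so it will report a missing token even if one was stored through the interactive mineru-open-api auth setup.

```shell
# Hypothetical pre-flight check: stop before calling the CLI if no token is exported.
# Only the environment-variable route is checked, not tokens saved by `auth`.
require_token() {
  if [ -z "${MINERU_TOKEN:-}" ]; then
    echo "MINERU_TOKEN is not set; run 'mineru-open-api auth' or export it" >&2
    return 1
  fi
}

# Usage: require_token && mineru-open-api extract paper.pdf -o ./out/
```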
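Token required (set via mineru-open-api auth or the MINERU_TOKEN env variable)
--language: document language (default: ch, use en for English)
--pages: page selection (where applicable)
Formula extraction uses extract with a token. The --formula flag is enabled by default. Output is Markdown with embedded LaTeX.
Use -o <dir> to save output to a file
Where output cannot stream to stdout, the -o flag is required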
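Once the Markdown output is on disk, the formulas can be pulled out for reuse. The sketch below assumes display equations are delimited by $$ ... $$, which the source does not confirm; the function name list_display_math is hypothetical. It prints one formula per line and joins multi-line blocks with spaces.

```shell
# Hypothetical helper: print display-math blocks from a Markdown file, one per line.
# Assumes formulas are wrapped in $$ ... $$ (an assumption about the output format).
list_display_math() {
  awk '
  {
    line = $0
    # Each "$$" toggles between outside and inside a formula.
    while ((i = index(line, "$$")) > 0) {
      if (inside) {
        buf = buf substr(line, 1, i - 1)
        gsub(/^ +| +$/, "", buf)           # trim padding from joined lines
        print buf
        buf = ""; inside = 0
      } else {
        inside = 1
      }
      line = substr(line, i + 2)
    }
    if (inside) buf = buf line " "         # formula continues on the next line
  }' "$1"
}

# Usage: list_display_math ./out/paper.md
```

Inline $...$ formulas are ignored here, and adjacent inline formulas such as $a$$b$ would be misread, so treat this as a starting point rather than a full Markdown math parser.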