Install
openclaw skills install mineru-skillParse PDFs, Word docs, PPTs, and images into clean Markdown using MinerU's VLM engine. Use when: (1) Converting PDF/Word/PPT/image to Markdown, (2) Extracting text/tables/formulas from documents, (3) Batch processing multiple files, (4) Saving parsed content to Obsidian or knowledge bases. Supports LaTeX formulas, tables, images, multilingual OCR, and async parallel processing.
openclaw skills install mineru-skillConvert PDF, Word, PPT, and images to clean Markdown using MinerU's VLM engine — LaTeX formulas, tables, and images all preserved.
export MINERU_TOKEN="your-token-here"
Limits: 2000 pages/day · 200 MB per file · 600 pages per file
| Type | Formats |
|---|---|
.pdf — papers, textbooks, scanned docs | |
| 📝 Word | .docx — reports, manuscripts |
| 📊 PPT | .pptx — slides, presentations |
| 🖼️ Image | .jpg, .jpeg, .png — OCR extraction |
python3 scripts/mineru_v2.py --file ./document.pdf --output ./output/
python3 scripts/mineru_v2.py \
--dir ./docs/ \
--output ./output/ \
--workers 10 \
--resume
python3 scripts/mineru_v2.py \
--dir ./pdfs/ \
--output "~/Library/Mobile Documents/com~apple~CloudDocs/Obsidian/VaultName/" \
--resume
python3 scripts/mineru_v2.py --dir ./papers/ --output ./output/ --language ch
python3 scripts/mineru_v2.py --file ./paper.pdf --output ./output/ --model vlm
--dir PATH Input directory (PDF/Word/PPT/images)
--file PATH Single file
--output PATH Output directory (default: ./output/)
--workers N Concurrent workers (default: 5, max: 15)
--resume Skip already processed files
--model MODEL Model version: pipeline | vlm | MinerU-HTML (default: vlm)
--language LANG Document language: auto | en | ch (default: auto)
--no-formula Disable formula recognition
--no-table Disable table extraction
--token TOKEN API token (overrides MINERU_TOKEN env var)
| Model | Speed | Accuracy | Best For |
|---|---|---|---|
pipeline | ⚡ Fast | High | Standard docs, most use cases |
vlm | 🐢 Slow | Highest | Complex layouts, multi-column, mixed text+figures |
MinerU-HTML | ⚡ Fast | High | Web-style output, HTML-ready content |
| Script | Use When |
|---|---|
mineru_v2.py | Default — async parallel (up to 15 workers) |
mineru_async.py | Fast network, need maximum throughput |
mineru_stable.py | Unstable network — sequential, max retry |
output/
├── document-name/
│ ├── document-name.md # Main Markdown
│ ├── images/ # Extracted images
│ └── content.json # Metadata
| Workers | Speed |
|---|---|
| 1 (sequential) | 1.2 files/min |
| 5 | 3.1 files/min |
| 15 | 5.6 files/min |
--resume to continue interrupted batchesFor detailed API documentation, see references/api_reference.md.