Install
openclaw skills install paddle-ocr-vlGPU-accelerated document parsing and OCR via PaddleOCR-VL. Detects layout, recognizes Chinese/English text, tables, charts, and seals in images. Use when the user asks to OCR an image, extract text from a document, parse a screenshot, or recognize text in photos. Supports vertical classical Chinese text, modern newspaper layouts, and mixed-content documents.
openclaw skills install paddle-ocr-vlGPU-accelerated document OCR using the PaddleOCR-VL model running inside an ephemeral Docker container. Auto-detects NVIDIA GPU architecture (Blackwell SM120 vs. standard) and selects the correct official image.
This skill includes an MCP server (server.py) that exposes three tools:
| Tool | Purpose |
|---|---|
run_ocr | OCR any image — provide an absolute path |
check_environment | Verify Docker, GPU drivers, and image are ready |
run_demo | Run OCR on bundled demo images to test the setup |
Add to ~/.config/Claude/claude_desktop_config.json (Claude Desktop)
or ~/.claude/settings.json (Claude Code):
{
"mcpServers": {
"paddle-ocr-vl": {
"command": "python3",
"args": ["<INSTALL_DIR>/server.py"]
}
}
}
# Blackwell GPU (RTX 50xx, B100/B200 — compute capability >= 12.0):
docker pull ccr-2vdh3abv-pub.cnc.bj.baidubce.com/paddlepaddle/paddleocr-vl:latest-nvidia-gpu-sm120
# Other NVIDIA GPU:
docker pull ccr-2vdh3abv-pub.cnc.bj.baidubce.com/paddlepaddle/paddleocr-vl:latest-nvidia-gpu
Call check_environment to verify everything is set up, then run_demo to
test on the bundled sample images.
| File | Content |
|---|---|
demo/newspaper.png | People's Daily article about China-Eritrea relations |
demo/classical_text.png | Records of the Three Kingdoms, vertical classical Chinese |
nvidia-container-toolkitmcp SDK (pip install mcp)--rm flag)--network host (needed for GPU)None. All processing is local.