Install
openclaw skills install universal-pdf-vision-parseExtract multilingual document content and language learning notes (French, German, Japanese, Spanish, etc.) from PDFs using multimodal vision (Qwen-VL-Max). This skill converts PDF pages to high-res images and 'sees' the content to produce perfectly structured, high-readability Markdown.
openclaw skills install universal-pdf-vision-parseVersion: 0.1
This skill is a high-end multilingual document digitizer. It uses multimodal vision to 'look' at each PDF page, making it perfect for language learning notes, bilingual documents, and complex layouts that standard OCR fails to capture.
qwen-vl-max access.pip install pymupdf dashscope
python scripts/vision_parse.py --pdf <path_to_pdf> --out <path_to_output.md> --api-key <YOUR_API_KEY> --max-pages 2
--max-pages: (Optional) Max pages to process. Defaults to 2. Set to -1 for all pages.User: "Convert this German-Chinese note to markdown: notes.pdf"
Agent Action:
python scripts/vision_parse.py --pdf notes.pdf --out notes.md