Pic2md

OCR

Image to Markdown - extract text from images (PNG, JPG, WebP) to Markdown with OCR. Use when reading text from screenshots, photos, scanned pages, or any image file.

Install

openclaw skills install @tanis90/pic2md

Pic2MD - Picture to Markdown Parser

Extract text from images to Markdown using MinerU Open API. No API key required.

Quick Start

# Pic2MD - Picture to Markdown Parser
mineru-open-api flash-extract screenshot.png

# Pic2MD - Picture to Markdown Parser
mineru-open-api flash-extract https://example.com/image.png

# Pic2MD - Picture to Markdown Parser
mineru-open-api flash-extract photo.jpg -o ./output/

# Pic2MD - Picture to Markdown Parser
mineru-open-api flash-extract scan.jpg --language en

Language Rule

You MUST reply to the user in the SAME language they use. This is non-negotiable.

Capabilities

OCR text extraction from PNG, JPG, JPEG, WebP, BMP, TIFF
Supports both local files and URLs directly
Language hint with --language (default: ch, use en for English)
No API key, no signup, no authentication
Max 10MB per image

When to Use

User asks to "read", "extract", or "OCR" an image
User shares a screenshot and asks what it says
User wants text from a photo of a document or whiteboard
User needs image content converted to Markdown

CLI Reference

Run mineru-open-api flash-extract --help for all available options.

Data Privacy

flash-extract uploads the image to MinerU's cloud API for processing and returns the result. No account or API key is required.
Images are processed in real-time and are not stored after extraction.
For details, see https://mineru.net

Notes

Output is Markdown text extracted via OCR
For higher precision or batch processing, use mineru-open-api extract (requires auth via mineru-open-api auth)
If the CLI cannot be installed via npm/uv/go, download it from https://mineru.net/ecosystem?tab=cli