Mistral OCR

v1.0.4

Extract text, tables, and images from PDFs or images using Mistral OCR API and output in Markdown, JSON, or HTML formats.

5· 1.8k·6 current·9 all-time
byLeyuDame2@yzdame

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for yzdame/mistral-ocr.

Previewing Install & Setup.
Prompt PreviewInstall & Setup
Install the skill "Mistral OCR" (yzdame/mistral-ocr) from ClawHub.
Skill page: https://clawhub.ai/yzdame/mistral-ocr
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Canonical install target

openclaw skills install yzdame/mistral-ocr

ClawHub CLI

Package manager switcher

npx clawhub@latest install mistral-ocr
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
Name/description (Mistral OCR) align with required artifacts: the SKILL.md, README, requirements.txt and script all declare and use MISTRAL_API_KEY and the mistralai client to upload files for OCR. There are no unrelated credentials or binaries requested.
Instruction Scope
SKILL.md explicitly instructs uploading files to Mistral and warns about privacy. The included script only reads the user-specified input file, uses MISTRAL_API_KEY, uploads the file to Mistral, calls the OCR model, and writes output and extracted images locally. The instructions do not request additional system files, secrets, or unexpected external endpoints.
Install Mechanism
There is no automated install spec (instruction-only), and dependencies are limited to a single Python package (requirements.txt: mistralai). This is proportionate for a Python CLI tool that talks to Mistral. Installing from PyPI (mistralai) is the expected setup; no opaque downloads or extract-from-URL steps are present.
Credentials
The skill requires a single environment variable (MISTRAL_API_KEY) which is necessary and sufficient for the declared cloud OCR functionality. The code does not read additional environment variables or other credentials.
Persistence & Privilege
The skill is not always-enabled, does not request persistent platform privileges, and does not modify other skills or system-wide agent settings. It runs as a normal user-mode CLI script.
Assessment
This skill will upload any file you give it to Mistral's servers for OCR. Only install/use it for non-sensitive documents unless you accept that remote processing will occur and you've reviewed Mistral's privacy policy. Installing requires the mistralai Python package (pip install -r requirements.txt) and setting MISTRAL_API_KEY in your environment; limit the key's scope where possible and rotate/revoke it if you stop using the skill. Minor note: the package metadata in _meta.json shows a different ownerId than the registry header—not necessarily malicious but worth verifying the repository/source if provenance matters. If you need offline processing for confidential documents, use a local OCR tool instead.

Like a lobster shell, security has layers — review code before you run it.

documentvk973wch8r8z583gbe559t3q3es8163vglatestvk97fxnfvzv88zwfesmxyg8qvgx817tfxocrvk973wch8r8z583gbe559t3q3es8163vgpdfvk973wch8r8z583gbe559t3q3es8163vg
1.8kdownloads
5stars
5versions
Updated 1mo ago
v1.0.4
MIT-0

⚠️ Privacy Warning - 隐私警告

IMPORTANT - READ BEFORE INSTALLING:

This skill uploads your files to Mistral's cloud servers for OCR processing.

Do NOT use with sensitive or confidential documents unless:

  • You trust Mistral's data handling policies
  • You have reviewed Mistral's privacy policy
  • You accept that file contents will be transmitted and processed remotely

For sensitive documents, use offline/local OCR tools instead.


Mistral OCR Skill

A powerful OCR tool that converts PDF files and images into Markdown, JSON, or HTML formats using Mistral's state-of-the-art OCR API.

Installation

# Clone or download this repository
git clone https://github.com/YZDame/Mistral-OCR-SKILL.git
cd Mistral-OCR-SKILL

# Install dependencies
pip install -r requirements.txt

🔑 API Key Setup (Required)

Get your API key: 👉 https://console.mistral.ai/home

Set the environment variable:

export MISTRAL_API_KEY=your_api_key

CLI Usage

cd scripts

# Process PDF to Markdown
python3 mistral_ocr.py -i input.pdf

# Process PDF to JSON
python3 mistral_ocr.py -i input.pdf -f json

# Specify output directory
python3 mistral_ocr.py -i input.pdf -o ~/my_ocr_results

Arguments

FlagDescription
-i, --inputInput file path (required)
-f, --formatOutput format: markdown/json/html (default: markdown)
-o, --outputOutput directory

Data Privacy

What happens to your files:

  1. Files are uploaded to Mistral's OCR API
  2. Files are processed on Mistral servers
  3. Processing results are returned to you
  4. Files are not stored on Mistral servers (per Mistral policy)

For more details, see: https://mistral.ai/privacy-policy

License

MIT

Comments

Loading comments...