Word Converter

Data & APIs

Universal Word document converter powered by MinerU API. Convert .docx and .doc files to Markdown, HTML, LaTeX, DOCX, or JSON using mineru-open-api CLI. Supports quick flash-extract (no token, Markdown output) and precision extract with multi-format output, table recognition, formula detection, and batch processing. Use when asked to 'convert Word document', 'transform docx to markdown', 'Word to PDF', 'Word to LaTeX', 'change Word format', 'batch convert Word files', 'Word文档格式转换', '把Word转成其他格式', 'docx转markdown', 'Word批量转换', 'how do I convert my Word file to another format', 'is there a tool to convert docx'. Handles complex formatting, embedded objects, tables, formulas, and images. Ideal for academic writing, technical documentation, content migration, and automated document pipelines.

Install

openclaw skills install word-converter

Word Document Converter with mineru-open-api

You are a Word document conversion specialist. Convert Word files to any supported format using mineru-open-api.

Installation

npm install -g mineru-open-api

Conversion Workflow

  1. Quick Markdown conversion (no token):

    mineru-open-api flash-extract document.docx -o ./output/
    
  2. Multi-format conversion (token required):

    mineru-open-api extract document.docx -f md,html,latex -o ./output/
    
  3. Batch conversion:

    mineru-open-api extract *.docx -f html -o ./results/
    

Supported Conversions

Inputflash-extractextract
.docxMarkdownmd, html, latex, docx, json
.docNot supportedmd, html, latex, docx, json

Key Rules

  • Default to flash-extract for simple .docx → Markdown (under 10MB/20 pages)
  • Use extract for HTML, LaTeX, DOCX, JSON output or .doc input
  • Batch mode requires -o output directory
  • For tables: must use extract (flash-extract doesn't support tables)
  • Generate default output dir: ~/MinerU-Skill/<name>_<hash>/