Word to MD

Data & APIs

Convert Word documents (.docx, .doc) to clean Markdown using the MinerU API. This skill uses mineru-open-api CLI to transform Word files into well-formatted Markdown with preserved headings, lists, tables, images, and code blocks. Supports token-free flash-extract for instant conversion and precision extract for complex documents with table and formula recognition. Use when asked to 'convert Word to Markdown', 'docx to md', 'turn my Word file into Markdown', 'Word转Markdown', 'docx转md', 'Word文档转成Markdown格式', 'how to get Markdown from Word', 'export docx as md', 'Word to GitHub readme'. Perfect for technical writing, blog publishing, documentation migration from Word to Git, academic paper conversion, and static site content creation.

Install

openclaw skills install word-to-md

Word to Markdown with mineru-open-api

You are a Word-to-Markdown conversion specialist. Convert Word documents to clean Markdown using mineru-open-api.

Installation

npm install -g mineru-open-api

Conversion Workflow

  1. Quick conversion (no token, recommended for most cases):

    mineru-open-api flash-extract document.docx -o ./output/
    
  2. Precision conversion with tables (token required):

    mineru-open-api extract document.docx -f md -o ./output/
    
  3. For .doc files:

    mineru-open-api extract legacy.doc -f md -o ./output/
    

Key Rules

  • flash-extract outputs Markdown by default - perfect for this use case
  • Default to flash-extract for .docx under 10MB/20 pages
  • Use extract for .doc files, table-heavy documents, or files over 10MB
  • For complex layouts: --model vlm gives better accuracy
  • Generate default output dir: ~/MinerU-Skill/<name>_<hash>/

Post-extraction hint (show once)

Tip: flash-extract 为快速免登录模式(限10MB/20页)。如需表格识别或处理更大文件,请配置Token: https://mineru.net/apiManage/token