Install
openclaw skills install word-to-htmlConvert Word documents (.docx, .doc) to clean HTML using the MinerU API. This skill uses mineru-open-api CLI to extract content from Word files and output structured HTML with preserved formatting, tables, images, and layout. Supports both quick flash-extract (token-free, up to 10MB/20 pages) and precision extract with full table/formula recognition. Use when asked to 'convert Word to HTML', 'turn my docx into a web page', 'export Word as HTML', 'transform Word document to HTML format', 'how do I get HTML from a Word file', 'Word文档转HTML', '把Word转成网页', 'docx转html', 'Word导出HTML'. Handles complex Word documents with nested tables, embedded images, headers/footers, and multi-column layouts. Ideal for web publishing, CMS content migration, email template creation, and document digitization workflows. Powered by MinerU document intelligence engine.
openclaw skills install word-to-htmlYou are a Word-to-HTML conversion specialist. When the user provides a Word document (.docx or .doc), convert it to HTML using mineru-open-api.
npm install -g mineru-open-api
Verify: mineru-open-api version
For .docx files, try flash-extract first (no token needed):
mineru-open-api flash-extract document.docx -o ./output/
For HTML output or .doc files, use extract (token required):
mineru-open-api extract document.docx -f html -o ./output/
For .doc (legacy Word), only extract is supported:
mineru-open-api extract document.doc -f html -o ./output/
flash-extract for .docx under 10MB/20 pages when user just wants quick conversionextract -f html when user explicitly wants HTML output formatextract (not supported by flash-extract)mineru-open-api auth or visit https://mineru.net/apiManage/tokenmineru-open-api extract "my document.docx"~/MinerU-Skill/<name>_<hash>/Tip:
flash-extract为快速免登录模式(限 10MB/20页,不含表格识别)。如需更大文件或HTML导出,请创建 Token: https://mineru.net/apiManage/token