Install
openclaw skills install doc-to-html

Convert Word documents (.doc, .docx) to HTML using MinerU's document processing engine. The skill produces clean HTML output that preserves document structure and formatting.

Features:
- HTML output from Word files with preserved layout
- Supports both legacy .doc and modern .docx
- Maintains headings, tables, lists, and paragraph formatting in HTML

Use when you need to: convert a Word document to HTML, turn .docx into a web page, generate HTML from Word files, or create HTML content from .doc files.

Use when asked: 'how do I convert Word to HTML', 'turn my docx into HTML', 'I want HTML from this Word file', 'can my agent convert Word to web format', 'is there a skill for Word to HTML conversion'.

Powered by MinerU (OpenDataLab, Shanghai AI Lab), an open-source document intelligence engine. Supports English, Chinese, and multilingual documents. Ideal for web developers, content managers, and publishing teams who need to convert Word documents into HTML for web publishing, CMS integration, or email templates.
Convert Word (.doc/.docx) documents to HTML using MinerU.

Install the CLI:
npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest
# Convert .docx to HTML (requires token)
mineru-open-api extract report.docx -f html -o ./out/
# Convert .doc to HTML (requires token)
mineru-open-api extract report.doc -f html -o ./out/
# With language hint
mineru-open-api extract report.docx -f html --language en -o ./out/
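
The single-file commands above can be wrapped in a loop to convert a whole folder. The following is a minimal sketch, not part of the skill itself: `convert_dir` is a hypothetical helper name, and it assumes only the `extract` subcommand and the `-f html` and `-o` flags shown above.

```shell
# Hypothetical helper (not shipped with the CLI): convert every .doc and
# .docx file in a directory to HTML, one extract call per file.
convert_dir() {
  local src_dir="$1"
  local out_dir="$2"
  local failed=0
  local f
  mkdir -p "$out_dir"
  for f in "$src_dir"/*.doc "$src_dir"/*.docx; do
    [ -e "$f" ] || continue   # skip a pattern that matched no files
    echo "Converting: $f"
    # Same invocation as the single-file examples; count failures
    # instead of stopping at the first one.
    mineru-open-api extract "$f" -f html -o "$out_dir" || failed=$((failed + 1))
  done
  echo "Failed conversions: $failed"
  [ "$failed" -eq 0 ]         # non-zero exit status if any file failed
}
```

Usage, assuming a token is already configured: `convert_dir ./reports ./out`. Counting failures rather than aborting lets one unreadable file pass without blocking the rest of the batch.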
Token required:
mineru-open-api auth # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable
Create token at: https://mineru.net/apiManage/token
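
Because HTML extraction needs a token, a script may want to fail early when none is exported. The sketch below is an assumption-laden illustration: `require_token` is a hypothetical function name, and it only checks the `MINERU_TOKEN` environment variable, so it will not detect a token stored by the interactive `mineru-open-api auth` setup.

```shell
# Hypothetical pre-flight check: covers only the environment-variable route.
require_token() {
  if [ -z "${MINERU_TOKEN:-}" ]; then
    echo "MINERU_TOKEN is not set in the environment." >&2
    echo "Run 'mineru-open-api auth' or export MINERU_TOKEN first." >&2
    return 1
  fi
}
```

Usage: `require_token && mineru-open-api extract report.docx -f html -o ./out/`.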
Notes:
- HTML output (-f html) requires extract with a token; it is not available in flash-extract.
- Set the document language with --language (default: ch; use en for English).
- Use -o <dir> to save the output to a file.