Install
openclaw skills install docx-to-html

Use this skill whenever the user has a DOCX file (.docx) and wants to convert, read, view, extract content from, or process it in any way, including summari...

This skill provides a straightforward method to convert Microsoft Word (.docx) documents into clean, semantic HTML, making them suitable for various web-based and AI-driven applications.
mammoth installed (core conversion engine)

To install Node.js dependencies, run once from the scripts/ directory:
npm install
1. Locate DOCX File: Identify the path to the .docx file to convert.
2. Run Conversion Script: Execute the Python wrapper from the skill's scripts/ directory:

   python3 <skill-dir>/scripts/convert.py <input_path.docx> <output_path.html>

   Replace <skill-dir> with the actual path where this skill is installed.
3. Verify Output: Open the generated .html file in a browser and check:
   - Headings (<h1>, <h2>, etc.) appear at the correct hierarchy levels
4. Process HTML: Use the resulting HTML for further tasks like summarization, indexing, or display.
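
The run and verify steps above can also be scripted. The following is a minimal sketch, not part of the skill itself: `run_conversion` and `heading_outline` are hypothetical helper names, and the only assumption made about convert.py is the command-line shape shown above (input path, then output path). The heading check uses only the Python standard library.

```python
import subprocess
import sys
from html.parser import HTMLParser
from pathlib import Path
from typing import List, Optional, Tuple


def run_conversion(skill_dir: str, docx_path: str, html_path: str) -> None:
    """Call the wrapper as in the command above; raises CalledProcessError on failure."""
    script = Path(skill_dir) / "scripts" / "convert.py"
    # sys.executable stands in for python3 (assumption: same interpreter is acceptable).
    subprocess.run([sys.executable, str(script), docx_path, html_path], check=True)


class _HeadingCollector(HTMLParser):
    """Collects (level, text) pairs for every <h1>..<h6> element."""

    def __init__(self) -> None:
        super().__init__()
        self.outline: List[Tuple[int, str]] = []
        self._level: Optional[int] = None
        self._text: List[str] = []

    def handle_starttag(self, tag, attrs):
        # Match h1..h6 only; tags such as <hr> or <html> are ignored.
        if len(tag) == 2 and tag[0] == "h" and tag[1] in "123456":
            self._level = int(tag[1])
            self._text = []

    def handle_data(self, data):
        if self._level is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if self._level is not None and tag == f"h{self._level}":
            self.outline.append((self._level, "".join(self._text).strip()))
            self._level = None


def heading_outline(html_text: str) -> List[Tuple[int, str]]:
    """Return the document's headings in order, for a quick hierarchy check."""
    collector = _HeadingCollector()
    collector.feed(html_text)
    collector.close()
    return collector.outline
```

After `run_conversion(...)` succeeds, passing the contents of the generated .html file to `heading_outline` gives a list of levels and titles that can be compared against the structure of the original document without opening a browser.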
- scripts/docx-converter.js: Core Node.js conversion logic using mammoth.js.
- scripts/convert.py: Python wrapper for invoking the Node.js converter.
- scripts/package.json: Node.js dependency manifest (includes mammoth).

The conversion leverages mammoth.js, which prioritizes semantic meaning over visual replication:
Headings become <h1>/<h2>, lists become <ul>/<ol>, etc.

| Problem | Likely Cause | Fix |
|---|---|---|
| node: command not found | Node.js not installed | Install Node.js (v16+) |
| Cannot find module 'mammoth' | npm deps missing | Run npm install in scripts/ |
| Empty or garbled output | Corrupted or password-protected DOCX | Try re-saving the file from Microsoft Word |
| Missing images | Large embedded images | Check mammoth.js image size limits in docx-converter.js |
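
Several of the problems in the table above can be detected before running a conversion. The sketch below is illustrative only: `diagnose` is a hypothetical helper, and it assumes that npm install places packages in scripts/node_modules (npm's default behavior). The DOCX check relies on the fact that a valid .docx file is a ZIP archive containing word/document.xml, whereas a corrupted or password-protected file usually does not open as a ZIP.

```python
import shutil
import zipfile
from pathlib import Path
from typing import List


def diagnose(scripts_dir: str, docx_path: str) -> List[str]:
    """Return likely problems; an empty list means none of the known issues was found."""
    problems: List[str] = []

    # "node: command not found" means Node.js is not on PATH.
    if shutil.which("node") is None:
        problems.append("Node.js not found on PATH: install Node.js (v16+)")

    # "Cannot find module 'mammoth'" means npm install was not run in scripts/.
    # Assumption: dependencies live in scripts/node_modules.
    if not (Path(scripts_dir) / "node_modules" / "mammoth").is_dir():
        problems.append("mammoth module missing: run npm install in scripts/")

    # Empty or garbled output often traces back to an unreadable input file.
    docx = Path(docx_path)
    if not docx.is_file():
        problems.append("input file not found: " + str(docx))
    elif not zipfile.is_zipfile(str(docx)):
        problems.append("input is not a ZIP archive: corrupted or password-protected DOCX")
    else:
        with zipfile.ZipFile(str(docx)) as archive:
            if "word/document.xml" not in archive.namelist():
                problems.append("ZIP archive lacks word/document.xml: not a Word document")

    return problems
```

Printing the result of `diagnose("<skill-dir>/scripts", "report.docx")` before a conversion gives a quick first pass; the missing-images case is not covered because it depends on settings inside docx-converter.js.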