Install
openclaw skills install webpage-exportExport webpages into clean local TXT, DOCX, and PDF files with source metadata, fallback extraction logic, and browser-assisted recovery for difficult pages. Useful for archiving articles, policy pages, WeChat posts, official notices, and other webpages before downstream analysis or sharing.
openclaw skills install webpage-exportUse this skill to turn a webpage URL into local files that downstream agents can archive, send, or reference.
scripts/export_webpage.py <url> to create a TXT snapshot first.--docx when the user wants a Word document.--pdf when Chrome/Chromium is available and the user wants a PDF.--outdir when the user provides one; otherwise let the script use its local default export folder under the current working directory.python3 scripts/export_webpage.py "<url>"
python3 scripts/export_webpage.py "<url>" --docx
python3 scripts/export_webpage.py "<url>" --pdf
python3 scripts/export_webpage.py "<url>" --docx --pdf --outdir ./exports/temp
python3.curl for baseline webpage fetching.node and the playwright package.textutil.--outdir values for production or shared environments.curltextutil on macOSWhen the user wants PDF, prefer Chrome/Chromium headless printing because it preserves Chinese text and webpage layout better than ad-hoc PDF generation.
Read references/chrome-pdf-guide.md when:
Read references/accuracy-and-fallbacks.md when:
Read references/delivery-rules.md when:
textutil.Accuracy is the top standard. Keep original title, original URL, and extracted source metadata. If any field is uncertain, mark it as missing instead of guessing.