Install
openclaw skills install docx-pdf-knowledge-parserParse local `.docx` and `.pdf` files into structured knowledge artifacts with detailed reports, tracking successes, failures, and summaries without auto-writ...
openclaw skills install docx-pdf-knowledge-parserUse this skill to turn local or uploaded .docx and .pdf files into structured, reviewable knowledge outputs.
.docx and .pdf files..docx and .pdf in v1.0.MEMORY.md directly..docx file path.pdf file path.docx and .pdf files in one directory.docx.pdfingest-report.mdkb-items.jsonlfailed-items.jsonlMEMORY.candidate.mdMEMORY.md in v1.0.Treat these as parseable in v1.0:
.docx.pdfRoute here when the file is out of scope or low-confidence in v1.0:
.pptxRoute here when the file cannot be opened, parsed, or extracted successfully.
batch_idstarted_at.docx -> use parsers/parse_docx.py.pdf -> use parsers/parse_pdf.pykb-items.jsonlfailed-items.jsonlingest-report.mdMEMORY.candidate.mdfinished_atMEMORY.mdWrite one JSON object per successfully extracted knowledge item with at least:
batch_idsource_filesource_pathfile_typetopiccontent_typesummaryextracted_atconfidenceWrite one JSON object per failed file with at least:
batch_idsource_filesource_pathfile_typefailure_reasonerror_detailsuggested_actionfailed_atInclude:
batch_id, started_at, finished_at, source_directory or source_file)Include:
MEMORY.candidate.md unless the workflow explicitly allows it.run.py: minimal batch runner for local testingparsers/parse_docx.py: docx text extraction helperparsers/parse_pdf.py: pdf text extraction helperreferences/output_examples.md: sample output shapes and field guidanceREADME.md: setup and usage notes