Install
openclaw skills install word-comments-extractor

Extract comments from Word (.docx) documents and format them into standardized review opinions. The script handles data extraction (comment text, anchor text, page numbers) and outputs structured JSON. The agent then performs semantic polishing to produce professional, publication-ready review opinions.
Core capabilities:
Script (extract_comments.py): Handles all data extraction locally. Unpacks the .docx file (using Python's built-in zipfile), parses the XML to extract comments and anchor text, and retrieves page numbers via Word COM. Outputs JSON. No external dependencies beyond pywin32.

python extract_comments.py <docx_file_path>
The script takes a single argument — the path to the .docx file. It handles unpacking internally.
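The unpack-and-parse step can be sketched roughly as follows. This is a minimal illustration, not the actual contents of extract_comments.py: the function name `read_comments` is hypothetical, it assumes the standard WordprocessingML layout (`word/comments.xml` plus `commentRangeStart` / `commentRangeEnd` markers in `word/document.xml`), and it leaves `page` as `None` because the Word COM page lookup is not reproduced here.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used by .docx parts.
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}


def read_comments(docx_path):
    """Hypothetical helper: return comments with anchor text, no page numbers."""
    with zipfile.ZipFile(docx_path) as zf:
        # A document without comments has no word/comments.xml part.
        if "word/comments.xml" not in zf.namelist():
            return []
        comments_root = ET.fromstring(zf.read("word/comments.xml"))
        document_root = ET.fromstring(zf.read("word/document.xml"))

    # Walk the body in document order and collect the text that falls
    # between each commentRangeStart / commentRangeEnd pair.
    anchors = {}
    open_ids = set()
    for el in document_root.iter():
        if el.tag == f"{{{W}}}commentRangeStart":
            cid = el.get(f"{{{W}}}id")
            open_ids.add(cid)
            anchors.setdefault(cid, [])
        elif el.tag == f"{{{W}}}commentRangeEnd":
            open_ids.discard(el.get(f"{{{W}}}id"))
        elif el.tag == f"{{{W}}}t" and el.text:
            for cid in open_ids:
                anchors[cid].append(el.text)

    results = []
    for index, comment in enumerate(comments_root.findall("w:comment", NS), start=1):
        cid = comment.get(f"{{{W}}}id")
        text = "".join(t.text or "" for t in comment.iter(f"{{{W}}}t"))
        results.append({
            "index": index,
            "page": None,  # assumption: filled in separately via Word COM
            "comment_text": text,
            "anchor_text": "".join(anchors.get(cid, [])),
        })
    return results
```

Passing the result to `json.dumps(..., ensure_ascii=False, indent=2)` would give a JSON array of comment objects, with `page` still unset in this sketch.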
Output is a JSON array, each element containing:
- index: Comment sequence number
- page: Page number
- comment_text: Original comment text
- anchor_text: The document text that the comment is attached to

After receiving the JSON data, the agent processes each comment:
Extract a concise, precise content description from the anchor text for the "regarding XX" part of the output.
Requirements:
Examples:
Core principle: Understand intent, rewrite professionally, never mechanically concatenate.
Rules:
1. Understand the commenter's true intent
2. Combine with anchor text context
3. Neither expand nor reduce
4. Professional language standards
5. Prohibited error patterns
| Original comment | Anchor text context | Correct polishing |
|---|---|---|
| The reason for the price decline wasn't mentioned | Sensor gross margin decline, unit price decline | Please supplement the specific reasons for the average unit price decline |
| The wording here isn't very clear, it's actually more about product mix or specific products, specific customers having a bigger impact | Oxygen sensor revenue fluctuation | Please clarify the core factors driving the fluctuation: product mix, specific product characteristics, and specific customer dynamics |
| Add numbering to subheadings, same below | Oxygen sensor downstream domestic substitution | Please add numbering; apply the same numbering format to all subsequent subheadings |
| The performance improvement compared to externally sourced chip modules needs to be mentioned here | MEMS pressure sensor cost | Please supplement the specific performance improvements of the self-developed chip module compared to externally sourced modules |
| After reading, the comparison doesn't convey much information. Are there more in-depth capacity parameter comparisons? | Capacity parameter comparison table | The current comparison lacks depth. Please supplement with a more detailed cross-comparison of core capacity parameters |
| Typo? | Text contains character error | Please verify and correct the typo at this location |
| Be more precise, make it clear this is projected | Gross margin related statement | Please ensure precise wording, explicitly stating the "projected" nature to avoid ambiguity |
Each comment formatted as:
[number]. Page [X]: Regarding [description], [polished suggestion]
Overall structure:
[comment 1]
[comment 2]
...
Total: XX review opinions
================================================================================
[Page number note]
Page numbers correspond to physical pages in the document and may differ from
displayed page numbers (e.g., if the document has cover pages or table of contents
that are not numbered). If adjustment is needed, provide the offset between
physical and displayed page numbers for batch correction.
================================================================================
Output requirement: Only output the polished comment list + page number note. No additional summaries, category descriptions, or polishing explanations.
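As an illustration of the output layout and the batch page correction, the following sketch assembles the final text from already-polished items. The function name `format_opinions`, the `description` and `suggestion` keys, and the convention that the displayed page equals the physical page minus the offset are all assumptions made for this example; the skill itself does not specify them.

```python
SEPARATOR = "=" * 80

PAGE_NOTE = (
    "[Page number note]\n"
    "Page numbers correspond to physical pages in the document and may differ from\n"
    "displayed page numbers (e.g., if the document has cover pages or table of contents\n"
    "that are not numbered). If adjustment is needed, provide the offset between\n"
    "physical and displayed page numbers for batch correction."
)


def format_opinions(items, page_offset=0):
    """Hypothetical helper: render polished items in the documented layout.

    Each item is assumed to carry 'page' (physical page number),
    'description' and 'suggestion' (both produced by the agent).
    """
    lines = []
    for number, item in enumerate(items, start=1):
        # Assumed convention: displayed page = physical page - offset.
        page = item["page"] - page_offset
        lines.append(
            f"{number}. Page {page}: Regarding {item['description']}, {item['suggestion']}"
        )
    lines.append(f"Total: {len(items)} review opinions")
    lines.extend([SEPARATOR, PAGE_NOTE, SEPARATOR])
    return "\n".join(lines)
```

For example, with a two-page unnumbered front section, `format_opinions(items, page_offset=2)` would shift every reported page down by two in a single pass.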
pip install pywin32