Office Docs

v1.0.0

Process and manage Microsoft Word (.docx) and WPS documents for creation, editing, format conversion, text extraction, analysis, troubleshooting, and batch o...

⭐ 0· 263·3 current·3 all-time

by@baiyunrei2025

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for baiyunrei2025/office-docs.

Previewing Install & Setup.

Prompt PreviewInstall & Setup

Install the skill "Office Docs" (baiyunrei2025/office-docs) from ClawHub.
Skill page: https://clawhub.ai/baiyunrei2025/office-docs
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install office-docs

ClawHub CLI

Package manager switcher

npx clawhub@latest install office-docs

Security Scan

VirusTotal

Benign

View report →

OpenClaw

Suspicious

medium confidence

ℹ

Purpose & Capability

The name and description (docx/WPS processing: create, edit, convert, extract) align with the included instructions (python-docx examples, pandoc/libreoffice/unoconv, exiftool). However some referenced libraries appear incorrect or mismatched for the stated purpose (e.g., 'pywps' is likely not a WPS Office document manipulation library), and the skill references several files (CONVERSION.md, ANALYSIS.md, TROUBLESHOOTING.md, CREATION.md, EDITING.md) that are not present in the package. These discrepancies suggest sloppy packaging or incorrect dependencies.

ℹ

Instruction Scope

SKILL.md contains explicit code and shell commands that stay within the scope of document processing (reading/writing .docx, batch conversion, metadata extraction). No external network endpoints or secret access are requested. Concerns: the included references/TEXT_EXTRACTION.md is truncated and contains an apparent bug/incomplete code snippet, and SKILL.md points to multiple missing reference docs — this may lead to unpredictable behavior if an agent follows incomplete instructions.

✓

Install Mechanism

This is an instruction-only skill with no install spec or bundled executables, so nothing will be written to disk by an installer. The skill recommends using third-party tools (python-docx, pandoc, libreoffice, exiftool) but does not install them itself — appropriate for an instruction-only skill.

✓

Credentials

No environment variables, credentials, or config paths are requested. The listed operations only require local tools and libraries, which is proportionate to the claimed functionality.

✓

Persistence & Privilege

The skill is not always-enabled and does not request persistent privileges or self-modifying configuration. Autonomous invocation is allowed by platform default but is not combined with other high-risk factors here.

What to consider before installing

This skill appears to be a straightforward document-processing guide, but the package is incomplete and has some likely mistakes. Before installing or using it: 1) Do not run batch conversion commands on sensitive system directories — test on a safe folder first. 2) Verify the required tools (python-docx, pandoc, libreoffice, exiftool) are the intended packages and that you install them from official sources. 3) Ask the author or maintainer for the missing reference files (CONVERSION.md, TROUBLESHOOTING.md, etc.) and a corrected TEXT_EXTRACTION.md (the included file is truncated and contains an incomplete code fragment). 4) Confirm whether the 'pywps' mention is correct for WPS Office support; if not, don't rely on the skill for WPS files. Because of these inconsistencies, treat the skill as unreliable until the documentation and dependencies are clarified.

Like a lobster shell, security has layers — review code before you run it.

latestvk97acadg5xyfjhm58jp06t78c1833yhq

263downloads

0stars

1versions

Updated 23h ago

v1.0.0

MIT-0

Office Documents Skill

This skill provides comprehensive tools and workflows for working with Microsoft Word (.docx) and WPS Office documents. It covers creation, editing, conversion, analysis, and troubleshooting of professional documents.

Quick Start

Basic Operations

Read document content:

# Use python-docx for .docx files
from docx import Document
doc = Document('document.docx')
text = '\n'.join([paragraph.text for paragraph in doc.paragraphs])

Create new document:

from docx import Document
from docx.shared import Inches

doc = Document()
doc.add_heading('Document Title', 0)
doc.add_paragraph('This is a new paragraph.')
doc.save('new_document.docx')

Common Tasks

Text extraction - See TEXT_EXTRACTION.md
Format conversion - See CONVERSION.md
Document analysis - See ANALYSIS.md
Troubleshooting - See TROUBLESHOOTING.md

Core Tools and Libraries

Python Libraries

For .docx files:

python-docx - Primary library for reading/writing .docx
docx2txt - Simple text extraction
docxcompose - Advanced document composition
docx-mailmerge - Mail merge functionality

For WPS files:

pywps - WPS file manipulation (when available)
Conversion to .docx first recommended

For format conversion:

pandoc - Universal document converter
libreoffice - Office suite for conversion
unoconv - Universal office converter

Command Line Tools

Document conversion:

# Convert .docx to PDF
libreoffice --headless --convert-to pdf document.docx

# Convert .docx to text
pandoc document.docx -o document.txt

# Batch convert WPS to .docx
for file in *.wps; do libreoffice --headless --convert-to docx "$file"; done

Document analysis:

# Extract metadata
exiftool document.docx

# Check file integrity
file document.docx

Workflows

1. Document Creation Workflow

When creating new documents:

Choose template - Start from template or create from scratch
Add structure - Headings, paragraphs, lists
Apply formatting - Styles, fonts, spacing
Add elements - Tables, images, hyperlinks
Finalize - Page setup, headers/footers, save

See CREATION.md for detailed patterns.

2. Document Editing Workflow

When modifying existing documents:

Backup original - Always create backup first
Analyze structure - Understand document layout
Make changes - Edit content, update formatting
Preserve formatting - Maintain original styles
Validate - Check for corruption, save new version

See EDITING.md for detailed patterns.

3. Conversion Workflow

When converting between formats:

Identify source format - .docx, .wps, .doc, .rtf, etc.
Choose conversion tool - Based on format and requirements
Convert - With appropriate options
Verify - Check content preservation
Clean up - Remove temporary files

See CONVERSION.md for detailed patterns.

Common Issues and Solutions

1. Corrupted Documents

Symptoms: Won't open, error messages, missing content

Solutions:

Try opening in different application
Use recovery mode in Word/WPS
Extract content with python-docx ignoring errors
Convert to different format and back

See TROUBLESHOOTING.md for detailed recovery procedures.

2. Formatting Issues

Symptoms: Wrong fonts, broken layout, missing styles

Solutions:

Check style definitions
Verify font availability
Use template-based approach
Simplify complex formatting

3. Compatibility Problems

Symptoms: Different appearance in Word vs WPS, missing features

Solutions:

Stick to common features
Test in both applications
Use standard formats
Provide alternative versions

Advanced Features

Document Automation

Batch processing:

import os
from docx import Document

def process_documents(folder_path):
    for filename in os.listdir(folder_path):
        if filename.endswith('.docx'):
            doc_path = os.path.join(folder_path, filename)
            process_single_document(doc_path)

Template-based generation:

from docx import Document

def generate_from_template(template_path, data):
    doc = Document(template_path)
    # Replace placeholders with data
    for paragraph in doc.paragraphs:
        for key, value in data.items():
            if f'{{{{ {key} }}}}' in paragraph.text:
                paragraph.text = paragraph.text.replace(f'{{{{ {key} }}}}', value)
    return doc

Document Analysis

Extract statistics:

def analyze_document(doc_path):
    doc = Document(doc_path)
    stats = {
        'paragraphs': len(doc.paragraphs),
        'tables': len(doc.tables),
        'images': len(doc.inline_shapes),
        'sections': len(doc.sections),
        'styles': len(doc.styles)
    }
    return stats

Check formatting consistency:

def check_formatting(doc):
    issues = []
    for i, para in enumerate(doc.paragraphs):
        if para.style.name == 'Normal' and para.text.strip():
            # Check for inconsistent formatting
            if len(para.runs) > 1:
                issues.append(f"Paragraph {i}: Multiple runs in Normal style")
    return issues

Best Practices

1. Always Backup

import shutil
import os

def backup_document(filepath):
    backup_path = filepath + '.backup'
    shutil.copy2(filepath, backup_path)
    return backup_path

2. Use Version Control

Save incremental versions
Use descriptive filenames
Document changes made

3. Test Thoroughly

Test in target application
Verify all content preserved
Check formatting integrity

4. Handle Errors Gracefully

try:
    doc = Document(filepath)
except Exception as e:
    print(f"Error opening {filepath}: {e}")
    # Try alternative methods
    return extract_text_fallback(filepath)

Reference Files

For detailed information on specific topics, consult these reference files:

TEXT_EXTRACTION.md - Text extraction methods and patterns
CONVERSION.md - Format conversion guides
ANALYSIS.md - Document analysis techniques
TROUBLESHOOTING.md - Common issues and solutions
CREATION.md - Document creation patterns
EDITING.md - Document editing workflows
AUTOMATION.md - Automation scripts and templates

Scripts

Available scripts in the scripts/ directory:

extract_text.py - Extract text from .docx files
convert_format.py - Convert between document formats
batch_process.py - Process multiple documents
document_stats.py - Generate document statistics
repair_document.py - Attempt to repair corrupted documents

Run scripts with appropriate parameters:

python scripts/extract_text.py input.docx output.txt

Getting Help

If you encounter issues not covered in this skill:

Check the relevant reference file
Search for specific error messages
Try alternative approaches
Consider converting to simpler format

Remember: When in doubt, create a backup and work on a copy.

Comments

Loading comments...