Skill flagged — suspicious patterns detected

ClawHub Security flagged this skill as suspicious. Review the scan results before using.

Office Docs

v1.0.0

Process and manage Microsoft Word (.docx) and WPS documents for creation, editing, format conversion, text extraction, analysis, troubleshooting, and batch o...

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for baiyunrei2025/office-docs.

Install the skill "Office Docs" (baiyunrei2025/office-docs) from ClawHub.
Skill page: https://clawhub.ai/baiyunrei2025/office-docs
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install office-docs

ClawHub CLI

npx clawhub@latest install office-docs

Security Scan

VirusTotal: Benign
OpenClaw: Suspicious (medium confidence)

Purpose & Capability
The name and description (docx/WPS processing: create, edit, convert, extract) align with the included instructions (python-docx examples, pandoc/libreoffice/unoconv, exiftool). However some referenced libraries appear incorrect or mismatched for the stated purpose (e.g., 'pywps' is likely not a WPS Office document manipulation library), and the skill references several files (CONVERSION.md, ANALYSIS.md, TROUBLESHOOTING.md, CREATION.md, EDITING.md) that are not present in the package. These discrepancies suggest sloppy packaging or incorrect dependencies.
Instruction Scope
SKILL.md contains explicit code and shell commands that stay within the scope of document processing (reading/writing .docx, batch conversion, metadata extraction). No external network endpoints or secret access are requested. Concerns: the included references/TEXT_EXTRACTION.md is truncated and contains an apparent bug/incomplete code snippet, and SKILL.md points to multiple missing reference docs — this may lead to unpredictable behavior if an agent follows incomplete instructions.
Install Mechanism
This is an instruction-only skill with no install spec or bundled executables, so nothing will be written to disk by an installer. The skill recommends using third-party tools (python-docx, pandoc, libreoffice, exiftool) but does not install them itself — appropriate for an instruction-only skill.
Credentials
No environment variables, credentials, or config paths are requested. The listed operations only require local tools and libraries, which is proportionate to the claimed functionality.
Persistence & Privilege
The skill is not always-enabled and does not request persistent privileges or self-modifying configuration. Autonomous invocation is allowed by platform default but is not combined with other high-risk factors here.
What to consider before installing
This skill appears to be a straightforward document-processing guide, but the package is incomplete and has some likely mistakes. Before installing or using it:

  1. Do not run batch conversion commands on sensitive system directories; test on a safe folder first.
  2. Verify that the required tools (python-docx, pandoc, libreoffice, exiftool) are the intended packages and install them from official sources.
  3. Ask the author or maintainer for the missing reference files (CONVERSION.md, TROUBLESHOOTING.md, etc.) and a corrected TEXT_EXTRACTION.md (the included file is truncated and contains an incomplete code fragment).
  4. Confirm whether the 'pywps' mention is correct for WPS Office support; if not, don't rely on the skill for WPS files.

Because of these inconsistencies, treat the skill as unreliable until the documentation and dependencies are clarified.

Like a lobster shell, security has layers — review code before you run it.

latest: vk97acadg5xyfjhm58jp06t78c1833yhq
263 downloads
0 stars
1 version
Updated 23h ago
v1.0.0
MIT-0

Office Documents Skill

This skill provides comprehensive tools and workflows for working with Microsoft Word (.docx) and WPS Office documents. It covers creation, editing, conversion, analysis, and troubleshooting of professional documents.

Quick Start

Basic Operations

Read document content:

# Use python-docx for .docx files
from docx import Document
doc = Document('document.docx')
text = '\n'.join([paragraph.text for paragraph in doc.paragraphs])

Create new document:

from docx import Document
from docx.shared import Inches

doc = Document()
doc.add_heading('Document Title', 0)
doc.add_paragraph('This is a new paragraph.')
doc.save('new_document.docx')

Common Tasks

  1. Text extraction - See TEXT_EXTRACTION.md
  2. Format conversion - See CONVERSION.md
  3. Document analysis - See ANALYSIS.md
  4. Troubleshooting - See TROUBLESHOOTING.md

Core Tools and Libraries

Python Libraries

For .docx files:

  • python-docx - Primary library for reading/writing .docx
  • docx2txt - Simple text extraction
  • docxcompose - Advanced document composition
  • docx-mailmerge - Mail merge functionality

For WPS files:

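  • pywps - Listed here for WPS file manipulation, but the PyWPS project implements the OGC Web Processing Service (a geospatial web standard), not WPS Office documents; do not rely on it for .wps files
  • Conversion to .docx first recommended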

For format conversion:

  • pandoc - Universal document converter
  • libreoffice - Office suite for conversion
  • unoconv - Universal office converter

Command Line Tools

Document conversion:

# Convert .docx to PDF
libreoffice --headless --convert-to pdf document.docx

# Convert .docx to plain text (pandoc treats a .txt output as Markdown unless -t plain is given)
pandoc document.docx -t plain -o document.txt

# Batch convert WPS to .docx
for file in *.wps; do libreoffice --headless --convert-to docx "$file"; done
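
The shell loop above can also be driven from Python, which makes it easier to send output to a separate folder and to stop on the first failure. The following is a minimal sketch, not part of the skill: the function names, the `converted` output folder, and the `dry_run` default are all assumptions chosen for illustration. It relies only on the `--headless`, `--convert-to`, and `--outdir` options of the LibreOffice command line.

```python
import subprocess
from pathlib import Path

def build_convert_command(source, target_format, outdir):
    """Return the LibreOffice argument list for one headless conversion."""
    return [
        "libreoffice", "--headless",
        "--convert-to", target_format,
        "--outdir", str(outdir),
        str(source),
    ]

def convert_folder(folder, pattern="*.wps", target_format="docx",
                   outdir="converted", dry_run=True):
    """Build one command per matching file; run them only when dry_run is False."""
    outdir = Path(outdir)
    commands = [
        build_convert_command(path, target_format, outdir)
        for path in sorted(Path(folder).glob(pattern))
    ]
    if not dry_run:
        outdir.mkdir(parents=True, exist_ok=True)
        for command in commands:
            # check=True raises CalledProcessError if LibreOffice exits non-zero
            subprocess.run(command, check=True)
    return commands
```

Calling `convert_folder("some_folder")` first and reading the returned commands is a cheap way to confirm which files would be touched before passing `dry_run=False`.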

Document analysis:

# Extract metadata
exiftool document.docx

# Identify the file type (a quick sanity check, not a full integrity test)
file document.docx
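
The `file` command only reports what kind of file it sees. Because a .docx file is a ZIP container, a somewhat deeper check is possible with the Python standard library alone. The sketch below is an illustration rather than part of the skill: the function name and the list of required parts are assumptions (the two parts named are the ones conventionally present in a Word document), and passing these checks does not guarantee that Word or WPS will open the file.

```python
import zipfile

# Parts conventionally present in a .docx package (assumed minimum set)
REQUIRED_PARTS = ("[Content_Types].xml", "word/document.xml")

def check_docx_integrity(path):
    """Return a list of problems found; an empty list means the basic checks passed."""
    if not zipfile.is_zipfile(path):
        return ["not a ZIP archive"]
    problems = []
    try:
        with zipfile.ZipFile(path) as archive:
            # testzip() reads every member and returns the first one with a bad CRC
            bad_member = archive.testzip()
            if bad_member is not None:
                problems.append(f"corrupt member: {bad_member}")
            names = set(archive.namelist())
            for part in REQUIRED_PARTS:
                if part not in names:
                    problems.append(f"missing part: {part}")
    except zipfile.BadZipFile as exc:
        problems.append(f"unreadable archive: {exc}")
    return problems
```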

Workflows

1. Document Creation Workflow

When creating new documents:

  1. Choose template - Start from template or create from scratch
  2. Add structure - Headings, paragraphs, lists
  3. Apply formatting - Styles, fonts, spacing
  4. Add elements - Tables, images, hyperlinks
  5. Finalize - Page setup, headers/footers, save

See CREATION.md for detailed patterns.

2. Document Editing Workflow

When modifying existing documents:

  1. Backup original - Always create backup first
  2. Analyze structure - Understand document layout
  3. Make changes - Edit content, update formatting
  4. Preserve formatting - Maintain original styles
  5. Validate - Check for corruption, save new version

See EDITING.md for detailed patterns.
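
The backup and validate steps can be wrapped around any edit so that a failed change never replaces the only good copy. The sketch below uses only the standard library; `edit_safely` and the `edit_func` callback are hypothetical names, and the validation is deliberately shallow (it only confirms the result is still a ZIP container, which every .docx must be).

```python
import shutil
import zipfile
from pathlib import Path

def edit_safely(path, edit_func):
    """Back up path, run edit_func(path), and restore the backup on failure."""
    path = Path(path)
    backup = path.with_name(path.name + ".backup")
    shutil.copy2(path, backup)                 # step 1: backup original
    try:
        edit_func(path)                        # steps 2-4: the caller's changes
        if not zipfile.is_zipfile(path):       # step 5: validate the result
            raise ValueError(f"{path} is no longer a valid .docx container")
    except Exception:
        shutil.copy2(backup, path)             # roll back, then re-raise
        raise
    return backup
```

The callback receives the path rather than an open document so that the same wrapper works whether the edit is made with python-docx, a command-line tool, or something else.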

3. Conversion Workflow

When converting between formats:

  1. Identify source format - .docx, .wps, .doc, .rtf, etc.
  2. Choose conversion tool - Based on format and requirements
  3. Convert - With appropriate options
  4. Verify - Check content preservation
  5. Clean up - Remove temporary files

See CONVERSION.md for detailed patterns.
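
Step 2, choosing a tool, can be made explicit in code. The routing rule below is an assumption made for illustration, not something the skill specifies: legacy binary sources (.doc, .wps), which pandoc does not read, and PDF output, for which pandoc needs an extra PDF engine, go to LibreOffice, and everything else goes to pandoc. The function names are hypothetical.

```python
from pathlib import Path

# Source formats pandoc cannot read; assumed to go through LibreOffice instead
BINARY_SOURCES = {".doc", ".wps"}

def choose_tool(source, target_format):
    """Pick 'libreoffice' or 'pandoc' from the source suffix and target format."""
    suffix = Path(source).suffix.lower()
    if suffix in BINARY_SOURCES or target_format == "pdf":
        return "libreoffice"
    return "pandoc"

def conversion_command(source, target_format):
    """Return the argument list for the chosen tool, without running it."""
    source = Path(source)
    if choose_tool(source, target_format) == "libreoffice":
        return ["libreoffice", "--headless", "--convert-to", target_format, str(source)]
    target = source.with_suffix("." + target_format)
    command = ["pandoc", str(source), "-o", str(target)]
    if target_format == "txt":
        # Without -t plain, pandoc writes Markdown to a .txt output file
        command[2:2] = ["-t", "plain"]
    return command
```

For instance, `conversion_command("notes.docx", "html")` yields a pandoc command, while `conversion_command("old.wps", "docx")` yields a LibreOffice one.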

Common Issues and Solutions

1. Corrupted Documents

Symptoms: Won't open, error messages, missing content

Solutions:

  • Try opening in different application
  • Use recovery mode in Word/WPS
  • Extract content with python-docx ignoring errors
  • Convert to different format and back

See TROUBLESHOOTING.md for detailed recovery procedures.
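
When python-docx refuses to open a file, the text can sometimes still be recovered by reading the main XML part directly, provided the ZIP container and `word/document.xml` are themselves intact. The following is a minimal standard-library sketch; `extract_text_raw` is a hypothetical name, and the approach ignores headers, footers, and footnotes, and may repeat text that sits inside text boxes.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used by w:p and w:t elements
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"

def extract_text_raw(path):
    """Pull paragraph text straight from word/document.xml, bypassing python-docx."""
    with zipfile.ZipFile(path) as archive:
        xml_bytes = archive.read("word/document.xml")
    root = ET.fromstring(xml_bytes)
    paragraphs = []
    for para in root.iter(f"{{{W_NS}}}p"):
        # w:t elements hold the visible text of each run in the paragraph
        text = "".join(node.text or "" for node in para.iter(f"{{{W_NS}}}t"))
        paragraphs.append(text)
    return "\n".join(paragraphs)
```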

2. Formatting Issues

Symptoms: Wrong fonts, broken layout, missing styles

Solutions:

  • Check style definitions
  • Verify font availability
  • Use template-based approach
  • Simplify complex formatting
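
To verify font availability, it helps to know which fonts a document actually names. The sketch below lists font names found in `w:rFonts` elements of the document body and style definitions, using only the standard library. The function name is hypothetical, and the result is a lower bound: fonts inherited from the theme are not resolved, and comparing the list against fonts installed on the target machine is left to the reader.

```python
import zipfile
import xml.etree.ElementTree as ET

W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
# Attributes of w:rFonts that carry explicit font names
FONT_ATTRS = ("ascii", "hAnsi", "eastAsia", "cs")

def fonts_referenced(path):
    """Return a sorted list of font names named in the body and style parts."""
    fonts = set()
    with zipfile.ZipFile(path) as archive:
        available = set(archive.namelist())
        for part in ("word/document.xml", "word/styles.xml"):
            if part not in available:
                continue
            root = ET.fromstring(archive.read(part))
            for rfonts in root.iter(f"{{{W_NS}}}rFonts"):
                for attr in FONT_ATTRS:
                    name = rfonts.get(f"{{{W_NS}}}{attr}")
                    if name:
                        fonts.add(name)
    return sorted(fonts)
```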

3. Compatibility Problems

Symptoms: Different appearance in Word vs WPS, missing features

Solutions:

  • Stick to common features
  • Test in both applications
  • Use standard formats
  • Provide alternative versions

Advanced Features

Document Automation

Batch processing:

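import os
from docx import Document

def process_single_document(doc_path):
    # Placeholder for the per-file work; here it only counts paragraphs
    return len(Document(doc_path).paragraphs)

def process_documents(folder_path):
    results = {}
    for filename in sorted(os.listdir(folder_path)):
        # Skip Word lock files (~$name.docx); match the extension case-insensitively
        if filename.lower().endswith('.docx') and not filename.startswith('~$'):
            doc_path = os.path.join(folder_path, filename)
            results[filename] = process_single_document(doc_path)
    return results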

Template-based generation:

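from docx import Document

def generate_from_template(template_path, data):
    doc = Document(template_path)
    # Replace {{ key }} placeholders in body paragraphs
    # (paragraphs inside tables, headers, and footers are not visited)
    for paragraph in doc.paragraphs:
        for key, value in data.items():
            placeholder = f'{{{{ {key} }}}}'
            if placeholder in paragraph.text:
                # Assigning paragraph.text collapses the paragraph into one run,
                # so character formatting inside the paragraph is lost
                paragraph.text = paragraph.text.replace(placeholder, str(value))
    return doc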

Document Analysis

Extract statistics:

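from docx import Document

def analyze_document(doc_path):
    doc = Document(doc_path)
    stats = {
        'paragraphs': len(doc.paragraphs),   # body paragraphs only, not table cells
        'tables': len(doc.tables),
        'images': len(doc.inline_shapes),    # inline shapes only; floating images are not counted
        'sections': len(doc.sections),
        'styles': len(doc.styles)
    }
    return stats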

Check formatting consistency:

def check_formatting(doc):
    issues = []
    for i, para in enumerate(doc.paragraphs):
        if para.style.name == 'Normal' and para.text.strip():
            # Check for inconsistent formatting
            if len(para.runs) > 1:
                issues.append(f"Paragraph {i}: Multiple runs in Normal style")
    return issues

Best Practices

1. Always Backup

import shutil

def backup_document(filepath):
    # copy2 preserves timestamps; an existing .backup file is overwritten
    backup_path = filepath + '.backup'
    shutil.copy2(filepath, backup_path)
    return backup_path

2. Use Version Control

  • Save incremental versions
  • Use descriptive filenames
  • Document changes made
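
Incremental versions are easier to keep consistent when the next filename is computed rather than typed. The sketch below is one possible convention, not something the skill prescribes: the `_v2`, `_v3`, ... suffix scheme and the function name are assumptions.

```python
from pathlib import Path

def next_version_path(path):
    """Return the first unused sibling path of the form name_v2.ext, name_v3.ext, ..."""
    path = Path(path)
    version = 2
    while True:
        candidate = path.with_name(f"{path.stem}_v{version}{path.suffix}")
        if not candidate.exists():
            return candidate
        version += 1
```

Saving an edited document to `next_version_path("report.docx")` leaves the original untouched and never overwrites an earlier version.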

3. Test Thoroughly

  • Test in target application
  • Verify all content preserved
  • Check formatting integrity

4. Handle Errors Gracefully

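from docx import Document

def read_text(filepath):
    try:
        doc = Document(filepath)
    except Exception as e:
        print(f"Error opening {filepath}: {e}")
        # Try alternative methods; extract_text_fallback is not defined in
        # this skill and must be supplied by the caller's own code
        return extract_text_fallback(filepath)
    return '\n'.join(p.text for p in doc.paragraphs)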

Reference Files

For detailed information on specific topics, consult these reference files:

Scripts

Available scripts in the scripts/ directory:

  • extract_text.py - Extract text from .docx files
  • convert_format.py - Convert between document formats
  • batch_process.py - Process multiple documents
  • document_stats.py - Generate document statistics
  • repair_document.py - Attempt to repair corrupted documents

Run scripts with appropriate parameters:

python scripts/extract_text.py input.docx output.txt

Getting Help

If you encounter issues not covered in this skill:

  1. Check the relevant reference file
  2. Search for specific error messages
  3. Try alternative approaches
  4. Consider converting to simpler format

Remember: When in doubt, create a backup and work on a copy.
