Install
openclaw skills install redactPrivacy redaction toolkit for images, PDFs, Word documents, and PowerPoint presentations. Use when the user needs to redact, mask, or replace sensitive/private information in files. Triggers: - Redacting or masking sensitive text in images, PDFs, documents, or presentations - Replacing names, phone numbers, IDs, or other PII in files - Processing privacy compliance for documents before sharing - Anonymizing content in visual files Supported formats: png/jpg images, PDF, docx/doc, pptx/ppt
openclaw skills install redactPrivacy redaction toolkit using PPStructureV3 OCR for text detection and replacement.
| Script | Format | Command |
|---|---|---|
read.py | Images / PDF / Word / PowerPoint | read.py <input> [--info] [--mode json] |
redact-image.py | Images (png, jpg, etc.) | redact-image.py <input> <rules.csv> <output> |
redact-pdf.py | redact-pdf.py <input> <rules.csv> <output> | |
redact-document.py | Word (docx, doc) | redact-document.py <input> <rules.csv> <output> |
redact-presentation.py | PowerPoint (pptx, ppt) | redact-presentation.py <input> <rules.csv> <output> |
target_text,replacement_text
张三,李四
手机号,
身份证号,
| Rule | Effect |
|---|---|
原文本,新文本 | Replace with new text |
原文本, | Empty = mask with █ (documents) or solid color block (images/PDF) |
| Format | Empty Replacement |
|---|---|
| Images, PDF | Solid color block overlay |
| Word, PowerPoint | █ characters (same length as target) |
read.py supports:
--info structured output:
<page index="..."><image format='markdown' type='ocr'>...</image> for OCR text extracted from imagesDocument-like files (pdf, docx, doc, pptx) output:
{
"type": "pptx",
"pages": [
{
"page_index": 1,
"content": [
{ "type": "text", "text": "..." },
{ "type": "image", "text": "ocr text..." }
]
}
]
}
Image files output:
{
"type": "image",
"content": "..."
}
| Feature | Image | Document | Presentation | |
|---|---|---|---|---|
| Read text | ✅ | ✅ | ✅ | ✅ |
| JSON output | ✅ | ✅ | ✅ | ✅ |
| Text replacement | ✅ | ✅ | ✅ | ✅ |
| Solid color mask | ✅ | ✅ | - | - |
| █ character mask | - | - | ✅ | ✅ |
| OCR detection | ✅ | ✅ | ✅ (images) | ✅ (images) |
| Tables | - | ✅ | ✅ | ✅ |
| Headers/Footers | - | ✅ | ✅ | - |
| Embedded images | - | ✅ | ✅ | ✅ |
使用 uv 安装依赖:
# 进入 skill 目录
cd skills/redact
# 同步依赖(自动创建虚拟环境并安装)
uv sync