file-processor
Automatically detects and processes files including PDF, Excel, CSV, Word, images, and text for extraction, OCR, data analysis, and summarization.
MIT-0 · Free to use, modify, and redistribute. No attribution required.
⭐ 0 · 598 · 13 current installs · 13 all-time installs
MIT-0
Security Scan
OpenClaw
Benign
high confidencePurpose & Capability
Name/description match the actual code and README: processor.py implements PDF, Excel, CSV, DOCX, TXT, and image OCR processing using the listed Python libraries. No unrelated services, credentials, or binaries are requested.
Instruction Scope
SKILL.md instructions are scoped to installing the listed Python libraries and sending files for processing. The runtime code reads only the provided file path and does not access other system files, environment variables, or external endpoints.
Install Mechanism
This is an instruction-only skill (no automated install). SKILL.md recommends pip installs from PyPI (pdfplumber, openpyxl, python-docx, pytesseract, pillow, and SKILL.py uses pandas). Installing packages from PyPI is expected for this functionality; there is no download-from-arbitrary-URL or archive extraction in the manifest.
Credentials
No environment variables, credentials, or config paths are requested. The libraries used are appropriate for the stated tasks. There are no unexpected secret accesses.
Persistence & Privilege
always:false and disable-model-invocation:false (normal). The skill does not request persistent or system-wide changes, nor does it modify other skills' configs.
Assessment
This skill appears to do what it claims: local parsing, OCR, and summaries of common file types, with no network calls or credential use. Before installing or running it: 1) install dependencies in an isolated environment (virtualenv/container) because some packages (e.g., pdf parsers, image libraries) can be exposed by maliciously crafted files; 2) note that OCR requires Tesseract to be installed on the host (SKILL.md hints at this but doesn't automate it); 3) avoid processing highly sensitive files unless you trust the skill source (owner is unknown); 4) if you need higher assurance, review or run the included processor.py in a sandbox to confirm behavior. Overall the package is internally consistent and coherent with its description.Like a lobster shell, security has layers — review code before you run it.
Current versionv1.0.0
Download ziplatest
License
MIT-0
Free to use, modify, and redistribute. No attribution required.
SKILL.md
文件处理技能
自动识别和处理用户发送的文件。
支持格式
- PDF (pdf)
- Excel (xlsx, xls)
- CSV (csv)
- Word (docx)
- 图片 (jpg, png, jpeg)
- 文本 (txt)
功能
1. PDF 读取
提取文字、表格、页码信息
2. Excel/CSV 处理
读取数据、统计、筛选
3. OCR 文字识别
图片转文字
4. 文档摘要
长文本自动摘要
依赖
pip install pdfplumber openpyxl python-docx pytesseract pillow
使用方法
直接发送文件给小乡,根据格式自动处理。
Files
2 totalSelect a file
Select a file to preview.
Comments
Loading comments…
