Image OCR Reader

v1.0.0

Extract text from images using OCR with support for Chinese and English in common formats like jpg, png, and jpeg.

Security Scan
VirusTotal: Benign
OpenClaw: Benign (high confidence)
Purpose & Capability
Name/description, SKILL.md, _meta.json, and the included Python code all implement local OCR using Tesseract (pytesseract + Pillow). Required binaries (tesseract) and Python packages match the stated purpose.
Instruction Scope
Runtime instructions and code only read the user-provided image file and optionally write a user-specified output file. There are no references to external endpoints, environment secrets, or unrelated system paths in the SKILL.md or code.
Install Mechanism
There is no automatic install spec (instruction-only for system/package installs). The SKILL.md instructs the user to install system tesseract and pip packages manually. This is low-risk but means the user must install external binaries (and Tesseract language data for Chinese) themselves.
Credentials
The skill requests no environment variables, credentials, or config paths. The needed components (tesseract binary, pytesseract, Pillow) are directly related to OCR functionality.
Persistence & Privilege
The skill does not request persistent presence (always:false) and does not alter other skills or global agent configuration. It only writes to a user-specified output file if requested.
Assessment
This skill appears to do only local OCR with Tesseract and Pillow. Before installing or running it:

1) Manually install the tesseract binary and the appropriate language data (chi_sim) as the SKILL.md directs; without the Chinese traineddata, Chinese OCR will fail.
2) Install the Python packages (pytesseract, Pillow) in a controlled environment (virtualenv).
3) Note that --output writes to whatever path you provide; avoid running with paths you don't trust.
4) The code has no networking or secret access, so it won't exfiltrate data, but treat untrusted images cautiously (Pillow vulnerabilities are possible in edge cases).
5) If you plan to use the detailed mode, test it first: the code returns confidence values as strings, which may cause minor formatting issues when printing. This is a quality issue, not a malicious one.
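The string-confidence issue noted in the assessment can be worked around on the caller's side. The sketch below is not the skill's own code; it assumes the detailed mode returns data shaped like the dictionary that pytesseract.image_to_data produces with output_type=Output.DICT (parallel "text" and "conf" lists). The helper name normalize_confidences and the sample data are hypothetical.

```python
def normalize_confidences(data):
    """Pair each recognized word with a float confidence.

    `data` is assumed to hold parallel lists under "text" and "conf",
    where "conf" entries may be strings, ints, or floats.
    """
    words = []
    for text, conf in zip(data["text"], data["conf"]):
        try:
            score = float(conf)  # accepts "96", "88.5", 96 and 88.5 alike
        except (TypeError, ValueError):
            continue  # skip values that are not numeric at all
        # Tesseract reports -1 for layout entries that carry no word.
        if score < 0 or not text.strip():
            continue
        words.append((text, score))
    return words


# Hypothetical sample in the assumed shape.
sample = {"text": ["", "Hello", "世界"], "conf": ["-1", "96", "88.5"]}
for word, score in normalize_confidences(sample):
    print(f"{word}\t{score:.1f}")
```

Converting to float before formatting means a format spec such as `:.1f` works regardless of whether the values arrive as strings or numbers.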

Like a lobster shell, security has layers — review code before you run it.

latestvk9737626ary3wqhtyzzftar1f1826tgt
658 downloads
0 stars
1 version
Updated 1mo ago
v1.0.0
MIT-0

Image OCR Reader

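A skill that extracts text from images, with support for Chinese and English recognition.

Features

  • Extracts text from image files (OCR)
  • Recognizes mixed Chinese and English text
  • Uses the Tesseract OCR engine
  • Works with common image formats such as jpg, png, and jpeg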

Dependencies

  • Python 3
  • tesseract-ocr
  • pytesseract
  • Pillow

Installation

1. Install system dependencies

# Ubuntu/Debian
sudo apt-get install tesseract-ocr

# macOS
brew install tesseract

# CentOS/RHEL
sudo yum install tesseract

2. Install Python dependencies

pip install pytesseract Pillow
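After both installation steps, it may help to confirm that the tesseract binary is on PATH and that the language data needed for Chinese and English is present. The following is a minimal sketch using only the standard library; it relies on the `tesseract --list-langs` command, and the helper names parse_langs and missing_languages are hypothetical, not part of the skill.

```python
import shutil
import subprocess


def parse_langs(output):
    """Return the language codes found in `tesseract --list-langs` output."""
    langs = []
    for line in output.splitlines():
        line = line.strip()
        # The first line is a header such as 'List of available languages ...'.
        if not line or line.lower().startswith("list of"):
            continue
        langs.append(line)
    return langs


def missing_languages(required=("chi_sim", "eng")):
    """Return the required languages that Tesseract does not report."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract binary not found on PATH")
    result = subprocess.run(
        ["tesseract", "--list-langs"],
        capture_output=True, text=True, check=False,
    )
    # Read both streams in case the list is written to stderr.
    installed = parse_langs(result.stdout + "\n" + result.stderr)
    return [lang for lang in required if lang not in installed]
```

Calling missing_languages() returns an empty list when both chi_sim and eng are installed; a result containing chi_sim suggests the Chinese traineddata still needs to be installed separately.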

Usage

Command line

python3 image_ocr_reader.py --file /path/to/image.jpg
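The script's argument parser is not shown on this page. As a rough illustration of how the --file flag above and the --output flag mentioned in the security assessment could be declared, here is a sketch with argparse; the help strings and the build_parser name are assumptions, not the skill's actual code.

```python
import argparse


def build_parser():
    """Build a parser with the two flags this page mentions."""
    parser = argparse.ArgumentParser(
        description="Extract text from an image with OCR."
    )
    parser.add_argument("--file", required=True,
                        help="path to the input image")
    parser.add_argument("--output",
                        help="optional path to write the extracted text to")
    return parser


# Parse an explicit argument list instead of sys.argv for demonstration.
args = build_parser().parse_args(["--file", "/path/to/image.jpg"])
print(args.file, args.output)
```

When --output is omitted, args.output is None, which a script can treat as "print to standard output".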

Python API

from image_ocr_reader import extract_text

text = extract_text("/path/to/image.jpg")
print(text)
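The body of extract_text is not included here. For readers who want a sense of what such a function typically involves, the sketch below shows one plausible shape using pytesseract.image_to_string and Pillow's Image.open. The name extract_text_sketch, the build_lang helper, and the default language tuple are hypothetical; the real function may differ.

```python
from pathlib import Path


def build_lang(languages=("chi_sim", "eng")):
    """Join language codes the way Tesseract expects, e.g. 'chi_sim+eng'."""
    return "+".join(languages)


def extract_text_sketch(image_path, languages=("chi_sim", "eng")):
    """Run Tesseract on one image and return the recognized text."""
    path = Path(image_path)
    if not path.is_file():
        raise FileNotFoundError(path)
    # Imported here so the checks above run even without the OCR packages.
    import pytesseract
    from PIL import Image

    with Image.open(path) as image:
        text = pytesseract.image_to_string(image, lang=build_lang(languages))
    return text.strip()
```

Passing "chi_sim+eng" as the language string asks Tesseract to consider both models, which is the usual way to handle mixed Chinese and English text.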

Example output

Given an image that contains text, the skill outputs the extracted text.


Credits

  • Version: 1.0.0
  • Author: OpenClaw
  • License: MIT
