Skill flagged — suspicious patterns detected

ClawHub Security flagged this skill as suspicious. Review the scan results before using.

Tesseract OCR Text Recognition (Tesseract OCR文字识别)

v1.0.1

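An OCR text-recognition skill built on the Tesseract engine. It supports three language modes (Chinese, English, and mixed Chinese-English) and three output formats: text, structured, and question_answer.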

by flyingants @tom859174-sketch

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for tom859174-sketch/tesseract-ocr-skill.

Install the skill "Tesseract OCR文字识别" (tom859174-sketch/tesseract-ocr-skill) from ClawHub.
Skill page: https://clawhub.ai/tom859174-sketch/tesseract-ocr-skill
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install tesseract-ocr-skill

ClawHub CLI


npx clawhub@latest install tesseract-ocr-skill

Security Scan

VirusTotal: Benign
OpenClaw: Suspicious (medium confidence)

Purpose & Capability
The README and code clearly require the Tesseract OCR engine to be installed, but the skill metadata lists no required binaries/config paths. SKILL.json describes TCM (traditional Chinese medicine) material focus while SKILL.md is more generic; version in SKILL.json (1.0.0) differs from registry (1.0.1). These inconsistencies indicate sloppy packaging and incomplete metadata.
Instruction Scope
SKILL.md instructs running the included Python CLI on a local image path (no external endpoints). The runtime instructions stay within the stated purpose (OCR). However, the included code has issues: a duplicated extract_text_with_tesseract definition (the second uses os.path.exists but os is not imported) and potential import path problems (relative import in tesseract_ocr_skill.py). These will likely cause runtime errors and make actual behavior unpredictable.
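The two code issues named above (a function defined twice, and `os` used without an import) can be caught before running anything. The following is a minimal sketch of such a static check using Python's standard `ast` module. The `SAMPLE` source is a hypothetical stand-in written for illustration; it is not the skill's actual code, and the helper names `duplicate_functions` and `unimported_modules` are assumptions.

```python
import ast
from collections import Counter


def duplicate_functions(source):
    """Return names of top-level functions defined more than once."""
    tree = ast.parse(source)
    names = [node.name for node in tree.body if isinstance(node, ast.FunctionDef)]
    return sorted(name for name, count in Counter(names).items() if count > 1)


def unimported_modules(source, modules=("os",)):
    """Return modules from `modules` used as `module.attr` but never imported."""
    tree = ast.parse(source)
    imported = set()
    for node in ast.walk(tree):
        if isinstance(node, ast.Import):
            for alias in node.names:
                # `import os.path` binds the name `os`
                imported.add(alias.asname or alias.name.split(".")[0])
        elif isinstance(node, ast.ImportFrom):
            for alias in node.names:
                imported.add(alias.asname or alias.name)
    used = {
        node.value.id
        for node in ast.walk(tree)
        if isinstance(node, ast.Attribute) and isinstance(node.value, ast.Name)
    }
    return sorted(name for name in modules if name in used and name not in imported)


# Hypothetical stand-in for the flagged file, not the skill's real source.
SAMPLE = '''
def extract_text_with_tesseract(image_path):
    return image_path

def extract_text_with_tesseract(image_path):
    if not os.path.exists(image_path):
        raise FileNotFoundError(image_path)
    return image_path
'''

print(duplicate_functions(SAMPLE))
print(unimported_modules(SAMPLE))
```

This is a heuristic: it only inspects top-level definitions and attribute access on bare names, so it may miss or over-report cases that a full linter would handle.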
Install Mechanism
There is no install spec (instruction-only + bundled scripts). This is low-risk from an installer perspective: nothing is downloaded at install time. The skill does expect an external system dependency (the Tesseract engine), but that dependency is not declared in the install metadata.
Credentials
The skill requests no environment variables or credentials, and the code does not access environment secrets or external network endpoints. This is proportionate for a local OCR tool. Note: the missing declaration of the required Tesseract binary reduces clarity but is not a credential concern.
Persistence & Privilege
The skill does not request persistent/always-on presence; default invocation settings are used (agent may invoke autonomously). The skill does not attempt to modify other skill configs or system-wide settings.
What to consider before installing
This skill appears to be a local Tesseract OCR tool and does not contact external servers or ask for credentials, but there are important practical issues to consider before installing:

  • Install Tesseract on the host first (the SKILL.md mentions it, but the skill's metadata did not declare this dependency). Without the Tesseract binary the skill will fail.
  • The included Python has bugs: a duplicated function, a missing import (os) and a relative import that may fail when run as a script. Expect runtime errors unless the code is fixed.
  • Because the package metadata is inconsistent (version mismatch, differing descriptions), treat this as low-quality packaging rather than a polished official skill.
  • Run and review the code in an isolated/sandboxed environment before enabling autonomous invocation. Test the CLI on non-sensitive images to confirm behavior.

If you need a stable OCR skill, prefer one that explicitly declares system dependencies (Tesseract binary) in metadata, has matching versions, and has been tested to run without code errors.

Like a lobster shell, security has layers — review code before you run it.

Tags: image, latest, ocr, openclaw-skill, tesseract, text-extraction
232 downloads
0 stars
2 versions
Updated 1mo ago
v1.0.1
MIT-0

Tesseract OCR Skill

Extracts text from images using the Tesseract engine.

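Features

  • Supports three language modes: Simplified Chinese, English, and mixed Chinese-English
  • Supports three output formats:
    • text: plain-text output
    • structured: structured output (identifies questions, options, and answers)
    • question_answer: question-answer pair format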
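The listing does not show how the structured and question_answer formats are derived from the raw OCR text. As an illustration only, the sketch below assumes a simple layout (numbered questions, lettered options A to D, and an "Answer:" or "答案:" line) and parses it with regular expressions. The function names, the regexes, and the output shape are all assumptions, not the skill's actual implementation.

```python
import re

# Assumed line layouts; real OCR output may need looser patterns.
QUESTION_RE = re.compile(r"^\s*(\d+)[\.、．)]\s*(.+)$")
OPTION_RE = re.compile(r"^\s*([A-D])[\.、．)]\s*(.+)$")
ANSWER_RE = re.compile(r"^\s*(?:答案|Answer)\s*[::]\s*([A-D])\s*$", re.IGNORECASE)


def parse_structured(ocr_text):
    """Group OCR lines into questions with their options and answer letter."""
    questions = []
    current = None
    for line in ocr_text.splitlines():
        if m := ANSWER_RE.match(line):
            if current is not None:
                current["answer"] = m.group(1).upper()
        elif m := OPTION_RE.match(line):
            if current is not None:
                current["options"][m.group(1)] = m.group(2).strip()
        elif m := QUESTION_RE.match(line):
            current = {
                "number": int(m.group(1)),
                "question": m.group(2).strip(),
                "options": {},
                "answer": None,
            }
            questions.append(current)
    return questions


def to_question_answer(questions):
    """Reduce structured questions to (question, answer text) pairs."""
    return [(q["question"], q["options"].get(q["answer"])) for q in questions]


sample = "1. Which engine does the skill use?\nA. Tesseract\nB. EasyOCR\nAnswer: A\n"
parsed = parse_structured(sample)
print(parsed)
print(to_question_answer(parsed))
```

Lines that match none of the patterns are ignored here, which keeps the sketch short but would drop multi-line questions; a real parser would likely need to handle those and OCR noise.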

Usage

python <skill-path>/tesseract_ocr_skill.py <image_path> --lang chi_sim+eng --format text

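Parameters

| Parameter | Description | Default |
|---|---|---|
| image_path | Path to the image file | Required |
| --lang | Recognition language | chi_sim+eng |
| --format | Output format | text |
| --output | Output file path | Console output |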
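The documented parameters map naturally onto a standard `argparse` parser. The sketch below mirrors the parameter list and defaults given above; the skill's real parser is not shown in this listing, so `build_parser` and the `output_format` attribute name are assumptions made for illustration.

```python
import argparse


def build_parser():
    """Build a parser that mirrors the documented CLI parameters."""
    parser = argparse.ArgumentParser(
        description="Extract text from an image with Tesseract."
    )
    parser.add_argument("image_path", help="path to the image file (required)")
    parser.add_argument(
        "--lang",
        default="chi_sim+eng",
        help="Tesseract language code(s), joined with '+'",
    )
    parser.add_argument(
        "--format",
        dest="output_format",  # avoid shadowing the built-in format()
        choices=("text", "structured", "question_answer"),
        default="text",
        help="output format",
    )
    parser.add_argument(
        "--output",
        default=None,
        help="output file path; prints to the console when omitted",
    )
    return parser


args = build_parser().parse_args(["scan.png", "--format", "structured"])
print(args.image_path, args.lang, args.output_format, args.output)
```

Restricting `--format` with `choices` makes an unsupported format fail at argument parsing rather than partway through OCR.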

Dependencies

  • Python 3.x
  • pytesseract
  • Pillow
  • Tesseract OCR engine (must be installed separately)
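Because the Tesseract engine is a separate system install, a quick pre-flight check can save a confusing runtime failure. The sketch below uses only the standard library to look for the two Python packages and for a `tesseract` executable on PATH. The function name `missing_dependencies` is an assumption; the check also assumes the binary is named `tesseract` and is reachable through PATH, which may not hold on every Windows setup.

```python
import importlib.util
import shutil

REQUIRED_MODULES = ("pytesseract", "PIL")  # Pillow is imported as PIL
REQUIRED_BINARY = "tesseract"              # assumed executable name on PATH


def missing_dependencies(modules=REQUIRED_MODULES, binary=REQUIRED_BINARY):
    """Return human-readable descriptions of dependencies that were not found."""
    missing = [
        f"python module: {name}"
        for name in modules
        if importlib.util.find_spec(name) is None
    ]
    if shutil.which(binary) is None:
        missing.append(f"executable on PATH: {binary}")
    return missing


if __name__ == "__main__":
    problems = missing_dependencies()
    if problems:
        print("Missing: " + ", ".join(problems))
    else:
        print("All dependencies found.")
```

If the engine is installed but not on PATH, pytesseract also lets you point at the executable explicitly through `pytesseract.pytesseract.tesseract_cmd`.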

Installing Tesseract

Windows: download the installer from https://github.com/UB-Mannheim/tesseract/wiki
