Image OCR Parse

v1.0.2

Extract text from images via the PDFAPIHub cloud OCR API. Images are uploaded to pdfapihub.com for Tesseract OCR processing. Supports preprocessing (grayscal...

1· 89·1 current·1 all-time
byRishabh Dugar@rishabhdugar

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for rishabhdugar/image-ocr-parse.

Previewing Install & Setup.
Prompt PreviewInstall & Setup
Install the skill "Image OCR Parse" (rishabhdugar/image-ocr-parse) from ClawHub.
Skill page: https://clawhub.ai/rishabhdugar/image-ocr-parse
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Required env vars: PDFAPIHUB_API_KEY
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install image-ocr-parse

ClawHub CLI

Package manager switcher

npx clawhub@latest install image-ocr-parse
Security Scan
Capability signals
Requires sensitive credentials
These labels describe what authority the skill may exercise. They are separate from suspicious or malicious moderation verdicts.
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
Name/description and declared requirements match: the skill calls a hosted OCR API (pdfapihub.com) and requires PDFAPIHUB_API_KEY. There are no unrelated env vars or binaries requested.
Instruction Scope
SKILL.md explicitly instructs uploading images (image_url, base64_image, or multipart file) to https://pdfapihub.com/api and sending the API key in the CLIENT-API-KEY header. It does not instruct reading unrelated files, environment variables, or system state. It also contains a privacy warning about uploads.
Install Mechanism
No install spec or code files; instruction-only skill. No downloads or archive extraction are performed by the skill itself.
Credentials
Only a single API key (PDFAPIHUB_API_KEY) is required, which is proportional to contacting the external OCR service. The primaryEnv matches the header name documented in SKILL.md/skill.json.
Persistence & Privilege
always is false and the skill is user-invocable. It does not request persistent or cross-skill configuration changes and does not demand elevated platform privileges.
Assessment
This skill is coherent but remember it sends your images and the extracted text to a third party (pdfapihub.com). Do not upload sensitive images (IDs, passports, medical records, passwords) unless you trust the provider and their retention policy (the SKILL.md claims files are auto-deleted after 30 days). Before installing or using: verify the service domain and privacy/terms pages, use a dedicated API key with limited scope if possible, monitor for unexpected billing, rotate the key if it is ever exposed, and consider local/offline OCR if you need stronger privacy guarantees.

Like a lobster shell, security has layers — review code before you run it.

Runtime requirements

EnvPDFAPIHUB_API_KEY
Primary envPDFAPIHUB_API_KEY
latestvk975xdhzfmt8zrsca2f5v1h789852sa7
89downloads
1stars
3versions
Updated 1w ago
v1.0.2
MIT-0

Image OCR Parse

What It Does

Extracts text from images via the PDFAPIHub hosted OCR API. Your image is uploaded to PDFAPIHub servers where Tesseract OCR processes it and returns the extracted text. Supports optional image preprocessing to improve OCR quality on low-resolution or noisy inputs.

When to Use

  • Extract text from photos of receipts, signs, or documents
  • OCR business cards, ID cards, or labels
  • Process low-quality images with preprocessing

Required Inputs

Provide one of:

  • image_url — URL to an image
  • base64_image — base64-encoded image
  • Multipart upload with file field

Authentication

This skill calls the PDFAPIHub hosted API at https://pdfapihub.com/api. Your image is uploaded to PDFAPIHub servers for OCR processing.

Send your API key in the CLIENT-API-KEY header.

Get your free API key at https://pdfapihub.com. Full API documentation is available at https://pdfapihub.com/docs.

Privacy note: Images you process are uploaded to PDFAPIHub's cloud service and the extracted text is returned. Do not send sensitive images unless you trust the service. Files are auto-deleted after 30 days.

Use Cases

  • Receipt Scanning — Extract text from receipt photos for expense tracking
  • Business Card Reader — OCR business card images to extract name, phone, email
  • License Plate Recognition — Extract plate numbers from photos (with char_whitelist)
  • Meter Reading — Extract digits from utility meter photos for automated logging
  • Whiteboard Capture — OCR whiteboard or handwritten note photos into text
  • Product Label Scanning — Extract ingredient lists or nutrition info from product label photos

Image Preprocessing Options

ParamDefaultDescription
grayscalefalseConvert to grayscale
sharpenfalseApply sharpening
threshold0Binarization threshold (1–255)
resize0Scale factor (max 4x)

Example Usage

curl -X POST https://pdfapihub.com/api/v1/image/ocr/parse \
  -H "CLIENT-API-KEY: your_api_key" \
  -H "Content-Type: application/json" \
  -d '{
    "image_url": "https://pdfapihub.com/sample-invoicepage.png",
    "lang": "eng",
    "grayscale": true,
    "sharpen": true,
    "detail": "words"
  }'

Comments

Loading comments...