Vlm Image Helper

v0.1.0

Visual inspection helper for VLM and OCR workflows. Use when agent needs to help a vision model see an image more clearly before re-analysis: rotate misalign...

2· 107·0 current·0 all-time
MIT-0
Download zip
LicenseMIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
Name/description (VLM/OCR preprocessing) match the provided CLI, SKILL.md, README and the included script: rotation, semantic cropping, scaling and enhancement are implemented and expected for this purpose.
Instruction Scope
SKILL.md limits scope to minimal transformations for re-analysis and documents input/output rules. The runtime instructions only reference the included script and local image inputs/outputs (file paths, data URIs, base64) — no directives to read unrelated files or send data externally.
Install Mechanism
No install spec in registry; the code is instruction-only plus a small Python script. The only external dependency is Pillow (pip), which is reasonable and documented. No downloads from unknown URLs or archive extraction are used.
Credentials
The skill requires no environment variables, no credentials, and no config paths. The script uses only local temp files and in-memory base64 — proportional to the stated functionality.
Persistence & Privilege
Skill is not always-enabled and uses no privileged agent APIs or modifications to other skills. It writes only its own temporary output files when asked and returns base64 on demand.
Assessment
This appears to be a focused, local image-preprocessing helper suitable for VLM/OCR workflows. Before installing or running: (1) review the full script if you will process sensitive images (it operates locally and returns base64 or files, but you should avoid pasting secrets into command arguments), (2) install Pillow from the official PyPI source (pip install Pillow) or via your vetted package manager, and (3) confirm your agent won't forward image data to external services unless you intend that. If you need a deeper audit, provide the remainder of scripts/image_helper.py for a line-by-line review.

Like a lobster shell, security has layers — review code before you run it.

latestvk977cdhsvgj03s3586wts97tn98331gg

License

MIT-0
Free to use, modify, and redistribute. No attribution required.

Comments