Ocr Scanner Image

Security checks across malware telemetry and agentic risk

Overview

The skill mostly performs OCR, but it claims local-only Tesseract processing while also shipping a runnable helper and reference docs that send images or URLs to OCR.space.

Review before installing. Use scripts/ocr.sh only if you want local Tesseract OCR, and avoid scripts/example.py unless you are comfortable sending the selected image or URL to OCR.space. Do not use the cloud helper for sensitive documents unless the package is updated to clearly disclose the data flow and require opt-in consent.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access

Findings (8)

Description-Behavior Mismatch

Medium

Confidence: 95% confidence
Finding: The reference documentation conflicts with the skill metadata by describing a third-party OCR API instead of local-only processing via the system tesseract binary. This can mislead maintainers, reviewers, or downstream agents into implementing or invoking remote processing, creating undisclosed data egress and breaking the expected trust boundary for local OCR.

Context-Inappropriate Capability

Medium

Confidence: 93% confidence
Finding: The documentation introduces remote network OCR capability that is not necessary for a skill described as local tesseract-based OCR. In a security-sensitive agent setting, undocumented expansion from local file processing to network transmission materially increases attack surface and may cause sensitive images or derived text to be sent off-host.

Description-Behavior Mismatch

High

Confidence: 99% confidence
Finding: The skill metadata says OCR is performed locally with the system's tesseract binary, but the implementation actually sends images or URLs to a third-party service. This creates a material trust-boundary violation: users may provide sensitive local files expecting offline processing, while the code exfiltrates that data externally.

Description-Behavior Mismatch

Medium

Confidence: 95% confidence
Finding: The manifest describes OCR on image files, but the code also accepts arbitrary remote URLs. That expands the skill's behavior beyond its declared scope and can cause unintended external fetching and processing of attacker-controlled resources, increasing privacy and abuse risk.

Context-Inappropriate Capability

High

Confidence: 99% confidence
Finding: The code is hardwired to make outbound requests to ocr.space even though the skill is described as using a local OCR binary. This can leak sensitive image contents or internal-only URLs to a third party and violates the principle of least surprise for a supposedly local-processing skill.

Intent-Code Divergence

Medium

Confidence: 89% confidence
Finding: The module docstring openly states behavior that contradicts the manifest, indicating the skill package presents inconsistent security-relevant claims. This mismatch makes operator review and user trust decisions unreliable, which is especially dangerous because the actual behavior involves external data transmission.

Missing User Warnings

Medium

Confidence: 97% confidence
Finding: The documentation describes uploading local image files or image URLs to a third-party OCR service without any privacy warning or consent language. Because images often contain sensitive personal, financial, or proprietary information, this omission can lead users or agents to exfiltrate confidential data to an external provider unexpectedly.

Missing User Warnings

Medium

Confidence: 97% confidence
Finding: The request uploads local file contents or user-supplied URLs to an external OCR provider without any explicit warning, consent flow, or indication of data handling. In the context of an OCR skill, inputs often contain sensitive documents, making silent transmission a meaningful privacy and compliance risk.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal