Doc Ocr Skills

Security checks across malware telemetry and agentic risk

Overview

This is a coherent OCR skill, but users should treat its downloaded binary installer and optional Gemini cloud processing as important trust and privacy considerations.

Install only if you trust the publisher and the GitHub release binary. Safer choices are to inspect the installer first, build from source, or use verified checksums if the publisher provides them. Use RapidOCR or PaddleOCR for private documents; use Gemini only when uploading the selected document contents to Google’s cloud service is acceptable, and protect the API key stored in ~/.ocr/config.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Findings (7)

Lp3

Medium
Category
MCP Least Privilege
Confidence
88% confidence
Finding
The skill invokes shell commands and an installer script, but it does not declare any permissions or operational risks to the user. This creates a trust gap: consumers may believe the skill only performs OCR when it also requires command execution and local system modification, increasing the chance of unintended execution in sensitive environments.

Tp4

High
Category
MCP Tool Poisoning
Confidence
95% confidence
Finding
The documented purpose is OCR, but the installation path includes downloading an executable binary, detecting host OS/architecture, and installing software locally. That mismatch is security-relevant because users may not expect network retrieval and execution of platform-specific binaries, which materially expands the attack surface and supply-chain risk.

Description-Behavior Mismatch

Medium
Confidence
90% confidence
Finding
The installer downloads a platform-specific executable from a remote GitHub release and installs it locally without any integrity verification such as a checksum or signature check. Even if the OCR purpose may justify installing a helper binary, the lack of trust validation means a compromised release asset, repository, or network path could result in arbitrary code execution on the host.

Context-Inappropriate Capability

Medium
Confidence
92% confidence
Finding
This script fetches an executable from GitHub and writes it into the skill directory, creating a supply-chain risk because the downloaded file is trusted implicitly. In the context of an OCR skill, downloading a native OCR utility can be functionally relevant, but doing so without provenance checks, size/type validation, or checksum verification makes the installation path unsafe.

Missing User Warnings

Medium
Confidence
93% confidence
Finding
The README promotes a cloud OCR engine but does not warn users that document contents will be transmitted to an external service for processing. For OCR workloads, inputs commonly contain sensitive personal, financial, legal, or proprietary information, so omission of a privacy/data-handling warning can cause users to expose confidential documents unintentionally.

Missing User Warnings

Medium
Confidence
94% confidence
Finding
The README promotes a cloud OCR engine but does not clearly warn that document contents will be transmitted to an external Gemini API for processing. For OCR on scanned PDFs and images, this can expose sensitive personal, financial, legal, or proprietary data, so missing disclosure creates a real privacy and compliance risk even if the behavior is expected by the software design.

Missing User Warnings

Medium
Confidence
92% confidence
Finding
The Gemini engine sends document contents to a cloud service, but the skill does not clearly warn users of that data flow in the description or quick-start guidance. This can lead to accidental disclosure of sensitive documents, especially if users assume all OCR modes are local because PaddleOCR and RapidOCR are local options.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal