Screenshot Ocr

Security checks across malware telemetry and agentic risk

Overview

This is a straightforward local screenshot OCR helper, with normal privacy cautions because it can read clipboard images and copy or save recognized text.

Install only if you are comfortable running a local Python OCR script and installing Tesseract/pytesseract/Pillow from trusted sources. Avoid using it on screenshots containing passwords, private messages, payment details, or verification codes unless you intend that text to appear in the terminal, clipboard, or a saved file; delete sensitive saved outputs when finished.

SkillSpector

By NVIDIA

Vulnerability Patterns

Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
Behavioral ASTexec() Call, eval() Call, Dynamic Import
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration

Findings (5)

subprocess module call

Medium

Category: Dangerous Code Execution
Content: try: # 尝试使用 pbpaste (macOS) subprocess.run(["pbpaste", ">", "/tmp/clipboard.png"], shell=True, capture_output=True) if os.path.exists("/tmp/clipboard.png"): return "/tmp/clipboard.png" except:
Confidence: 96% confidence
Finding: subprocess.run(["pbpaste", ">", "/tmp/clipboard.png"], shell=True, capture_output=True)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 86% confidence
Finding: The skill invokes shell commands and advertises file-writing behavior, but it does not declare corresponding permissions or safety boundaries. This creates a trust gap where an agent or user may not realize the skill can write files or execute local commands, increasing the chance of unintended modification of the host environment.

Vague Triggers

Medium

Confidence: 80% confidence
Finding: The trigger phrase "OCR" is so broad that it can match many unrelated requests, causing this skill to activate outside its intended screenshot-specific context. Over-broad routing can expose clipboard, image, file-save, or shell-backed behavior when the user did not specifically request this skill.

Vague Triggers

Medium

Confidence: 75% confidence
Finding: The trigger "屏幕识别" is ambiguous and could be interpreted as general screen analysis rather than static OCR of a user-provided screenshot. In context, that ambiguity is more dangerous because the skill handles potentially sensitive on-screen content and may copy or save extracted text.

Missing User Warnings

Medium

Confidence: 89% confidence
Finding: The skill explicitly supports copying recognized text to the clipboard and saving it to a TXT file, but it provides no warning about handling sensitive data, clipboard exposure, or accidental overwrite of existing files. OCR commonly captures secrets, personal data, or one-time codes from screenshots, so silent persistence or clipboard placement can leak information beyond the user's intent.

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal