Vision Helper — AI Image Analysis

Security checks across malware telemetry and agentic risk

Overview

This skill does what it says, but users should treat screenshots and images as sensitive because they are sent to the configured Ollama or cloud vision endpoint.

Install only if you are comfortable with a helper that can read image paths you provide and send those images plus prompts to the configured Ollama endpoint. Keep OLLAMA_API_URL on localhost for private images, avoid screenshots containing passwords or confidential data, and use cloud models only when the image is safe to share with that provider.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
Taint TrackingDirect Taint Flow, Variable-Mediated Taint Flow, Credential Exfiltration Chain
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access

Findings (4)

Tainted flow: 'req' from os.environ.get (line 102, credential/environment) → urllib.request.urlopen (network output)

Critical

Category: Data Flow
Content: ) try: resp = urllib.request.urlopen(req, timeout=timeout) result = json.loads(resp.read().decode()) content = result.get("message", {}).get("content", "") if not content:
Confidence: 96% confidence
Finding: resp = urllib.request.urlopen(req, timeout=timeout)

Vague Triggers

Medium

Confidence: 96% confidence
Finding: The trigger list includes the generic everyday term "screenshot," which can cause the skill to activate in many unrelated conversations. Because this skill encourages executing local screenshot capture and image-analysis commands, accidental invocation can lead to unintended processing of sensitive screen contents and broader-than-expected tool use.

Missing User Warnings

High

Confidence: 98% confidence
Finding: The skill promotes use of cloud vision models and explicitly highlights bypassing built-in restrictions, but it does not clearly warn that images may be transmitted to remote services. This is dangerous because screenshots and desktop captures often contain credentials, personal data, internal documents, or other sensitive information that a user may assume remains local.

Missing User Warnings

Medium

Confidence: 92% confidence
Finding: The script sends the entire image to the configured vision API but does not provide an execution-time warning or confirmation when that destination may be remote. In an agent/automation setting, users may assume local processing while screenshots or other sensitive images are actually transmitted to a cloud or attacker-controlled service.

VirusTotal

63/63 vendors flagged this skill as clean.

View on VirusTotal