Video Insight

Security checks across malware telemetry and agentic risk

Overview

This looks like a legitimate video transcript tool, but it needs Review because it can automatically use local Chrome browser cookies for Bilibili downloads without clear user consent.

Install only if you are comfortable with a video tool that may touch your Chrome browser session when Bilibili downloads fail. For safer use, remove or disable the --cookies-from-browser chrome retry, avoid sensitive videos unless using --no-cache, clear ~/.cache/video-insight when needed, and use --summarize only with an LLM endpoint and token setup you trust.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands

Findings (6)

Description-Behavior Mismatch

Medium

Confidence: 96% confidence
Finding: When the initial download fails, the code silently expands scope by accessing browser cookies from Chrome to retry the request. That exceeds the stated transcript-extraction role and can expose authenticated session material and private account context without explicit user approval.

Context-Inappropriate Capability

High

Confidence: 99% confidence
Finding: The skill invokes yt-dlp with --cookies-from-browser chrome, causing it to read sensitive browser cookies from the local profile. In an agent/tooling context, this is dangerous because it grants the skill access to authentication artifacts unrelated to transcript extraction and can enable account misuse or privacy compromise.

Vague Triggers

Medium

Confidence: 77% confidence
Finding: The trigger phrases are broad enough to match many ordinary user requests about summaries or transcripts, which can cause the skill to activate when the user did not specifically intend to use it. Because this skill performs network retrieval and local caching, overbroad invocation can lead to unnecessary external requests, unexpected storage of content, and expansion of the skill's operational footprint.

Missing User Warnings

Medium

Confidence: 90% confidence
Finding: The skill states that transcripts are permanently cached locally but does not clearly warn users that extracted content will be retained on disk. Video transcripts may contain sensitive, copyrighted, or user-request-specific material, so silent persistence increases privacy and data-retention risk, especially on shared systems or long-lived agent hosts.

Missing User Warnings

Medium

Confidence: 97% confidence
Finding: Reading browser cookies without any user-facing warning or confirmation creates a hidden sensitive-data access path. In practice, users may not realize the skill can touch local authentication data, making the behavior especially risky in an automation environment.

Missing User Warnings

Medium

Confidence: 89% confidence
Finding: When --summarize is enabled, the full transcript is sent to an LLM via generate_summary() with no visible disclosure in this CLI about external data transmission, retention, or privacy implications. Users may assume summarization is local, but transcripts can contain sensitive or copyrighted content, making silent third-party transmission a meaningful privacy and compliance risk.

VirusTotal

67/67 vendors flagged this skill as clean.

View on VirusTotal