VibeVideoIO AI Script to Video

Security checks across malware telemetry and agentic risk

Overview

The skill appears to do what it claims, but it uses real VibeVideo/Bollo account sessions and sends scripts to that service.

Install only if you are comfortable letting OpenClaw operate your bollo.video or vibevideo.io Studio account. Pick the site explicitly, avoid submitting sensitive scripts unless intended, and use --no-save or logout if you do not want the local session token retained.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands

Findings (13)

Description-Behavior Mismatch

High

Confidence: 96% confidence
Finding: The manifest advertises account-management capabilities such as access overview, sub-user management, and token-policy updates that go materially beyond the stated skill purpose of login, project listing, video creation, and logout. Even if not all are wired into the OpenClaw provider today, defining them in the capability catalog increases the chance of accidental exposure, future overbinding, or unauthorized invocation by another integration path.

Description-Behavior Mismatch

Medium

Confidence: 94% confidence
Finding: The manifest includes generic tool-generation endpoints and a writable service-flag control unrelated to the described script-to-video workflow. This broadens the attack surface and creates confused-deputy risk, where an agent or future provider binding could trigger actions the user did not intend or that bypass product-level separation of duties.

Context-Inappropriate Capability

High

Confidence: 98% confidence
Finding: Sub-user creation, update, and deletion are sensitive administrative actions with clear privilege and account-governance impact, and they are unjustified by the advertised user-facing function of making videos. In an agent context, exposing these capabilities is especially dangerous because natural-language requests can be misinterpreted or abused to alter account access for additional principals.

Context-Inappropriate Capability

High

Confidence: 97% confidence
Finding: A service-flag mutation endpoint allows changing operational behavior and is not aligned with the skill's stated purpose. Exposing configuration toggles through an agent-facing manifest can let users or malicious prompts influence backend state in ways that affect availability, integrity, or rollout posture beyond their own content-generation task.

Vague Triggers

Medium

Confidence: 84% confidence
Finding: Broad invocation language at the metadata level can cause the agent to select this skill for loosely related user requests, even when the user did not clearly ask to access third-party accounts or external video-generation services. In a skill that performs login and project operations, overbroad routing raises the chance of unnecessary credential handling, unintended actions, or user-confusing privilege escalation.

Vague Triggers

Medium

Confidence: 88% confidence
Finding: The 'When To Use' section uses generic phrases like creating or generating an AI video without clear constraints, which can over-match many normal creative requests. Because this skill is connected to login-backed Studio operations, ambiguous activation criteria make unintended external service use more dangerous than in a purely local content-generation tool.

Natural-Language Policy Violations

Medium

Confidence: 90% confidence
Finding: Routing users to a specific site based solely on language input creates implicit account and data-flow decisions without explicit consent. This is risky because the two domains may involve different accounts, privacy expectations, jurisdictions, or backend behavior, and the skill is designed to perform authenticated operations once routed.

Vague Triggers

Medium

Confidence: 90% confidence
Finding: The default prompt includes very broad trigger phrases such as "create a video" and "make me a video," which can cause the skill to be invoked for generic video-related requests without clear user intent to use this specific external service. Because the skill can perform account login and project actions on bollo.video or vibevideo.io, overbroad invocation increases the risk of unintended account access, data exposure, or actions on the wrong platform.

Natural-Language Policy Violations

Medium

Confidence: 95% confidence
Finding: The manifest instructs the agent to choose bollo.video for Chinese requests and vibevideo.io for English requests unless the user explicitly specifies a site, which makes a service-selection decision on the user's behalf. This can route users to an unintended domain, create privacy and consent issues, and lead to account actions or content submission on the wrong service without explicit opt-in.

Natural-Language Policy Violations

Medium

Confidence: 89% confidence
Finding: The manifest description states that Chinese requests default to bollo.video and English requests default to vibevideo.io unless explicitly overridden, which implies routing users to different third-party services based on inferred language rather than explicit user choice. This can create privacy, consent, and expectation issues because users may be sent to a different service endpoint without clear opt-in, and language is an unreliable proxy for jurisdictional, trust, or account preference.

Vague Triggers

Medium

Confidence: 95% confidence
Finding: The invocation rules are broad enough to match ordinary requests like 'make a video for me' or pasted scripts, which can trigger a high-impact skill that performs login and account actions on external services. In an agent environment, overbroad routing increases the chance of unintended activation, causing actions against third-party accounts without sufficiently explicit user intent for authentication, project access, or content submission.

Missing User Warnings

Medium

Confidence: 91% confidence
Finding: The skill states that CAPTCHA images are saved locally and tokens are stored in a local session file, but it does not present this as a clear user-facing warning before login. This creates a privacy and credential-handling risk because users may not realize authentication artifacts and challenge images persist on disk beyond the immediate interaction.

Natural-Language Policy Violations

Medium

Confidence: 90% confidence
Finding: Automatically selecting bollo.video or vibevideo.io based on detected language can send users to a different service domain than they intended without explicit consent. Because this skill performs authentication and content creation, silent site selection increases the risk of credential submission, data exposure, or account actions on the wrong platform.

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal