Doubao API Toolkit

Security checks across malware telemetry and agentic risk

Overview

This appears to be a real Doubao/Volcengine media toolkit, but it can upload user-selected local video files and other prompts/media to a third-party API without a clear privacy warning or point-of-use confirmation.

Install only if you are comfortable sending prompts, image URLs, and any local video files you choose to analyze to Volcengine/Doubao. Avoid confidential, regulated, proprietary, or third-party media unless you have permission, and prefer an updated version that adds explicit upload warnings, consent before local file transmission, and corrected image-to-image documentation.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access

Findings (5)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 95% confidence
Finding: The skill documents use of environment variables and outbound API calls, but it does not declare corresponding permissions. Missing permission declarations undermine least-privilege review and can cause users or platforms to underestimate that the skill can read secrets from the environment and send data over the network. In a toolkit that processes prompts, URLs, and local media for a third-party API, that gap materially affects trust and reviewability.

Tp4

High

Category: MCP Tool Poisoning
Confidence: 90% confidence
Finding: The declared purpose does not accurately match the described functionality: undocumented video analysis, image-to-video generation, and status-checking expand the operational surface, while claimed image-to-image support appears unimplemented. Description-behavior mismatches are dangerous because reviewers and users may approve or invoke a skill under incorrect assumptions about what data it handles and what actions it can perform. Here, the hidden/omitted behaviors all involve external processing of user data and media, increasing the risk of unintended disclosure or misuse.

Description-Behavior Mismatch

Medium

Confidence: 92% confidence
Finding: The skill metadata emphasizes image/video generation and vision analysis, but the code also implements local video analysis by reading a local file, base64-encoding it, and sending the full contents to a remote API. This hidden capability creates a transparency and consent problem because users may not realize local files are being uploaded off-device.

Missing User Warnings

Medium

Confidence: 97% confidence
Finding: The skill explains API-key setup and output storage but does not clearly disclose that prompts, image URLs, local video files, and generated content are sent to the external Doubao/Volcengine ARK service. This is a real security and privacy issue because users may submit sensitive text or local media without understanding that third-party transmission and processing will occur. The context makes this more dangerous because the skill explicitly supports local video analysis, so potentially private local files may be uploaded off-host.

Missing User Warnings

Medium

Confidence: 97% confidence
Finding: The analyze_video function reads an arbitrary local video file and transmits its entire contents to the remote chat/completions API without any warning, consent flow, or data-minimization controls. This can expose sensitive local media, including personal, proprietary, or regulated content, to a third party unexpectedly.

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal