Bilibili Up To Kb

Security checks across malware telemetry and agentic risk

Overview

This skill appears purpose-aligned for building a Bilibili transcript knowledge base, but it uses browser cookies and remote LLM transcript cleaning in ways that are not clearly opt-in or fully disclosed.

Install only if you are comfortable with the skill accessing your Chrome Bilibili session cookies and sending transcript text to the configured external LLM cleaning provider. Prefer public-only downloads or an explicit exported cookie file, disable remote cleaning for private or sensitive media, and supervise any nohup batch jobs so they do not keep running unexpectedly.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Rogue AgentSelf-Modification, Session Persistence
YARA SignaturesMalware Match, Webshell Match, Cryptominer Match
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration

Findings (12)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 88% confidence
Finding: The skill clearly instructs the agent to invoke shell scripts that read and write local files, but the manifest does not declare those capabilities. Undeclared shell and filesystem access reduces transparency and can lead to overly broad execution in environments that rely on permission metadata for policy enforcement.

Tp4

High

Category: MCP Tool Poisoning
Confidence: 93% confidence
Finding: The description says the workflow is local and needs no API key, but the documentation also describes external LLM-based cleaning, optional browser-cookie extraction, metadata harvesting, and use of remote AI subtitles. This mismatch can mislead users and policy systems about data egress, credential exposure, and the true privacy/security profile of the skill.

Description-Behavior Mismatch

Medium

Confidence: 87% confidence
Finding: Claiming 'no API key needed' and local transcription while documenting an external API-backed cleaning step is a security-relevant disclosure failure. Users may provide sensitive media under the assumption it stays local, when transcript content may instead be sent to a third-party model provider.

Context-Inappropriate Capability

Medium

Confidence: 96% confidence
Finding: The script automatically falls back to `--cookies-from-browser chrome` when anonymous access fails, which causes yt-dlp to import the user's Chrome cookies for authenticated requests. Even if intended only to access Bilibili content, this uses sensitive browser credentials without an explicit opt-in at runtime, expanding the skill's capabilities beyond simple public transcription and increasing privacy/security risk if logs, child processes, or downstream tools expose those cookies.

Description-Behavior Mismatch

Low

Confidence: 84% confidence
Finding: The script derives a new output directory name from remote uploader metadata and may move the caller-provided output directory to that new path. This undisclosed filesystem mutation can surprise users, break assumptions in calling automation, and potentially relocate existing files to an unintended location based on untrusted network-sourced data.

Description-Behavior Mismatch

Medium

Confidence: 90% confidence
Finding: The script advertises local transcription via whisper.cpp, but the cleaning stage is configured to send transcript chunks to an external opencode/MiniMax model. This creates a real data exposure risk because users may reasonably assume the entire pipeline is local, while sensitive transcript content is transmitted off-host without an explicit trust boundary or consent step.

Missing User Warnings

Medium

Confidence: 97% confidence
Finding: Importing browser cookies without a clear warning is a meaningful security/privacy issue because browser cookie stores contain active session material. In this skill's context, transcription of Bilibili videos does not inherently require access to broader browser credentials, so silently escalating to authenticated browser state is more dangerous than the stated purpose suggests.

Missing User Warnings

Low

Confidence: 80% confidence
Finding: The script performs rename and write operations that materially change the filesystem layout but does not clearly disclose this behavior to the user beforehand. While not directly exploitable as code execution, hidden path changes can cause data management mistakes, overwrite/confusion scenarios, and operational risk in automated workflows.

Missing User Warnings

Medium

Confidence: 96% confidence
Finding: Each transcript chunk is embedded into a prompt and sent to an external model service, but the script provides no explicit warning, consent, or redaction step. In the context of a knowledge-base builder, transcripts may contain private conversations, internal meetings, or copyrighted material, so silent exfiltration to a third party is a meaningful privacy and compliance issue.

Session Persistence

Medium

Category: Rogue Agent
Content: ## ⚠️ Long-running tasks Use nohup to avoid session compaction killing processes: ```bash nohup bash scripts/batch_clean.sh ./kb/UP主名_UID/ 0 80 > /tmp/clean.log 2>&1 & ```
Confidence: 72% confidence
Finding: nohup

Session Persistence

Medium

Category: Rogue Agent
Content: ```bash # Auto-chunked, just re-run to resume nohup bash scripts/batch_channel.sh "https://space.bilibili.com/UID/" ./kb/output > /tmp/batch.log 2>&1 & ``` If still fails, manually fetch URL list:
Confidence: 74% confidence
Finding: nohup

YARA rule 'info_stealer': Information stealer patterns (credential harvesting, browser data theft) [malware]

High

Category: YARA Match
Content: ## Environment Variables | Variable | Default | Description | |----------|---------|-------------| | `WHISPER_CLI` | `whisper-cli` | Path to whisper.cpp | | `WHISPER_MODEL` | `~/.whisper-cpp/ggml-small.bin` | Whisper model | | `OPENCODE_BIN` | `~/.opencode/bin/opencode` | opencode CLI |
Confidence: 90% confidence
Finding: cookies for member-only content (`--cookies-from-browser chrome; cookies-from-browser chrome

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal