Fetch Archive to Lexiang

Security checks across malware telemetry and agentic risk

Overview

This appears to be a real archiving skill, but it needs Review because it can reuse browser sessions and cookies and send archived content to external services without strong consent boundaries.

Install only with the expectation that the skill may access your logged-in Chrome sessions, use cookies for paid or private sites, send article/transcript/PDF text to cloud AI providers, and upload copies to Lexiang/Tencent-backed storage. Prefer an isolated browser profile, disable cookie/CDP modes unless needed, avoid sensitive accounts or confidential documents, use environment variables instead of CLI token arguments, and confirm each translation or upload destination before running it.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
Behavioral ASTexec() Call, eval() Call, Dynamic Import
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection

Findings (13)

subprocess module call

Medium

Category: Dangerous Code Execution
Content: # On macOS, cookies are encrypted. We need to decrypt them. # Get the Chrome Safe Storage key from Keychain try: result = subprocess.run( [ "security", "find-generic-password",
Confidence: 99% confidence
Finding: result = subprocess.run( [ "security", "find-generic-password", "-s", "Chrome Safe Storage",

Context-Inappropriate Capability

High

Confidence: 100% confidence
Finding: The function explicitly extracts cookies from Chrome's database and decrypts them using the Keychain-derived secret, then repackages them for browser automation. This is classic session hijacking behavior: possession of those cookies can grant access to paid content and authenticated accounts without the user's explicit per-site consent.

Context-Inappropriate Capability

High

Confidence: 99% confidence
Finding: The module advertises attaching to the user's real Chrome over CDP specifically to bypass anti-automation and Cloudflare protections while inheriting the full login state. That creates a powerful mechanism to act as the user against protected services and intentionally defeats access controls rather than simply fetching public content.

Intent-Code Divergence

Medium

Confidence: 94% confidence
Finding: The docstring openly states the tool 'completely bypasses' Google and Cloudflare anti-automation detection. That is a strong indicator the implementation is designed to evade service protections, which increases the likelihood of deliberate abuse and can facilitate unauthorized access workflows.

Missing User Warnings

Medium

Confidence: 91% confidence
Finding: The README explicitly promotes accessing paid or login-walled content via Chrome cookie injection and CDP mode, but does not warn users about the sensitivity of authenticated session data, consent boundaries, or privacy/legal implications. In an agent context, this can normalize handling live browser sessions and cookies in ways that expose account access, subscription-only content, or personal data far beyond what a user intended.

Missing User Warnings

Medium

Confidence: 89% confidence
Finding: The README tells users the agent will automatically clone the repository, install dependencies, and perform initial configuration, but does not clearly warn that this causes code execution and system modification. In agent environments, silent installation behavior increases supply-chain and local-environment risk because users may not realize external code and packages are being fetched and run.

Vague Triggers

Medium

Confidence: 86% confidence
Finding: The trigger keywords are broad enough to activate this highly privileged skill for ordinary conversations unrelated to archiving. Because the skill can read local files, use browser login state, download protected content, and upload data to external services, accidental invocation materially raises the chance of unintended collection and transmission.

Missing User Warnings

High

Confidence: 99% confidence
Finding: The script accesses Chrome cookies and the Keychain without an explicit consent checkpoint at the time of execution. Even if the broader tool is user-invoked, silently harvesting browser session material is dangerous because users may not understand they are authorizing extraction of reusable credentials.

Missing User Warnings

High

Confidence: 98% confidence
Finding: This code copies the user's real Chrome cookie database into a separate CDP profile automatically and without an explicit warning. Duplicating session stores broadens the exposure of authenticated data and makes session replay or accidental leakage more likely.

Missing User Warnings

Medium

Confidence: 88% confidence
Finding: The script transmits full Markdown content to an external LLM API without any user confirmation, warning, or data classification check. In this skill context, the tool handles archived articles, paywalled content, transcripts, and other potentially sensitive material, so silent exfiltration to a third party creates privacy, compliance, and contractual risk.

Missing User Warnings

Medium

Confidence: 91% confidence
Finding: The script defaults to extracting cookies from the local Chrome profile and sending them to yt-dlp for requests, which can use sensitive authenticated session material without a strong user warning or explicit opt-in. In this skill's context, the stated goal includes bypassing login/paywalls, which makes silent credential use materially more dangerous and privacy-sensitive.

Missing User Warnings

Medium

Confidence: 93% confidence
Finding: Transcript paragraphs are sent to the OpenAI API for translation, but the script does not clearly warn that potentially private or copyrighted content will be transmitted to a third-party service. Because this tool is designed to process paywalled/login-gated media, the privacy and compliance implications are higher than for generic public content.

Ssd 3

Medium

Confidence: 94% confidence
Finding: The skill instructs the agent to persist arbitrary user-provided evaluation text to temporary files and inject it into documents without any sensitivity review. This can leak secrets, personal data, internal assessments, or regulated content into local storage and onward into the external knowledge base, especially because the text may contain confidential business commentary the user did not intend to archive verbatim.

VirusTotal

VirusTotal findings are pending for this skill version.

View on VirusTotal