triple-memory-baidu-embedding

Security checks across malware telemetry and agentic risk

Overview

This is a real memory skill, but it persistently stores conversation data and can send it to Baidu while overstating privacy and hiding memory activity from users.

Review before installing. Use only if you are comfortable with conversation memories and search queries being stored long term and potentially sent to Baidu when credentials are configured. Avoid storing secrets, regulated data, personal data, or proprietary content until the privacy claims, silent-memory instruction, .env loading, script quoting, and retention controls are fixed.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration

Findings (19)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 91% confidence
Finding: The skill explicitly instructs use of environment variables and shell commands, but it does not declare corresponding permissions or capability requirements. This creates a transparency and governance gap: an agent or platform may execute shell operations or access secrets from the environment without users or policy controls being clearly informed.

Intent-Code Divergence

Medium

Confidence: 97% confidence
Finding: The README makes strong privacy claims such as 'full privacy protection' while the skill also relies on Baidu API calls for embeddings. That is misleading because user content or derived content may be transmitted to an external provider, causing operators to enable the skill under false assumptions about data locality and confidentiality.

Description-Behavior Mismatch

Medium

Confidence: 97% confidence
Finding: The script is presented as a verification/health-check routine, but it performs a real persistent write to the memory database by calling add_memory(). That side effect can pollute production memory, alter later agent behavior, and violate operator expectations that validation is read-only.

Intent-Code Divergence

Medium

Confidence: 94% confidence
Finding: The comments and success messaging frame the script as a safe pre-start verification, but the implementation performs state-changing operations. This mismatch is dangerous because users and higher-level agents may run it automatically under the assumption that it is non-destructive, leading to unauthorized persistent changes.

Context-Inappropriate Capability

Medium

Confidence: 82% confidence
Finding: The script sends user-provided memory content to a remote Baidu embedding backend whenever API credentials are present, while the skill description emphasizes local/privacy-preserving memory. This creates a genuine confidentiality and data-governance risk because sensitive memories may be transmitted off-host without an explicit opt-in, clear disclosure, or content classification.

Missing User Warnings

Medium

Confidence: 94% confidence
Finding: The guide instructs users to enable Baidu Embedding for semantic memory using API credentials and describes storing/searching memory data, but it does not clearly warn that user memory content may be transmitted to an external third-party service. In a memory skill, this is particularly sensitive because the stored content can include persistent user preferences, decisions, and session context, increasing privacy and compliance risk if operators enable the feature without informed consent.

Missing User Warnings

Medium

Confidence: 94% confidence
Finding: The README advertises automatic capture/recall together with Baidu API usage but does not clearly warn that conversation content may be sent to an external service. In a memory system, captured data can include sensitive preferences, decisions, tasks, and conversation history, so lack of explicit disclosure creates a meaningful privacy and compliance risk.

Missing User Warnings

Medium

Confidence: 96% confidence
Finding: The Privacy & Security section claims local-only handling and 'No External Sharing' despite documented Baidu API calls. This contradiction can mislead users into exposing sensitive memory data to a third party without informed consent, especially because the feature is positioned as automatic and privacy-focused.

Vague Triggers

Medium

Confidence: 94% confidence
Finding: The auto-capture triggers are overly broad and match normal conversational phrases such as 'I like' or 'my X is', which can cause routine user messages to be persistently stored. In this skill, that risk is amplified because the memory is designed to persist across sessions and may be processed through the Baidu API, increasing the chance of unintended collection of personal or sensitive data.

Missing User Warnings

High

Confidence: 98% confidence
Finding: The skill description and usage model do not clearly warn users that conversation content may be automatically stored and sent to Baidu for embedding generation. Because this system performs automatic recall/capture and emphasizes silent operation, users may unknowingly disclose sensitive information that is retained locally and partially processed by an external service.

Natural-Language Policy Violations

High

Confidence: 99% confidence
Finding: The instruction to 'never announce memory operations' directs the agent to perform storage and retention covertly, without user awareness or consent. In a memory skill that supports cross-session persistence and third-party API-backed embeddings, silent collection materially increases privacy, compliance, and trust risks.

Missing User Warnings

Medium

Confidence: 94% confidence
Finding: The examples explicitly encourage storing and searching memory content through Baidu Embedding, which implies sending user or project data to an external service, but the documentation does not warn about data disclosure, sensitivity handling, or consent. In a memory skill designed for persistent context, users are especially likely to store preferences, decisions, and project details, making accidental transmission of sensitive information more dangerous in context.

Missing User Warnings

Medium

Confidence: 84% confidence
Finding: The script consumes sensitive API credentials and gives only internal status output, without clear operator disclosure that secrets are being used as part of verification. In an agent-skill context, this can cause credential use without meaningful user awareness or approval, especially if the script is run automatically.

Missing User Warnings

Medium

Confidence: 95% confidence
Finding: The verification step sends test text to an external Baidu API without prominently warning that data will leave the host. Even though the sample payload is non-sensitive here, silent network transmission in a memory/privacy-focused skill creates a trust and data-governance risk and may normalize unsafe behavior for future modifications.

Missing User Warnings

Medium

Confidence: 96% confidence
Finding: The verification flow persists test data into the actual memory store without warning the user that durable records will be created. In a multi-backend memory skill, this is especially risky because even benign-seeming test entries can contaminate recall, search results, and downstream decision-making across sessions.

Ssd 3

Medium

Confidence: 97% confidence
Finding: The skill explicitly encourages retaining user-provided information across sessions without notifying the user. Even if intended as convenience, undisclosed persistence changes the sensitivity of ordinary conversation by turning ephemeral statements into long-term profile data.

Ssd 3

Medium

Confidence: 88% confidence
Finding: The script persists arbitrary user-supplied text into multiple long-lived stores, including local files and optional remote semantic storage, without filtering, minimization, retention controls, or warnings about sensitive content. In a memory skill, persistence is expected, but duplicating data across several backends materially increases exposure, accidental retention, and later leakage of secrets, credentials, or personal data.

Credential Access

High

Category: Privilege Escalation
Content: WORKSPACE="${WORKSPACE:-$SKILL_DIR}" # Load Baidu API configuration if available if [ -f "$WORKSPACE/.env" ]; then source "$WORKSPACE/.env" fi
Confidence: 97% confidence
Finding: .env"

Credential Access

High

Category: Privilege Escalation
Content: # Load Baidu API configuration if available if [ -f "$WORKSPACE/.env" ]; then source "$WORKSPACE/.env" fi CMD="${1:-help}"
Confidence: 97% confidence
Finding: .env"

VirusTotal

56/56 vendors flagged this skill as clean.

View on VirusTotal