deepsop-声音克隆助手

Security checks across malware telemetry and agentic risk

Overview

This voice-cloning skill does what it claims, but it exposes a likely real API key and handles sensitive voice data with weak consent and privacy guardrails.

Review this skill carefully before installing. Do not use the included API key; it should be considered exposed and revoked. Only upload voices you are authorized to use, assume local recordings and generated speech may leave your machine and become accessible through service URLs, and prefer a version that adds explicit consent prompts, privacy/retention documentation, and impersonation restrictions.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access

Findings (6)

Intent-Code Divergence

High

Confidence: 98% confidence
Finding: The documentation includes what appears to be a real API key in plaintext, directly contradicting its own guidance not to commit secrets. Exposed credentials can be harvested from repositories or logs and used to access paid APIs, consume quota, or interact with the associated account.

Missing User Warnings

Medium

Confidence: 95% confidence
Finding: The README advertises automatic audio upload and voice cloning but does not clearly warn users that local audio and synthesis text will be transmitted to an external third-party service. In a voice-cloning context, uploaded samples may contain biometric voice data and sensitive speech content, so inadequate disclosure can lead to uninformed sharing of highly sensitive personal data.

Vague Triggers

Medium

Confidence: 80% confidence
Finding: Overly broad trigger phrases increase the chance that the skill activates unintentionally during ordinary conversation. In a skill that can upload audio, clone voices, and perform networked operations, accidental invocation can lead to privacy-impacting actions or unintended API usage.

Missing User Warnings

High

Confidence: 95% confidence
Finding: The skill promotes cloning voices, including those of leaders or celebrities, without clear warnings about consent, authorization, impersonation, or privacy risk. In this context, lack of guardrails materially increases the likelihood of misuse for fraud, harassment, or unauthorized biometric/voice replication.

Missing User Warnings

Medium

Confidence: 91% confidence
Finding: The documentation enables uploading local voice recordings, cloning voices, and returning directly accessible OSS URLs without any mention of consent, authorization, retention, or access controls. In a voice-cloning skill, this omission is security-relevant because it normalizes processing biometric voice data and sharing potentially public audio links, which can facilitate impersonation, privacy violations, and unauthorized disclosure of sensitive recordings.

Missing User Warnings

Medium

Confidence: 95% confidence
Finding: The skill uploads user-provided audio to a remote service for voice cloning without any explicit consent flow or warning that biometric voice data leaves the local environment. Voiceprints are sensitive personal data, and silent transmission to a third-party API can create privacy, compliance, and impersonation risks, especially in a voice-cloning context.

VirusTotal

VirusTotal findings are pending for this skill version.

View on VirusTotal