多语种音频翻译助手

Security checks across malware telemetry and agentic risk

Overview

The skill appears to be a real audio translator, but it needs review because it can install unpinned packages during use and sends speech-derived text to third-party services.

Review the scripts before installing, prefer running them in a virtual environment, and avoid sensitive audio unless you accept sending its text to MyMemory and edge-tts. Pin dependencies and fix the dynamic python -c handling before trusting it for untrusted audio or URLs.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection

Findings (8)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 92% confidence
Finding: The skill declares tools with clear file, shell, and network capabilities but does not declare corresponding permissions, which weakens transparency and policy enforcement. This can lead users or the host platform to underestimate that the skill can read local files, write outputs, download remote content, and invoke shell commands.

Context-Inappropriate Capability

Medium

Confidence: 96% confidence
Finding: The script installs Python packages on the host at runtime using pip, which changes the user's environment beyond the declared purpose of translating audio. This creates supply-chain and integrity risk because package installation executes code from external dependencies and can persist software changes on the system.

Description-Behavior Mismatch

Medium

Confidence: 98% confidence
Finding: The script sends recognized speech text to the external MyMemory translation API, but the skill metadata does not disclose that user-derived content leaves the local machine. Audio transcripts often contain sensitive personal or business data, so undisclosed third-party transmission is a real privacy and data-governance issue.

Missing User Warnings

Medium

Confidence: 95% confidence
Finding: The skill sends transcribed text to MyMemory and uses networked TTS, but the documentation does not clearly warn users that their audio-derived content may leave the local environment. Because audio often contains sensitive personal or business information, silent external transmission creates a privacy and data-handling risk.

Missing User Warnings

Medium

Confidence: 97% confidence
Finding: The workflow transmits text extracted from user audio to external translation services and also uses an external TTS service, yet there is no explicit privacy warning or consent flow. In this skill context, the data being processed is directly derived from user speech, which increases the sensitivity of the transmission.

External Transmission

Medium

Category: Data Exfiltration
Content: ### 步骤3: 翻译（MyMemory API） ```bash curl -s "https://api.mymemory.translated.net/get?q=<文本>&langpair=<源>|<目标>" ``` ### 步骤4: 目标语言语音合成（edge-tts）
Confidence: 94% confidence
Finding: https://api.mymemory.translated.net/

Unpinned Dependencies

Low

Category: Supply Chain
Content: faster-whisper edge-tts
Confidence: 95% confidence
Finding: faster-whisper

Unpinned Dependencies

Low

Category: Supply Chain
Content: faster-whisper edge-tts
Confidence: 95% confidence
Finding: edge-tts

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal