Security audit

MOSS Voice Generator 指令式音色生成

Security checks across malware telemetry and agentic risk

Overview

This is a straightforward cloud voice-generation skill that sends requested text and style prompts to MOSI Studio and saves the returned audio file.

Install this only if you trust MOSI Studio to process the text you synthesize. Avoid sending confidential, personal, or regulated content unless MOSI’s handling terms are acceptable, and prefer using MOSI_TTS_API_KEY from the environment instead of passing the key on the command line.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep

Findings (2)

Missing User Warnings

Medium

Confidence: 93% confidence
Finding: The script transmits user-supplied text and voice-style instructions to a third-party cloud API, but it does not clearly disclose this data egress at runtime or require explicit user confirmation. This can expose sensitive or confidential content if callers assume the synthesis happens locally, especially in an agent context where arbitrary user prompts may be forwarded automatically.

External Transmission

Medium

Category: Data Exfiltration
Content: echo "Generating voice..." >&2 RESPONSE=$(curl -sf -X POST \ "https://studio.mosi.cn/api/v1/audio/speech" \ -H "Authorization: Bearer $API_KEY" \ -H "Content-Type: application/json" \
Confidence: 89% confidence
Finding: curl -sf -X POST \ "https://studio.mosi.cn/api/v1/audio/speech" \ -H "Authorization: Bearer $API_KEY" \ -H "Content-Type: application/json" \ -d

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal