Alicloud Ai Audio Asr

Security checks across malware telemetry and agentic risk

Overview

This is a coherent Alibaba Cloud speech-to-text skill, but users should understand that selected audio is sent to DashScope and transcripts are saved locally.

Install only if you are authorized to send the chosen audio to Alibaba Cloud DashScope. Use a scoped API key, avoid processing highly sensitive recordings unless your compliance requirements allow it, and protect or delete saved transcript and raw-response files when they contain private content.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code

Findings (2)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 95% confidence
Finding: The skill exercises sensitive capabilities—environment variable access, file read/write, and outbound network calls—but does not declare permissions or present that capability surface explicitly. This increases the chance that agents or users invoke it without understanding it will read credentials, upload audio to a third-party service, and persist outputs locally.

Missing User Warnings

Medium

Confidence: 90% confidence
Finding: The documentation instructs users to send audio content and bearer-authenticated requests to external ASR endpoints, but it does not warn that uploaded audio may contain sensitive personal, business, or regulated data. In a transcription skill, the context makes this more significant because the primary function is to transmit potentially sensitive voice data off-box to a cloud provider.

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal