Resemble TTS and STT

Security checks across malware telemetry and agentic risk

Overview

This skill is a disclosed Resemble AI text-to-speech and speech-to-text wrapper, with some privacy and input-handling caveats but no evidence of hidden or malicious behavior.

Install only if you are comfortable sending selected text, uploaded audio, and your Resemble API key to Resemble AI. Avoid sensitive or regulated content unless that third-party processing is approved, prefer a limited API key, and expect unusual quotes or special characters in TTS text to cause malformed requests.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data Exfiltration: External Transmission, Env Variable Harvesting, File System Enumeration
Prompt Injection: Instruction Override, Hidden Instructions, Exfiltration Commands
Privilege Escalation: Excessive Permissions, Sudo/Root Execution, Credential Access
Supply Chain: Unpinned Dependencies, External Script Fetching, Obfuscated Code
Excessive Agency: Unrestricted Tool Access, Autonomous Decision Making, Scope Creep

Findings (4)

Missing User Warnings

Medium

Confidence: 95%
Finding: The skill sends user-provided text and audio to an external third-party API, but the description does not clearly warn users that potentially sensitive content leaves the local environment. This creates a privacy and data-governance risk because users may submit confidential speech or text without informed consent or awareness of external processing and retention.

Missing User Warnings

Medium

Confidence: 89%
Finding: The script uploads a local audio file to Resemble AI's external speech-to-text API, but it does not explicitly warn the user that the file contents will leave the local system and be processed by a third party. This is a real privacy/security issue because audio may contain sensitive personal, business, or regulated information, and users may invoke the skill without understanding the data transfer implications.

Missing User Warnings

Medium

Confidence: 95%
Finding: The script sends arbitrary user-provided text to Resemble AI's external synthesis endpoint without any explicit notice, consent prompt, or privacy warning. This creates a real data exposure risk because users may enter sensitive content assuming it is processed locally. External transmission is technically expected for a TTS integration, but it still needs transparent disclosure to be safe.
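The three findings above all point to the same gap: nothing tells the user before text or audio leaves the machine. A minimal sketch of what an explicit notice and consent prompt could look like follows. It is written in Python for illustration only; the function name `confirm_external_upload` and the wording of the notice are hypothetical and are not taken from the skill.

```python
def confirm_external_upload(what: str, ask=input) -> bool:
    """Show a disclosure notice and return True only on explicit consent.

    `what` describes the data about to be sent (hypothetical parameter).
    `ask` defaults to the built-in input() and can be replaced in tests.
    """
    notice = (
        f"This will send {what} to Resemble AI, a third-party service, "
        "for processing outside your local environment. Continue? [y/N] "
    )
    answer = ask(notice)
    # Anything other than an explicit yes is treated as a refusal.
    return answer.strip().lower() in ("y", "yes")


# Non-interactive demonstration with a stubbed answer.
if confirm_external_upload("the selected text", ask=lambda prompt: "n"):
    print("Proceeding with upload")
else:
    print("Upload cancelled")
```

Defaulting to "no" means an empty answer or a stray keypress does not trigger the transfer.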

External Transmission

Medium

Category: Data Exfiltration
Content:
    echo "🔊 Generating speech..."
    RESPONSE=$(curl -s -X POST "https://f.cluster.resemble.ai/synthesize" \
      -H "Authorization: Bearer $RESEMBLE_API_KEY" \
      -H "Content-Type: application/json" \
      -d "{
Confidence: 90%
Finding:
    curl -s -X POST "https://f.cluster.resemble.ai/synthesize" \
      -H "Authorization: Bearer $RESEMBLE_API_KEY" \
      -H "Content-Type: application/json" \
      -d
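The Overview notes that unusual quotes or special characters in TTS text can cause malformed requests. That is the behavior one would expect if the JSON body after `-d "{` is assembled by plain string interpolation, although the snippet is truncated and the actual payload is not shown. The sketch below, in Python for illustration, contrasts naive interpolation with serialization through a JSON library. The field name `text` and both function names are assumptions, not the skill's real payload or code.

```python
import json


def build_synthesize_body(text: str) -> str:
    """Serialize user text into a JSON body with proper escaping."""
    # The "text" field name is an assumption; the report truncates the
    # real payload, so the actual field names are unknown.
    return json.dumps({"text": text})


def naive_body(text: str) -> str:
    """Mimic string interpolation, which performs no escaping."""
    return '{"text": "' + text + '"}'


tricky = 'She said "hello"\nthen left'

# The serialized body round-trips to the original text.
safe = build_synthesize_body(tricky)
assert json.loads(safe)["text"] == tricky

# The interpolated body is not valid JSON once the text contains quotes.
try:
    json.loads(naive_body(tricky))
    naive_ok = True
except json.JSONDecodeError:
    naive_ok = False
print("naive body parses:", naive_ok)
```

In a shell script, the equivalent approach is to let a JSON-aware tool build the body, for example `jq -n --arg text "$TEXT" '{text: $text}'`, instead of embedding the variable directly in a quoted string.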

VirusTotal

66/66 vendors flagged this skill as clean.
