Speak Turbo - Talk to your Claude 90ms latency!

Security checks across malware telemetry and agentic risk

Overview

This is a coherent local text-to-speech skill that plays or saves audio and runs a disclosed localhost daemon.

Install only if you are comfortable with a local TTS daemon, audible playback by default, and local WAV file creation. Avoid speaking highly sensitive text in shared spaces, and do not add broad directories such as your whole home folder to the permanent output allowlist unless you intend that access.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Rogue AgentSelf-Modification, Session Persistence
  • MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (5)

Lp3

Medium
Category
MCP Least Privilege
Confidence
91% confidence
Finding
The skill documentation describes capabilities to read and write files, invoke shell commands, and access a local HTTP service, but it declares no permissions. This creates a trust and review gap: users or platforms may approve the skill without understanding that it can persist configuration, write audio files, manage processes, and communicate over the network.

Missing User Warnings

Low
Confidence
86% confidence
Finding
The quick-start instructs immediate playback through speakers without a clear warning that audio will play automatically. Unexpected audio can disclose sensitive content to nearby people, disrupt shared environments, or create unsafe situations if used in the wrong context.

Missing User Warnings

Medium
Confidence
90% confidence
Finding
The CLI sends user-provided text to a local daemon over plain HTTP with no explicit warning, consent prompt, or authentication boundary explanation. Although the endpoint is bound to 127.0.0.1, sensitive text can still be exposed to any local process able to observe, intercept, or abuse the local service, which is especially relevant for a TTS skill likely to handle private agent/user content.

Missing User Warnings

Medium
Confidence
83% confidence
Finding
User-provided text is transmitted over HTTP to a local daemon process without any explicit notice, consent boundary, or authentication. In an agent setting, users may reasonably assume text is handled in-process; sending potentially sensitive prompts to another service increases exposure to interception by local malware, unintended logging, or abuse by a rogue process bound to the expected port.

Session Persistence

Medium
Category
Rogue Agent
Content
- Your current working directory
- `~/.speakturbo/`

If you need to write elsewhere, use `--allow-dir`:

```bash
speakturbo "Hello" -o /custom/path/audio.wav --allow-dir /custom/path
Confidence
78% confidence
Finding
write elsewhere, use `--allow-dir`: ```bash speakturbo "Hello" -o /custom/path/audio.wav --allow-dir /custom/path ``` To permanently allow a directory, add it to `~/.speakturbo/config`: ```bash mkd

VirusTotal

63/63 vendors flagged this skill as clean.

View on VirusTotal