Pocket TTS Complete Documentation

Security checks across malware telemetry and agentic risk

Overview

This is a documentation-only Pocket TTS skill whose higher-risk examples are disclosed and aligned with its text-to-speech purpose.

Install this if you want Pocket TTS documentation available to your agent. Before running examples, verify the external pocket-tts package, prefer pinned versions in controlled environments, use only voices you own or have permission to clone, treat remote voice/model URLs as trusted-input decisions, keep any server bound to localhost unless secured, and protect generated safetensors because they can enable later voice reuse.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
  • Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Findings (3)

Vague Triggers

Medium
Confidence
87% confidence
Finding
The README says the skill triggers automatically for broad topics like text-to-speech, voice cloning, and streaming audio generation without defining tighter boundaries. In an agent ecosystem, overly broad trigger criteria can cause the skill to activate in unrelated conversations, increasing the chance of unnecessary context injection, mistaken tool selection, or exposure to sensitive prompts that were not intended for this documentation skill.

Missing User Warnings

Medium
Confidence
89% confidence
Finding
The skill prominently advertises voice cloning and third-party voice use but provides no warning about consent, impersonation, or misuse of someone else's audio. In a TTS skill, this omission increases the chance of harmful or deceptive use because users are explicitly encouraged to clone voices from short samples and remote repositories.

Missing User Warnings

Low
Confidence
80% confidence
Finding
The documentation instructs users to start a FastAPI server and even change host/port, but it does not warn that binding to broader interfaces could expose generated content or cloned-voice capabilities to other users on the machine or network. While this is not an exploit by itself, the omission can lead to unintended access in environments where local services are reachable by others.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal