Voice Chat Bridge

Security checks across malware telemetry and agentic risk

Overview

This skill does what it claims, but it enables always-on voice-message processing and automatic chat replies without enough scoping or privacy controls.

Install only if you want an automatic voice-message responder. Before enabling the SOUL.md trigger, restrict it to trusted chats or users, decide whether Edge TTS and the active LLM provider may receive sensitive text, suppress transcript/reply logging where possible, and use pinned dependencies and verified model downloads.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection

Findings (15)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 88% confidence
Finding: The skill declares no permissions even though its documented behavior requires shell execution, environment access, and outbound network use. This creates a transparency and consent problem: users and reviewers cannot accurately assess the skill's capabilities, and the undeclared network/shell access increases the risk of unexpected data exposure or misuse if the skill is installed or auto-triggered.

Tp4

High

Category: MCP Tool Poisoning
Confidence: 93% confidence
Finding: The documented behavior does not fully match the actual capabilities described by analysis, including use of an external session-level LLM/context service and platform-specific handling not disclosed in the skill description. Behavior/description mismatch is dangerous because it defeats informed trust boundaries: a user may believe the skill only processes local audio, while it may also transmit transcript and conversation context externally or operate in broader contexts than advertised.

Missing User Warnings

Medium

Confidence: 90% confidence
Finding: The design explicitly enables automatic download, transcription, and reply for voice messages across Telegram and Feishu, but it does not document consent, retention, disclosure, or handling boundaries for potentially sensitive audio content. Because voice messages often contain personal or confidential information and may be sent to local and remote STT/TTS components, the absence of privacy controls and user-facing notice creates a real privacy and compliance risk.

Missing User Warnings

Medium

Confidence: 91% confidence
Finding: The README states that voice messages are automatically processed and that Edge TTS may be used, but it does not clearly warn that user audio/transcripts may be sent to external services or require explicit user consent. In a messaging skill that auto-activates on received voice/audio, this creates a real privacy and compliance risk because sensitive user content can be transmitted off-device without notice.

Missing User Warnings

Medium

Confidence: 91% confidence
Finding: The skill omits a clear warning that generated replies and potentially sensitive derived content are sent to a cloud TTS provider. In a voice/chat workflow, transcripts and context can contain private or regulated information, so undisclosed third-party transmission creates privacy, compliance, and data-handling risk, especially when activation is automatic on incoming audio.

Missing User Warnings

Medium

Confidence: 94% confidence
Finding: The skill forwards full transcribed speech to another agent/LLM process without any consent, minimization, or disclosure. Spoken audio often contains sensitive personal or business information, so silent forwarding increases privacy and data-handling risk, especially because the content is combined with session context.

Ssd 3

Medium

Confidence: 97% confidence
Finding: The code logs the full transcript and reply and returns the transcript in the JSON output, creating multiple exposure paths for potentially sensitive spoken content. Logs, stdout capture, and downstream consumers may retain or disclose this data beyond the original voice interaction.

Ssd 3

Medium

Confidence: 87% confidence
Finding: The default reply template repeats the full transcribed speech back into the generated response, which can unnecessarily disclose sensitive spoken content in downstream channels, logs, notifications, or shared chat contexts. In a voice-processing skill that auto-activates on incoming audio, this increases the chance of propagating private data without explicit user confirmation.

Unpinned Dependencies

Low

Category: Supply Chain
Content: pyttsx3>=2.90 # Configuration pydantic>=2.6.0 pydantic-settings>=2.0.0 python-dotenv>=1.0.0
Confidence: 89% confidence
Finding: pydantic>=2.6.0

Unpinned Dependencies

Low

Category: Supply Chain
Content: # Configuration pydantic>=2.6.0 pydantic-settings>=2.0.0 python-dotenv>=1.0.0 # Audio Processing numpy>=1.21.0
Confidence: 78% confidence
Finding: python-dotenv>=1.0.0

Unpinned Dependencies

Low

Category: Supply Chain
Content: python-dotenv>=1.0.0 # Audio Processing numpy>=1.21.0 # Optional (for future extensions) requests>=2.31.0
Confidence: 72% confidence
Finding: numpy>=1.21.0

Unpinned Dependencies

Low

Category: Supply Chain
Content: numpy>=1.21.0 # Optional (for future extensions) requests>=2.31.0 # aiohttp>=3.9.0
Confidence: 82% confidence
Finding: requests>=2.31.0

Known Vulnerable Dependency: pydantic — 3 advisory(ies): CVE-2021-29510 (Use of "infinity" as an input to datetime and date fields causes infinite loop i); CVE-2024-3772 (Pydantic regular expression denial of service); CVE-2021-29510 (Pydantic is a data validation and settings management using Python type hinting.)

High

Category: Supply Chain
Confidence: 91% confidence
Finding: pydantic

Known Vulnerable Dependency: python-dotenv — 1 advisory(ies): CVE-2026-28684 (python-dotenv: Symlink following in set_key allows arbitrary file overwrite via )

Low

Category: Supply Chain
Confidence: 61% confidence
Finding: python-dotenv

Known Vulnerable Dependency: requests — 10 advisory(ies): CVE-2014-1830 (Exposure of Sensitive Information to an Unauthorized Actor in Requests); CVE-2024-47081 (Requests vulnerable to .netrc credentials leak via malicious URLs); CVE-2024-35195 (Requests `Session` object does not verify requests after making first request wi) +7 more

High

Category: Supply Chain
Confidence: 84% confidence
Finding: requests

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal