Audio Processing (Iyeque)

Security checks across malware telemetry and agentic risk

Overview

This skill appears to do what it advertises: user-directed audio transcription, analysis, transformation, and text-to-speech, with no evidence of hidden persistence, credential access, or unrelated data collection.

Install in an isolated Python environment if possible. Treat the TTS feature as cloud-backed: do not use it for secrets, regulated data, or confidential internal text unless you are comfortable sending that text to Google TTS. Expect ffmpeg and Whisper dependencies to consume disk space, and note that the strict path checks may block files outside the current workspace.
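The workspace restriction mentioned above can be pictured with a short sketch. This is not the skill's actual check, which this report does not reproduce; the function name `is_within_workspace` and the `/tmp/ws` workspace are hypothetical, chosen only to show why a file outside the working directory would be rejected.

```python
from pathlib import Path


def is_within_workspace(candidate: str, workspace: str = ".") -> bool:
    """Return True if `candidate` resolves to a location inside `workspace`.

    Hypothetical helper: it illustrates a strict path check, not the
    skill's real implementation.
    """
    root = Path(workspace).resolve()
    # Joining with an absolute candidate yields the candidate itself,
    # so absolute paths outside the workspace are rejected too.
    target = (root / candidate).resolve()
    try:
        target.relative_to(root)
    except ValueError:
        return False
    return True


if __name__ == "__main__":
    print(is_within_workspace("audio/clip.wav", "/tmp/ws"))
    print(is_within_workspace("../elsewhere/clip.wav", "/tmp/ws"))
```

Under a check of this kind, copying the audio file into the workspace is the simplest way to get it accepted.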

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data Exfiltration: External Transmission, Env Variable Harvesting, File System Enumeration
  • Excessive Agency: Unrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • MCP Least Privilege: Underdeclared Capability, Wildcard Permission, Missing Permission Declaration
  • Prompt Injection: Instruction Override, Hidden Instructions, Exfiltration Commands
  • Privilege Escalation: Excessive Permissions, Sudo/Root Execution, Credential Access
Findings (4)

Lp3

Medium
Category
MCP Least Privilege
Confidence
89% confidence
Finding
The skill documentation exposes shell-capable installation and execution flows (`brew`, `pip`, `uv run`, `ffmpeg`, `python3`) but does not declare corresponding permissions. This creates a trust and review gap: an agent or user may invoke commands with system-level effects without explicit permission signaling, increasing the chance of unintended command execution or unsafe environment modification.
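One way a reviewer might surface this gap is to compare the executables named in the documented commands against whatever the skill declares. The sketch below assumes a hypothetical declaration set (`DECLARED_COMMANDS`) and made-up command lines; the scanner's actual method and the skill's manifest format are not shown in this report.

```python
import shlex

# Hypothetical declaration: executables the skill says it may invoke.
DECLARED_COMMANDS = {"ffmpeg", "python3"}


def undeclared_commands(doc_commands, declared=DECLARED_COMMANDS):
    """Return executables used in documented commands but not declared."""
    used = set()
    for line in doc_commands:
        tokens = shlex.split(line)
        if tokens:
            # The first token of a simple command line is the executable.
            used.add(tokens[0])
    return sorted(used - set(declared))


if __name__ == "__main__":
    # Illustrative command lines, not copied from the skill's documentation.
    documented = [
        "brew install ffmpeg",
        "pip install gTTS",
        "uv run tool.py",
        "ffmpeg -i in.mp3 out.wav",
    ]
    print(undeclared_commands(documented))
```

The sketch only handles simple command lines; pipelines, `sudo` prefixes, and shell built-ins would need extra parsing.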

Context-Inappropriate Capability

Medium
Confidence
93% confidence
Finding
The TTS implementation uses gTTS, which sends the provided text to Google's remote service to generate audio. That creates an unannounced data egress path for potentially sensitive user content, and the skill description does not clearly disclose that external network transmission is required.

Missing User Warnings

Medium
Confidence
95% confidence
Finding
The TTS action uses gTTS, which relies on Google's service and may transmit user-provided text off-host, but the documentation does not warn users about that external disclosure. This is dangerous because users may submit sensitive text under the assumption that processing is local, leading to privacy, confidentiality, or compliance issues.

Missing User Warnings

High
Confidence
96% confidence
Finding
User-provided text is transmitted to an external TTS service without any warning, consent flow, or alternative local mode. If users submit secrets, internal data, or regulated content, this can lead to unintended disclosure to a third party and associated compliance/privacy issues.
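A consent flow of the kind this finding calls for could be as small as an explicit opt-in flag checked before any text leaves the host. The following is a minimal sketch under that assumption: `synthesize_with_consent`, `CloudConsentRequired`, and the `allow_cloud` flag are hypothetical names, and the cloud call is passed in as a plain callable so the example does not depend on gTTS being installed.

```python
from typing import Callable


class CloudConsentRequired(Exception):
    """Raised when text would leave the host without explicit opt-in."""


def synthesize_with_consent(
    text: str,
    synthesize: Callable[[str], bytes],
    allow_cloud: bool = False,
) -> bytes:
    """Call the cloud-backed synthesizer only after explicit opt-in.

    `synthesize` stands in for whatever function sends text to the
    remote TTS service; it is an assumption, not the skill's API.
    """
    if not allow_cloud:
        raise CloudConsentRequired(
            "TTS sends the text to an external service; "
            "pass allow_cloud=True to confirm."
        )
    return synthesize(text)


if __name__ == "__main__":
    # Stand-in synthesizer for demonstration; it makes no network call.
    fake_tts = lambda t: t.encode("utf-8")
    print(synthesize_with_consent("hello", fake_tts, allow_cloud=True))
```

Defaulting `allow_cloud` to `False` means a caller who never read the documentation gets an error rather than a silent upload.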

VirusTotal

All 66 of 66 vendors reported this skill as clean.
