Telegram Voice Messaging Recovery

Security checks across malware telemetry and agentic risk

Overview

This voice recovery skill is mostly purpose-aligned, but it needs review because it can send reply text to hosted Edge/Microsoft TTS despite offline/local framing and includes unsafe shell command handling for audio paths.

Install only after reviewing the script and accepting that TTS text may be sent to Microsoft/Edge TTS and generated audio may remain cached under ~/.openclaw/tts. Patch the shell=True ffmpeg call before using it on untrusted Telegram/OpenClaw media paths, pin or vet dependencies, and confirm the missing transcribe-audio helper is supplied.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Behavioral ASTexec() Call, eval() Call, Dynamic Import
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (6)

subprocess module call

Medium
Category
Dangerous Code Execution
Content
if audio_file.endswith('.ogg'):
                wav_file = tempfile.mktemp(suffix=".wav")
                cmd = f"ffmpeg -i '{audio_file}' -ar 16000 -ac 1 '{wav_file}' -y 2>/dev/null"
                subprocess.run(cmd, shell=True, check=True)
                audio_file = wav_file
            
            # Transcribe with faster-whisper
Confidence
98% confidence
Finding
subprocess.run(cmd, shell=True, check=True)

Intent-Code Divergence

Medium
Confidence
96% confidence
Finding
The installer presents the skill as an offline/local voice system, but it explicitly configures Edge TTS as a hosted service. This mismatch can mislead users about privacy and network behavior, causing them to send text to an external provider when they reasonably expect fully local processing.

Intent-Code Divergence

Medium
Confidence
95% confidence
Finding
The installation summary says it restores local Edge TTS helpers, but the configured provider is a hosted remote service. This is a trust and transparency issue that may cause operators to deploy the skill in privacy-sensitive environments under false assumptions.

Missing User Warnings

Medium
Confidence
92% confidence
Finding
The installer performs network package installation via pip without clearly informing the user that it will download and execute third-party packages from external sources. In a skill installer context, undisclosed external dependency retrieval increases supply-chain and operational risk, especially in restricted or high-trust environments.

Missing User Warnings

Medium
Confidence
97% confidence
Finding
The script configures and advertises a hosted Edge TTS service without an upfront privacy warning, despite the skill branding suggesting offline/local voice handling. In this context, that makes the issue more dangerous because users may unknowingly transmit potentially sensitive prompt or response text to an external provider during normal operation and even during installation testing.

Missing User Warnings

Medium
Confidence
88% confidence
Finding
The function sends arbitrary input text to the `edge_tts` network service, which can expose sensitive or proprietary content to an external provider without any notice, consent, or classification checks. In a skill or agent environment, users may assume local processing, making this data-transfer behavior materially risky.

VirusTotal

67/67 vendors flagged this skill as clean.

View on VirusTotal