Telegram Voice To Voice Macos

PassAudited by VirusTotal on May 12, 2026.

Overview

Type: OpenClaw Skill Name: telegram-voice-to-voice-macos Version: 0.1.3 The skill bundle is classified as benign. All scripts (`transcribe_telegram_ogg.sh`, `tts_telegram_voice.sh`) and instructions (`SKILL.md`) are directly aligned with the stated purpose of providing a Telegram voice-to-voice workflow on macOS. File system operations are confined to expected OpenClaw media and workspace directories (`~/.openclaw/media/inbound`, `~/.openclaw/workspace/voice_out`). The use of external commands (`yap`, `ffmpeg`, `say`, `defaults`) is necessary for the skill's functionality, and arguments are generally quoted, mitigating direct shell injection risks from the script's side. There is no evidence of intentional data exfiltration, malicious execution, persistence mechanisms, or prompt injection attempts designed to subvert the agent's purpose.

Findings (0)

Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.

Note

ASI02: Tool Misuse and Exploitation

What this means

The agent can send generated voice replies in Telegram on the user's behalf when this workflow is used.

Why it was flagged

The skill instructs the agent to send generated audio back through Telegram. This is expected for the stated purpose, but it is an external message-sending action.

Skill content

use the `message` tool with `asVoice: true` and `media: <path.ogg>`

Recommendation

Use this skill only where Telegram auto-replies are intended, and review generated replies or switch to text mode if voice replies are not desired.

Note

ASI07: Insecure Inter-Agent Communication

What this means

In a busy Telegram inbox, the fallback could transcribe the wrong recently saved voice note.

Why it was flagged

If no explicit attachment path is passed, the helper selects the newest inbound OGG file without binding it to the current Telegram sender or message.

Skill content

OGG_PATH="$(ls -t "${HOME}/.openclaw/media/inbound"/*.ogg 2>/dev/null | head -n 1 || true)"

Recommendation

Prefer passing the explicit attachment path from the message context; consider changing the helper to require a path or validate the sender/message before transcription.

Note

ASI06: Memory and Context Poisoning

What this means

The workspace may retain Telegram sender IDs and their voice/text preference until the state file is deleted.

Why it was flagged

The skill stores persistent per-user reply-mode state using Telegram sender IDs. This is disclosed and purpose-aligned, but it is persistent local state.

Skill content

State file: `voice_state/telegram.json`; Key: Telegram sender user id

Recommendation

Keep the workspace private and delete `voice_state/telegram.json` if you want to reset or remove stored reply preferences.

Note

ASI04: Agentic Supply Chain Vulnerabilities

What this means

A compromised or untrusted local `yap` or `ffmpeg` binary would affect the safety of this workflow.

Why it was flagged

The skill relies on locally installed third-party binaries rather than installing them itself. No hidden installer is shown, but binary provenance matters.

Skill content

`yap` CLI available in `PATH` ... Project: https://github.com/finnvoor/yap ... `ffmpeg` available in `PATH`

Recommendation

Install required binaries from trusted sources and keep them updated.