Voice Assistant

Security checks across malware telemetry and agentic risk

Overview

This is a disclosed voice assistant whose microphone, gateway, and ElevenLabs behavior matches its stated purpose, though users should treat it as privacy-sensitive.

Install only if you are comfortable with a microphone assistant that can send recognized speech to your OpenClaw gateway and send assistant response text to ElevenLabs for TTS. Protect the .env file, use a least-privileged gateway token, keep GATEWAY_URL pointed at a trusted gateway, and pause or quit the tray app when you do not want listening active.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep

Findings (4)

Missing User Warnings

Medium

Confidence: 93% confidence
Finding: The skill description discusses microphone capture, transcription, gateway streaming, and ElevenLabs TTS, but it does not prominently warn users that audio-derived data and conversation text may leave the local machine and be processed by external services. In a voice assistant context, this omission can mislead users about privacy expectations and result in unintentional disclosure of sensitive spoken content.

Missing User Warnings

Medium

Confidence: 96% confidence
Finding: The documentation states that the assistant automatically listens for 5 seconds after speaking, but it does not present this as an explicit privacy/safety warning. Users may reasonably assume listening ends after the response, so this hidden follow-up capture increases the risk of recording nearby or unintended speech.

Missing User Warnings

Medium

Confidence: 88% confidence
Finding: The assistant transcribes microphone input and forwards the resulting text to a gateway service without any explicit runtime notice or consent flow informing the user that spoken content leaves the local speech-to-text component. In an always-on voice assistant, this creates a meaningful privacy risk because users may reasonably assume processing is local after wake-word detection and may disclose sensitive information unintentionally.

Missing User Warnings

Medium

Confidence: 93% confidence
Finding: The code sends arbitrary text to the external ElevenLabs API for speech generation, which can expose user prompts, assistant responses, or other sensitive content to a third party without any user-facing notice or consent mechanism in this component. In a voice-assistant skill, spoken content may include private conversational data, making this more sensitive than a purely local TTS implementation.

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal