Maylo Voice Assistant

Security checks across malware telemetry and agentic risk

Overview

This is a real local voice assistant, but its network web UI can accept or trigger audio processing without authentication, so it needs careful review before use.

Install only if you are comfortable running a local voice assistant that listens through the Mac microphone, saves audio/log artifacts, and exposes a web UI on your network. Prefer binding the UI to localhost or a trusted interface, add authentication before using 0.0.0.0, disable MAYLO_WEB_WAKE unless you explicitly want web-started wake listening, and review the OpenClaw agent’s permissions because spoken or streamed audio becomes local agent input.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Findings (4)

Lp3

Medium
Category
MCP Least Privilege
Confidence
94% confidence
Finding
The skill instructs the user to run local shell scripts, copy application files, create a virtual environment, and start services, which clearly implies shell, file read/write, and environment access despite not declaring permissions. Undeclared capabilities undermine the platform’s trust and review model because operators may authorize or execute the skill without understanding that it can modify files and launch processes on the host.

Description-Behavior Mismatch

Medium
Confidence
89% confidence
Finding
The web server starts an always-on wake-word listener automatically at application startup, which expands the component from a UI service into a background audio-capture service without an explicit opt-in. In this context, that creates privacy and operational risk because merely launching the web UI can enable continuous microphone monitoring on the host Mac mini.

Missing User Warnings

Medium
Confidence
87% confidence
Finding
The /ws/audio endpoint accepts arbitrary audio streamed over WebSocket and processes it immediately, but there is no authentication, origin restriction, consent flow, or user-warning mechanism visible in this file. In a voice-assistant context this increases privacy risk and can allow any reachable client to submit microphone-derived audio for transcription/agent processing.

Missing User Warnings

Medium
Confidence
84% confidence
Finding
The legacy push-to-talk path can trigger server-side microphone processing via assistant.run_once_ptt(), and startup also enables background wake-word listening by default. That means remote interaction with the web UI can cause audio capture on the server host itself, which is more sensitive than ordinary client-side recording because it activates the machine's microphone.

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal