MLX STT

v1.0.7

Local speech-to-text with MLX on Apple Silicon, using open-source models (default: GLM-ASR-Nano-2512).

License: MIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
VirusTotal: Benign
OpenClaw: Suspicious (medium confidence)
Purpose & Capability
Name and description (local MLX-based STT on Apple Silicon) align with the provided scripts: ffmpeg + mlx_audio invocation to transcribe audio. Requiring brew on macOS to install ffmpeg/uv is reasonable for this purpose.
Instruction Scope
Runtime instructions and scripts only convert the provided audio to WAV, invoke mlx_audio.stt.generate, print transcript files, and clean up temporary files. The scripts do download a model at first run and the mlx_audio command's output is redirected to /dev/null (silenced), which hides runtime logs/errors — not clearly malicious but reduces transparency. The skill does not read unrelated files or request extra environment data.
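The flow described above (convert the input to WAV, run the transcriber, keep its output visible rather than discarded) can be sketched roughly as follows. This is an illustration only, not the skill's actual script: the helper names `ffmpeg_to_wav_cmd` and `run_visible` are hypothetical, the 16 kHz mono settings are an assumption, and the mlx_audio invocation is left out because its exact arguments are not shown in this review.

```python
import subprocess
import sys
from typing import List


def ffmpeg_to_wav_cmd(src: str, dst: str) -> List[str]:
    """Build an ffmpeg command converting src to WAV.

    The 16 kHz mono settings are an assumption, not taken from the skill.
    """
    return ["ffmpeg", "-y", "-i", src, "-ar", "16000", "-ac", "1", dst]


def run_visible(cmd: List[str]) -> subprocess.CompletedProcess:
    """Run cmd and capture stdout/stderr instead of sending them to /dev/null."""
    result = subprocess.run(cmd, capture_output=True, text=True)
    if result.returncode != 0:
        # Surface the error output so failures are not silently hidden.
        print(result.stderr, file=sys.stderr)
    return result
```

Wrapping the transcription command in something like `run_visible` keeps the first-run model download and any errors inspectable, which addresses the transparency concern raised here.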
[!] Install Mechanism
install.sh uses brew (expected) but relies on the 'uv' CLI to install 'mlx-audio' with --force and --prerelease=allow. 'uv' and the source/registry used for mlx-audio are not documented here; installing a force/prerelease package from an opaque source can deliver arbitrary code. The install does not download from a clearly identified, verifiable release URL (e.g., official GitHub release or known package registry with provenance shown).
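A reviewer could check an install script for the two options flagged above before running it. The sketch below is a hypothetical helper (`flag_risky_installs` is not part of the skill or of uv); it only scans text and never executes anything. The sample script in the usage note is an assumed reconstruction, with only the `uv tool install` line taken from this review.

```python
import shlex
from typing import List

# The two options called out in the review above.
RISKY_FLAGS = ("--force", "--prerelease=allow")


def flag_risky_installs(script_text: str) -> List[str]:
    """Return script lines that run 'uv tool install' with a flagged option."""
    findings = []
    for line in script_text.splitlines():
        try:
            tokens = shlex.split(line, comments=True)
        except ValueError:
            # Unbalanced quotes (e.g. a multi-line string): fall back to a plain split.
            tokens = line.split()
        is_uv_install = tokens[:3] == ["uv", "tool", "install"]
        if is_uv_install and any(tok in RISKY_FLAGS for tok in tokens):
            findings.append(line.strip())
    return findings
```

For example, calling it on a script containing `uv tool install --force mlx-audio --prerelease=allow` returns that line, while a plain `brew install ffmpeg` line is ignored. A stricter install would drop both options and pin an exact version that has been inspected.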
Credentials
The skill declares no required environment variables or credentials and its scripts do not attempt to read other env vars or sensitive config paths — the requested environment access appears minimal and proportional.
[!] Persistence & Privilege
Registry metadata sets always:true (force‑included in every agent run). A narrow, on‑demand STT skill does not reasonably need to be force‑enabled for all agents. Combined with the opaque install of a prerelease binary, this increases the blast radius if the installed tool were malicious or buggy.
What to consider before installing
This skill appears to perform local STT as described, but exercise caution before installing:
- always:true is unnecessary for an on‑demand STT tool; prefer a skill that is not force‑enabled.
- install.sh runs 'uv tool install --force mlx-audio --prerelease=allow', which will fetch and install a third‑party prerelease binary from an unspecified source. Ask the author for the exact upstream registry/URL and inspect that package before installing.
- The mlx_audio tool will download models at runtime (network activity). If you have sensitive data or need an auditable supply chain, run this in an isolated VM or disposable machine first.
- Because stdout/stderr are silenced for the tool, initial failures or unexpected network activity may be hidden; consider running the command manually without redirection to inspect behavior.
- If you decide to proceed, manually run the install script in a controlled environment, verify the origin of the 'uv' CLI and the 'mlx-audio' package, and avoid installing on a machine with sensitive secrets.
Additional information that would raise confidence to 'high': explicit upstream URLs or package registry details for 'uv' and 'mlx-audio', a signed release or checksum for the model/binary, and removal of always:true or an explanation why force‑enable is required.
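If the author did publish a checksum for the model or binary, verifying a downloaded file against it could look like the sketch below. It assumes a SHA-256 hex digest is available from a trusted channel; no such checksum is provided for this skill today, and the function names are hypothetical.

```python
import hashlib
import hmac


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path: str, expected_hex: str) -> bool:
    """Compare the file's digest with a published one (assumed SHA-256 hex)."""
    return hmac.compare_digest(sha256_of(path), expected_hex.strip().lower())
```

Reading in chunks keeps memory use flat for large model files, and a mismatch should be treated as a reason to stop rather than retry.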

Tags: Apple Silicon, Mac mini, MacBook, asr, audio, darwin, glm, glm-asr, glm-asr-nano-2512, glm-asr-nano-2512-8bit, latest, macOS, mlx, mlx-audio, speech-to-text, stt

Runtime requirements

🦞 Clawdis
OS: macOS
Bins: brew
