Install
openclaw skills install pixel-lobsterPixel art desktop lobster that lip-syncs to OpenClaw TTS speech. Use when: (1) user wants a visual avatar for their AI agent, (2) user wants a desktop overla...
openclaw skills install pixel-lobsterA transparent desktop overlay featuring a pixel art lobster that animates when your OpenClaw agent speaks. Powered by envelope data from your local TTS server — the lobster's mouth only moves during AI speech, not music or system audio.
The app is fully bundled inside this skill — no external repository clone required.
npx availableGET /audio/envelope (XTTS on port 8787, or any OpenAI-compatible TTS via the OpenClaw TTS proxy)The app is included in this skill at <skill_dir>/app/. Install dependencies once:
cd <skill_dir>/app
npm install
Edit <skill_dir>/app/config.json before launching. Key settings:
| Key | Default | Description |
|---|---|---|
audioMode | "tts" | "tts" reacts only to TTS speech; "system" captures all audio output |
ttsUrl | "http://127.0.0.1:8787" | Base URL of your TTS server |
monitor | "primary" | "primary", "secondary", "left", "right", or display index |
lobsterScale | 4 | Sprite scale (4 = 480px tall lobster) |
clickThrough | false | Start with click-through mode on so the lobster doesn't block clicks |
swimEnabled | true | Enable swimming animation |
cd <skill_dir>/app
npx electron .
Or use the included helper script (handles first-run npm install automatically):
bash <skill_dir>/scripts/launch.sh
| Key | Action |
|---|---|
| F8 | Move window to next monitor (cycles through all displays) |
| F9 | Toggle click-through mode |
| F12 | Toggle DevTools |
With OpenClaw and a local XTTS server, set audioMode to "tts" and point ttsUrl at your XTTS instance. The lobster polls the envelope endpoint at 45ms intervals during active speech and 500ms when idle — no perceptible CPU cost.
If you use the OpenClaw TTS proxy (port 8788), point ttsUrl at port 8787 (the XTTS server directly), not the proxy — the envelope endpoint is on the TTS server, not the proxy layer.
If the mouth movement is ahead of or behind the audio:
ttsPlayStartOffsetMs (default 1100ms)ttsPlayStartOffsetMsThe default is tuned for PowerShell MediaPlayer on Windows. Other playback methods may need adjustment.
Six visemes drive natural speech animation:
Plus X (closed) for silence and pauses. Spring physics and variety enforcement prevent robotic repetition.