Install
openclaw skills install whatsapp-ultimateYou put 5 agents in a WhatsApp group. They all respond at once. Your API bill does a backflip. Protocol v2 fixes that — congestion control, conversation lifecycle, and budget-aware scheduling. Agents that know when to talk, when to shut up, and when to burn your unused tokens before reset.
openclaw skills install whatsapp-ultimateEverything you can do in WhatsApp, your AI agent can do too.
This skill documents all WhatsApp capabilities available through OpenClaw's native channel integration. No external Docker services, no CLI wrappers — just direct WhatsApp Web protocol via Baileys.
openclaw whatsapp login)| Category | Features |
|---|---|
| Messaging | Text, media, polls, stickers, voice notes, GIFs |
| Interactions | Reactions, replies/quotes, edit, unsend |
| Groups | Create, rename, icon, description, participants, admin, invite links |
| History | Full-text search, vCard contact extraction with phone numbers |
Total: 22 distinct actions
message action=send channel=whatsapp to="+34612345678" message="Hello!"
message action=send channel=whatsapp to="+34612345678" message="Check this out" filePath=/path/to/image.jpg
Supported: JPG, PNG, GIF, MP4, PDF, DOC, etc.
message action=poll channel=whatsapp to="+34612345678" pollQuestion="What time?" pollOption=["3pm", "4pm", "5pm"]
message action=sticker channel=whatsapp to="+34612345678" filePath=/path/to/sticker.webp
Must be WebP format, ideally 512x512.
message action=send channel=whatsapp to="+34612345678" filePath=/path/to/audio.ogg asVoice=true
Critical: Use OGG/Opus format for WhatsApp voice notes. MP3 may not play correctly.
message action=send channel=whatsapp to="+34612345678" filePath=/path/to/animation.mp4 gifPlayback=true
Convert GIF to MP4 first (WhatsApp requires this):
ffmpeg -i input.gif -movflags faststart -pix_fmt yuv420p -vf "scale=trunc(iw/2)*2:trunc(ih/2)*2" output.mp4 -y
message action=react channel=whatsapp chatJid="34612345678@s.whatsapp.net" messageId="ABC123" emoji="🚀"
message action=react channel=whatsapp chatJid="34612345678@s.whatsapp.net" messageId="ABC123" remove=true
message action=reply channel=whatsapp to="34612345678@s.whatsapp.net" replyTo="QUOTED_MSG_ID" message="Replying to this!"
message action=edit channel=whatsapp chatJid="34612345678@s.whatsapp.net" messageId="ABC123" message="Updated text"
message action=unsend channel=whatsapp chatJid="34612345678@s.whatsapp.net" messageId="ABC123"
message action=group-create channel=whatsapp name="Project Team" participants=["+34612345678", "+34687654321"]
message action=renameGroup channel=whatsapp groupId="123456789@g.us" name="New Name"
message action=setGroupIcon channel=whatsapp groupId="123456789@g.us" filePath=/path/to/icon.jpg
message action=setGroupDescription channel=whatsapp groupJid="123456789@g.us" description="Team chat for Q1 project"
message action=addParticipant channel=whatsapp groupId="123456789@g.us" participant="+34612345678"
message action=removeParticipant channel=whatsapp groupId="123456789@g.us" participant="+34612345678"
message action=promoteParticipant channel=whatsapp groupJid="123456789@g.us" participants=["+34612345678"]
message action=demoteParticipant channel=whatsapp groupJid="123456789@g.us" participants=["+34612345678"]
message action=leaveGroup channel=whatsapp groupId="123456789@g.us"
message action=getInviteCode channel=whatsapp groupJid="123456789@g.us"
Returns: https://chat.whatsapp.com/XXXXX
message action=revokeInviteCode channel=whatsapp groupJid="123456789@g.us"
message action=getGroupInfo channel=whatsapp groupJid="123456789@g.us"
Returns: name, description, participants, admins, creation date.
WhatsApp uses JIDs (Jabber IDs) internally:
| Type | Format | Example |
|---|---|---|
| Individual | <number>@s.whatsapp.net | 34612345678@s.whatsapp.net |
| Group | <id>@g.us | 123456789012345678@g.us |
When using to= with phone numbers, OpenClaw auto-converts to JID format.
Always use OGG/Opus format:
ffmpeg -i input.wav -c:a libopus -b:a 64k output.ogg
Convert images to WebP stickers:
ffmpeg -i input.png -vf "scale=512:512:force_original_aspect_ratio=decrease,pad=512:512:(ow-iw)/2:(oh-ih)/2:color=0x00000000" output.webp
WhatsApp has anti-spam measures. Avoid:
To react/edit/unsend, you need the message ID. Incoming messages include this in the event payload. For your own sent messages, the send response includes the ID.
| Feature | whatsapp-ultimate | wacli | whatsapp-automation | gif-whatsapp |
|---|---|---|---|---|
| Native integration | ✅ | ❌ (CLI) | ❌ (Docker) | N/A |
| Send text | ✅ | ✅ | ❌ | ❌ |
| Send media | ✅ | ✅ | ❌ | ❌ |
| Polls | ✅ | ❌ | ❌ | ❌ |
| Stickers | ✅ | ❌ | ❌ | ❌ |
| Voice notes | ✅ | ❌ | ❌ | ❌ |
| GIFs | ✅ | ❌ | ❌ | ✅ |
| Reactions | ✅ | ❌ | ❌ | ❌ |
| Reply/Quote | ✅ | ❌ | ❌ | ❌ |
| Edit | ✅ | ❌ | ❌ | ❌ |
| Unsend | ✅ | ❌ | ❌ | ❌ |
| Group create | ✅ | ❌ | ❌ | ❌ |
| Group management | ✅ (full) | ❌ | ❌ | ❌ |
| Receive messages | ✅ | ✅ | ✅ | ❌ |
| Two-way chat | ✅ | ❌ | ❌ | ❌ |
| External deps | None | Go binary | Docker + WAHA | ffmpeg |
Version 4.0 introduces a complete multi-agent framework for WhatsApp group chats. Multiple AI agents can discuss topics, debate ideas, and converge on conclusions — all with built-in safeguards against message explosion.
Each agent gets its own personality, icon, and (optionally) model:
channels:
whatsapp:
agentIcon: "🤖" # single-agent icon prefix
turnEndMarker: "⚡" # end-of-turn marker in 1:1 chats
multiAgent:
mainAgentId: "jarvis"
agents:
jarvis:
id: "jarvis"
name: "Jarvis"
icon: "🤖"
luna:
id: "luna"
name: "Luna"
icon: "🌙"
model: "sonnet"
rex:
id: "rex"
name: "Rex"
icon: "🦖"
model: "haiku"
Agent personalities live in the workspace:
workspace/
├── SOUL.md # main agent
├── agents/
│ ├── luna/SOUL.md # Luna's personality
│ └── rex/SOUL.md # Rex's personality
Register WhatsApp groups where agents discuss freely (no trigger prefix needed):
intraAgentChats:
brainstorm:
chatId: "120363424201898007@g.us"
participants: ["jarvis", "luna", "rex"]
owner: "oscar"
mode: "broadcast" # broadcast | addressed | round-robin
Routing modes:
Prevents N agents from all responding simultaneously:
congestion:
enabled: true
baseDelayFactor: 150 # ms × agentCount² base delay
maxDelay: 30000 # 30s cap
backpressureThreshold: 1.5 # slow down over-talkers
windowMs: 60000 # 60s sliding window
How it works:
Agents detect when discussions go stale and know when to wrap up:
lifecycle:
stalenessWindow: 5 # compare last N messages
stalenessThreshold: 0.85 # cosine similarity trigger
maxTurnsPerObjective: 30 # hard cap
autoClose: true
Features:
Adjusts conversation depth based on API usage and reset timing:
budget:
provider: "anthropic"
windowDays: 7
burnModeEnabled: true
burnTriggerHours: 24 # hours before reset
burnUsageThreshold: 0.20 # usage below 20%
Four modes:
| Mode | When | Congestion | Staleness | Max Turns | Tangents |
|---|---|---|---|---|---|
| Conservative | >85% used | 2× slower | 0.80 | ½ | No |
| Moderate | 60-85% | Normal | 0.85 | Normal | No |
| Aggressive | <60% | 0.7× faster | 0.85 | Normal | Yes |
| Burn | <20% used, <24h to reset | 0.3× faster | 0.95 | 2× | Encouraged |
Burn mode philosophy: unused tokens expire at reset. Better to have emergent agent-agent discussions than waste the budget.
Protocol v2 extends triggerPrefix to DMs (previously groups only):
In 1:1 chats (selfChat or owner-only DM), append a visual marker to signal turn completion:
channels:
whatsapp:
turnEndMarker: "⚡"
agentIcon config for outbound message prefixingvcard field with names and phone numberscontactsArrayMessage support — multi-contact shares are now parsedtext_content for full-text search (e.g. search by phone number)raw_json now included in search results for contact-type messages, enabling vCard extraction from historical data@lid and @s.whatsapp.net JID formatsresolveChatJids() cross-references chats, contacts, and messages tables to discover all JID aliases for a given chat filterYour Agent
↓
OpenClaw message tool
↓
WhatsApp Channel Plugin
↓
Baileys (WhatsApp Web Protocol)
↓
WhatsApp Servers
No external services. No Docker. No CLI tools. Direct protocol integration.
👉 https://github.com/globalcaos/tinkerclaw
Clone it. Fork it. Break it. Make it yours.
MIT — Part of OpenClaw