Install
openclaw skills install imsg-mediaFetch iMessage/Messages.app attachments (voice memos and images) and process them — transcribe audio via Silicon Flow ASR (SenseVoiceSmall), and analyze imag...
openclaw skills install imsg-mediaFull iMessage multimedia pipeline:
imsg cannot read ~/Library/Messages/chat.db and will return permissionDenied~/.openclaw/.env: SILICON_FLOW_KEY=sk-...--api-key sk-... directly to the scriptimsg CLI: npm install -g imsgActivate this skill when:
Attachment detected?
├── Audio (.m4a / .caf / .wav / .mp3) → transcribe via Silicon Flow ASR
├── Image (.jpg / .png / .heic / .gif) → read with vision model
└── Unknown / not downloaded → increase --limit or ask user to resend
Always read from the message envelope:
[iMessage sender@example.com ...] → use sender@example.com[SMS +1234567890 ...] → use +1234567890# Run from the skill directory
cd ~/.openclaw/skills/imsg-voice-transcribe
python3 scripts/imsg_voice_transcribe.py fetch \
--identifier "sender@example.com" \
--limit 50
Returns JSON with file, type (audio or image), and metadata.
If nothing found, try --limit 100.
# One-liner (fetch + transcribe)
python3 scripts/imsg_voice_transcribe.py auto \
--identifier "sender@example.com" \
--limit 50 --raw
# Or transcribe a specific file
python3 scripts/imsg_voice_transcribe.py transcribe \
--file /path/to/audio.m4a --raw
# Quick test with explicit API key (no env setup needed)
python3 scripts/imsg_voice_transcribe.py transcribe \
--file /path/to/audio.m4a --api-key sk-... --raw
After fetch returns an image path (e.g. {"file": "/path/to/photo.jpg", "type": "image"}):
# Example: fetch image from a sender
python3 scripts/imsg_voice_transcribe.py fetch \
--identifier "sender@example.com" --type image --limit 50
# → {"file": "/Users/.../Messages/Attachments/photo.jpg", "type": "image", ...}
Then in the agent:
sips -s format png "input.heic" --out "output.png"read tool → agent vision model processes itDefault image response format:
| Format | Type | Notes |
|---|---|---|
.m4a | Audio | Standard iMessage voice memo |
.caf | Audio | Older iOS voice memo (AAC in CAF) |
.wav .mp3 | Audio | Other sources |
.jpg .jpeg .png | Image | Standard photos |
.heic .heif | Image | iPhone default — convert to PNG first |
.gif | Image | Animated or static |
| Error | Cause | Fix |
|---|---|---|
permissionDenied | No Full Disk Access | Grant FDA in System Settings |
SILICON_FLOW_KEY not set | Missing API key | Add to ~/.openclaw/.env |
No attachments found | Low limit or iCloud not synced | Increase --limit; ask user to resend |
| Request timed out | Network or large file | Retry; check file < 25MB |
| HEIC not displaying | Format not supported by read | Convert with sips first |