Install
openclaw skills install esp32-cam-eyesSet up ESP32-S3-CAM modules as visual sensors (eyes) for OpenClaw agents. Covers hardware identification, firmware flashing, WiFi configuration, and HTTP cam...
openclaw skills install esp32-cam-eyesGive your OpenClaw agent physical eyes using ESP32-S3-CAM modules.
Each ESP32-CAM module runs a lightweight HTTP server exposing /capture (single JPEG snapshot) and /stream (MJPEG live stream). Once connected to WiFi, the agent can grab images via curl for vision analysis.
pip3 install platformio), pyserial (pip3 install pyserial)ls /dev/cu.usb* (macOS) or ls /dev/ttyUSB* (Linux)curl -o photo.jpg http://<IP>/captureFor the complete step-by-step guide with firmware code, pin definitions, performance benchmarks, and troubleshooting: read references/setup-guide.md.
The sensor model determines your firmware strategy:
| Sensor | PID | Hardware JPEG | Recommended Format |
|---|---|---|---|
| OV2640 | 0x2640 | ✅ Yes | PIXFORMAT_JPEG directly |
| OV5640 | 0x5640 | ✅ Yes | PIXFORMAT_JPEG directly |
| GC2145 | 0x2145 | ❌ No | PIXFORMAT_RGB565 + software frame2jpg() |
If buying new boards, prefer OV2640 — hardware JPEG is significantly faster.
Once flashed and connected:
| Path | Function |
|---|---|
/capture | Single JPEG snapshot |
/stream | MJPEG live stream |
/ | Web UI with stream viewer |
Multiple ESP32-CAMs can join the same WiFi network for multi-angle coverage. Bind fixed IPs via router DHCP reservation to avoid IP changes on reboot.