kami-conflict-detection

Detect physical conflicts (fighting, shoving, scuffling) between 2+ people from RTSP camera streams or local video files. Event-driven mode: the script exits immediately upon detecting a conflict (exit code 10), outputting alert JSON to stdout. OpenClaw reads the alert, reports to the user in chat, then automatically restarts the script for continuous monitoring.

Audits

Pass

ClawScanPass

Agentic behavior and permission review.

Static analysisPass

Pattern checks against bundled files.

VirusTotalPass

Multi-engine malware detections and file reputation.

Install

openclaw skills install kami-conflict-detection

Kami Conflict Detection

Detect physical conflicts (fighting, shoving, scuffling) between 2+ people from RTSP camera streams or local video files. Uses an event-driven architecture where OpenClaw schedules the script in a loop for continuous real-time monitoring.

Execution Architecture (Event-Driven Loop)

┌──────────────────────────────────────────────────────────┐
│                   OpenClaw Scheduling Loop                │
│                                                          │
│  1. OpenClaw starts the script                           │
│     └─ .venv/bin/python conflict_detector_last.py        │
│                                                          │
│  2. Script continuously monitors video stream            │
│     ├─ YOLO: count persons in frame                      │
│     ├─ >= min_persons? Collect multiple frames           │
│     ├─ Send frames to LLM API for conflict analysis      │
│     └─ Conflict detected?                                │
│          ├─ No  → continue monitoring                    │
│          └─ Yes → save video clip, output JSON, exit(10) │
│                                                          │
│  3. OpenClaw reads stdout alert JSON                     │
│     └─ Reports detection result to user in chat window   │
│                                                          │
│  4. OpenClaw automatically restarts script → back to 1   │
└──────────────────────────────────────────────────────────┘

Detection Pipeline

YOLO pre-filter — lightweight person detection to count people in frame (must be >= 2)
Multi-frame collection — collect N frames with configurable time gap
LLM conflict analysis — send frames to Kami detection API for violence/conflict judgment
Event-triggered exit — on conflict detection, save video clip, output alert JSON, exit with code 10

When to Use

Use this skill when the user wants to:

Monitor a camera feed for physical fights or scuffles
Detect shoving, pushing, or violent behavior between people
Run conflict detection on a local video file for testing
Set up automated surveillance alerts for physical altercations

Installation

bash setup.sh

This will:

Detect system Python, create .venv/ virtual environment
Install dependencies: onnxruntime, opencv-python-headless, numpy, requests
Create alerts/ output directory

Idempotent — safe to run repeatedly.

Prerequisites

python3 and python3-venv installed on the system
yolov8s-worldv2.onnx model file in the skill directory
RTSP camera online and network-reachable, OR a local video file for testing
Kami API key (via --kami_api_key or environment variable KAMI_API_KEY). If you don't have one yet, register and obtain a key at: https://kamiclaw-skill.kamihome.com, You can enjoy a free credit limit of 200 credits.
setup.sh has been run at least once

Parameter Confirmation

Before running this skill, confirm the following parameters with the user:

Parameter	Default	Description
`--rtsp_url`	(required)	RTSP camera URL or local video file path
`--kami_api_key`	(required)	Kami API key (also via env `KAMI_API_KEY`). Register at https://kamiclaw-skill.kamihome.com if you don't have one,You can enjoy a free credit limit of 200 credits.
`--yolo_model`	`yolov8s-worldv2.onnx`	YOLO model file path
`--conf_threshold`	`0.25`	YOLO confidence threshold (0.0-1.0)
`--min_persons`	`2`	Minimum person count to trigger LLM analysis
`--sample_interval`	`1.0`	How often (seconds) to run YOLO pre-filter
`--multi_frame_count`	`3`	Number of frames to collect for LLM analysis
`--multi_frame_gap`	`0.5`	Time gap (seconds) between collected frames
`--buffer_seconds`	`30`	Ring buffer duration (seconds) for video clip export
`--clip_before`	`5`	Seconds of video to include before the conflict
`--clip_after`	`5`	Seconds of video to include after the conflict
`--output_dir`	`alerts/`	Directory for saved video clips
`--run_time`	`0`	Max single-round run time in seconds; `0` = unlimited
`--fps`	`15`	Video stream frame rate
`--inbox_file`	`alerts/pending.jsonl`	Alarm inbox file consumed by the heartbeat task to push into the chat window
`--feishu_webhook`	(env `FEISHU_WEBHOOK_URL`)	Feishu custom bot webhook URL — alarms are pushed directly to the user's phone
`--feishu_secret`	(env `FEISHU_WEBHOOK_SECRET`)	Feishu webhook signing secret (only if the bot has signing enabled)

Ask the user: do any parameters need to be changed?

Feishu push setup

Create a Feishu custom bot (自定义机器人) in the target group chat, copy its webhook URL, then either:

export FEISHU_WEBHOOK_URL="https://open.feishu.cn/open-apis/bot/v2/hook/xxxxxxxx"
# Optional, only if the bot has "签名校验" enabled:
export FEISHU_WEBHOOK_SECRET="your_secret_here"

Or pass --feishu_webhook / --feishu_secret on the CLI. When set, every conflict alarm is POSTed to Feishu as an interactive card (title / description / timestamp / clip duration / clip path). If the env var is empty, the Feishu channel is silently skipped — detection still works.

Alarm Delivery Channels (Tri-channel)

Alarms are delivered through three independent, redundant channels so the user is notified even if one path fails:

#	Channel	Trigger	Consumer	Guarantee
1	stdout JSON + exit(10)	On every conflict	OpenClaw reads stdout and reports in chat window, then restarts the script	Primary chat push (OpenClaw loop)
2	Inbox file `alerts/pending.jsonl`	On every conflict (appended)	Heartbeat task in `space/HEARTBEAT.md` polls the file and proactively pushes unreported alarms into chat	Fallback chat push (survives OpenClaw glitches)
3	Feishu webhook (interactive card)	On every conflict (if `FEISHU_WEBHOOK_URL` set)	User's phone via Feishu bot	Instant phone push, regardless of LLM / chat state

Channels 2 and 3 mirror the design already proven in kami-suspicious-person. Channel 1 remains the canonical driver of the OpenClaw detect-report-restart loop.

OpenClaw Execution Instructions

OpenClaw should execute this skill using the following loop logic:

Step 1: Start Detection

.venv/bin/python conflict_detector_last.py \
  --rtsp_url <USER_RTSP_URL> \
  --kami_api_key <API_KEY> \
  [other params]

Step 2: Handle Exit Code

Exit Code	Meaning	OpenClaw Action
`0`	Normal exit (video ended, timeout, no event)	Report "monitoring session ended normally" to user. Optionally restart.
`10`	Event detected — conflict alert JSON on stdout	Parse stdout JSON, report alert details to user in chat, then immediately restart the script.
`1`	Runtime error	Report error to user, check logs at `conflict_detector.log`.

Step 3: Report to User

When exit code is 10, parse the stdout JSON and report to the user like:

🥊 Physical Conflict Alert A physical conflict has been detected: Two people are engaged in a physical altercation. Video clip saved to alerts/conflict_20250115_143022.mp4 (10s). Please review and take appropriate action.

Restarting monitoring...

Step 4: Restart

After reporting, immediately restart the script with the same parameters to continue monitoring.

Complete Loop Example

OpenClaw:
  1. Run: .venv/bin/python conflict_detector_last.py --rtsp_url rtsp://... --kami_api_key ...
  2. Wait for process to exit
  3. If exit_code == 10:
       - Read stdout JSON
       - Display alert message to user in chat
       - Go to step 1 (restart)
     If exit_code == 0:
       - Inform user: "Monitoring session ended normally, no conflicts detected"
       - Ask user if they want to restart
     If exit_code == 1:
       - Report error, show log content

Usage

# Initialize environment (first time only)
bash setup.sh

# Run with RTSP stream
.venv/bin/python conflict_detector_last.py \
  --rtsp_url rtsp://127.0.0.1/live/YOUR-STREAM-ID \
  --kami_api_key YOUR-API-KEY

# Run with local video file (for testing)
.venv/bin/python conflict_detector_last.py \
  --rtsp_url /path/to/test_video.mp4 \
  --kami_api_key YOUR-API-KEY

# Custom parameters
.venv/bin/python conflict_detector_last.py \
  --rtsp_url rtsp://127.0.0.1/live/YOUR-STREAM-ID \
  --kami_api_key YOUR-API-KEY \
  --min_persons 2 \
  --clip_before 10 \
  --clip_after 10 \
  --run_time 3600

Output Format (stdout JSON)

When a conflict is detected (exit code 10), stdout contains:

{
  "alert": "conflict_detected",
  "timestamp": "2025-01-15T14:30:22.123456",
  "description": "Two people are engaged in a physical altercation",
  "video_clip": "alerts/conflict_20250115_143022.mp4",
  "clip_duration": "10s",
  "message": "Warning: Physical conflict detected. Two people are engaged in a physical altercation. Video clip saved to alerts/conflict_20250115_143022.mp4. Please review and take appropriate action."
}

Field	Type	Description
`alert`	string	Always `"conflict_detected"`
`timestamp`	string	ISO format timestamp of the alert
`description`	string	LLM-generated description of the conflict
`video_clip`	string	Path to the saved video clip
`clip_duration`	string	Total duration of the saved clip
`message`	string	Pre-formatted alert message for direct display

Exit Codes

Code	Meaning
`0`	Normal exit (run time exceeded, video ended, user interrupt, no event detected)
`10`	Event detected — conflict alert, JSON output on stdout
`1`	Runtime error (model not found, stream failure, missing API key)

Architecture

RTSP/File → FrameGrabber thread (ring buffer + latest frame)
                ├─ All frames → ring buffer (last 30s)
                └─ Latest frame → main thread (every 1s)
                      ├─ YOLO: person count >= 2?
                      │    ├─ No → skip, clear frame buffer
                      │    └─ Yes → collect N frames (gap=0.5s)
                      │              → LLM conflict analysis
                      │              ├─ No conflict → continue
                      │              └─ Conflict detected:
                      │                   → wait clip_after seconds
                      │                   → export video clip from ring buffer
                      │                   → output JSON to stdout
                      │                   → exit(10)
                      └─ Timeout → exit(0)

OpenClaw drives the loop:
  run script → wait for exit → parse stdout → report to user → re-run script

Launch Checklist (Dual-channel Active Push)

This skill pushes every alarm through three redundant channels (see Alarm Delivery Channels above): stdout+exit(10), inbox file, and Feishu webhook. Before launching the detector, the agent MUST verify:

FEISHU_WEBHOOK_URL env var is set (or --feishu_webhook is passed). If not, warn the user that phone push is disabled — chat-window push still works via stdout+exit(10) and the inbox file.
alerts/ directory is writable so pending.jsonl can be appended.
space/HEARTBEAT.md still contains the conflict-detection inbox task; on every heartbeat, read kami-conflict-detection/alerts/pending.jsonl, if non-empty send a proactive summary to the user then MOVE (not delete) the file to alerts/consumed/<timestamp>.jsonl. If empty, reply HEARTBEAT_OK.
Previous run's alerts/pending.jsonl has been consumed (or archived) to avoid re-reporting stale alarms.

Never rely on tailing stdout to report alarms — the agent is only invoked on user messages / heartbeats, so real-time stdout watching is impossible.

Strict Rules (MUST Follow)

RULE: Alarms flow via (a) stdout+exit(10) → OpenClaw → chat, (b) inbox file → heartbeat → chat, (c) Feishu webhook → user's phone. Never rely solely on one channel.
RULE: Every heartbeat consumes alerts/pending.jsonl; non-empty → proactive chat summary; empty → HEARTBEAT_OK.
RULE: Consumed alarms are MOVED to alerts/consumed/, not deleted.
RULE: Verify FEISHU_WEBHOOK_URL before launch; warn the user if missing (phone push disabled, chat push still active).
RULE: On exit code 10, OpenClaw MUST restart the script immediately to continue monitoring.

Troubleshooting

Virtual environment not found → Run bash setup.sh

Model file missing → Place yolov8s-worldv2.onnx in the skill directory

RTSP connection failure → Verify camera is online, check --rtsp_url, confirm network connectivity

LLM API failure → Check --kami_api_key is correct, or set via export KAMI_API_KEY=your-key. If you don't have a key, register at https://kamiclaw-skill.kamihome.com, You can enjoy a free credit limit of 200 credits. Verify network access to the detection API endpoint.

No alerts generated → Check conflict_detector.log for details. Common causes: fewer than 2 people in frame, confidence threshold too high (try lowering --conf_threshold).

Script exits immediately with code 1 → Check log for error details. Common causes: model file missing, RTSP URL unreachable, API key not set.

OpenClaw restart loop too fast → If the script keeps detecting events immediately on restart, consider increasing --sample_interval or adding a brief delay between restarts.