Install
openclaw skills install @ygq19901001/biobrain-cron-health-monitorProactive cron job health monitoring, failure detection, and auto-repair delegation. Triggers: 'cron failed', 'cron health', 'fix cron', 'consecutive errors'
openclaw skills install @ygq19901001/biobrain-cron-health-monitorDetect, diagnose, and repair cron job failures before they cascade.
A cron that fails silently is worse than no cron at all. Monitor consecutiveErrors — it's the heartbeat of your automation.
cron(action="list")
For each job, check:
state.consecutiveErrors — 0 is healthy, 1 needs attention, 2+ is criticalstate.lastRunStatus — "ok" or "error"state.lastError — the actual error messagestate.lastDurationMs — anomalies in execution time| consecutiveErrors | Severity | Action |
|---|---|---|
| 0 | Healthy | No action |
| 1 | Watch | Log, monitor next run |
| 2 | Warning | Diagnose root cause |
| 3+ | Critical | CEO intervention, consider disabling |
Pattern A: "Message failed"
message tool fails in isolated sessions (missing auth/feishu config)sessions_send to main insteadcron(action="update", jobId="xxx", patch={payload: {message: "...use sessions_send to main, NOT message tool..."}})Pattern B: "LLM request failed"
Pattern C: "Write failed"
{workspace_root_dir}/<task-summary>.md may not resolve in isolated sessionsPattern D: Timeout
timeoutSeconds or simplify the taskPattern E: Silent success but no output
Run weekly (e.g., Friday 22:00):
{
"name": "Weekly Cron Audit",
"schedule": {"kind": "cron", "expr": "0 22 * * 5", "tz": "Asia/Shanghai"},
"sessionTarget": "isolated",
"payload": {
"kind": "agentTurn",
"message": "Weekly cron audit: List all cron jobs. Flag any with consecutiveErrors > 0. For each failure, diagnose root cause and suggest fix. Output summary to company/shared/alerts/ALERT-WEEKLY-AUDIT-YYYYMMDD.md."
}
}
When a failure is detected, delegate repair to the appropriate department:
{
"name": "DataCenter · Fix [failing-cron]",
"schedule": {"kind": "at", "at": "2026-01-01T10:00:00+08:00"},
"sessionTarget": "isolated",
"deleteAfterRun": true,
"payload": {
"kind": "agentTurn",
"message": "You are DataCenter. Fix cron [jobId]: [error description]. Diagnose, fix, verify. Output: ✅ Fixed / ❌ Needs CEO."
}
}
Set deleteAfterRun: true for one-shot repair tasks.