Install
openclaw skills install coe-root-causeRun a Correction of Error root-cause analysis for recurring failures, false success, missed work, data loss, and brittle automation.
openclaw skills install coe-root-causeUse when the user asks for a COE, Correction of Error, postmortem, root-cause analysis, "why did this recur", "what was missed", or "do not let this happen again".
The job is to explain the mechanism that allowed the failure, fix the mechanism where possible, and prove the same class of failure is harder to repeat.
Use this skill for a formal post-failure Correction of Error: a recurring failure, false success, missed work, data loss, brittle automation, or user-visible miss that needs a written record with impact, timeline, root cause, corrective actions, and verification.
Adjacent skills keep their narrower jobs:
Classify the failure before changing anything. Name the primary failure mode:
Then identify evidence-backed contributing conditions:
If an optional diagnostic failure hides whether required work happened, reclassify it as false success, incomplete work, or visible failure. Do not let "optional" obscure the primary task.
Collect the smallest packet that explains the failure:
State uncertainty plainly. Do not bury the answer in unrelated logs.
Bad root causes:
Good root causes identify a durable fix: a test, validator, workflow gate, ownership boundary, safer default, clearer skill instruction, or explicit blocked-state receipt.
For each action, include:
Prefer class-level safeguards over one-off cleanup.
Before saying the COE is complete, run the smallest credible verification:
If a gate cannot run, say why and what evidence substitutes for it.
# COE: <failure name>
Date: <date>
Status: done | planned | blocked
Severity: low | medium | high
## Summary
One short paragraph: what failed, why it mattered, and what changed.
## Impact
- Who or what was affected
- What was wrong or missing
- What was not affected
## Timeline
- <time/order>: <event>
## Failure Classification
Failure mode: <primary failure mode from Failure Classification and why>
Contributing conditions: <supported conditions, or unknown with missing evidence>
## Evidence
- <source or command>: <what it proves>
## Root Cause
### 5+ Whys
1. Why? ...
### Root Cause Statement
<mechanism, not blame>
## Corrective Actions
| Action | Status | Verification |
| ------ | -------------------- | ------------ |
| ... | done/planned/blocked | ... |
## Verification
- <gate>: <result>
## Residual Risk
<what could still fail and how it will be noticed>
Lead with the root cause and verified fix. Keep the user-facing summary short. If anything remains open, say exactly what evidence is still missing.