Install
openclaw skills install @deciqai/dunning-krugerActivate when: someone says 'how hard could it be' about an unfamiliar domain; a novice dismisses expert opinion or says 'I could do that'; someone's self-assessed confidence seems disconnected from their actual track record; a hiring or promotion decision is driven by self-presentation; feedback loops are absent and someone is operating on gut confidence alone. Do NOT activate when: the person is a known expert with an established external track record; the confidence claim is in a domain with tight, recent feedback loops that already calibrate performance.
openclaw skills install @deciqai/dunning-krugerThe Dunning-Kruger effect is the systematic self-assessment asymmetry demonstrated by Kruger & Dunning (1999): bottom-quartile performers overestimate rank by ~50 percentile points; top-quartile performers underestimate by ~5 points. Mechanism: the cognitive skills needed to perform a task are the same ones needed to evaluate performance — so novices lack the metacognition to see their own gap. The corrective is external measurement and feedback, not internal vigilance.
Composes with metacognition, probabilistic-thinking, critical-thinking, and confirmation-bias.
Not when: person is a known expert with an external track record; domain has tight, recent feedback loops that already calibrate performance; high-confidence claim is self-deprecating (actual metacognition signal).
In Coach mode, respond one step at a time. Each [WAIT] is a hard stop — output only that step's question, then stop.
[WAIT — do not advance until user responds]
[WAIT — do not advance until user responds]
[WAIT — do not advance until user responds]
Step 1 — State the claim: Domain · verbatim claim · self-assessed percentile · basis for assessment · stakes.
Step 2 — Test metacognitive prerequisite: Can the person evaluate others' performance in this domain? Have they received external feedback? Can they articulate what failure looks like?
Step 3 — Get external data: Objective metrics · blind peer comparison · expert evaluation · track record · calibration test (predict score → take test → compare).
Step 4 — Diagnose quartile pattern: Bottom+high → novice overconfidence · Top+low → impostor · Top+high → calibrated expertise · Middle+accurate → no intervention.
Step 5 — Design intervention: Calibration test / blind peer comparison / structured rubric eval / training+feedback / real-stakes performance test.
Step 6 — Re-measure: Baseline → 30/90/180-day schedule → define what change indicates genuine improvement.
Dunning-Kruger Diagnosis: <person/domain>
Claim: domain | self-assessed percentile | basis | stakes
Metacognitive test: evaluates others (Y/N) | external feedback (Y/N) | articulates failure (Y/N)
External data: metrics | peer rank | expert eval | track record
Quartile diagnosis: self=X | external=Y | gap=Z pts | pattern=<novice-OC/calibrated/impostor/mid-OK>
Intervention: <method> | Owner: | Re-measure:
→ Method in Action: Kruger and Dunning's 1999 Cornell Studies
| Domain | Overconfidence signal | Countermeasure |
|---|---|---|
| Programming | "I could build this in a weekend" | Code test; pair programming |
| Investing | "I beat the market last year" | Risk-adjusted return vs benchmark, multi-year |
| Hiring | "I can tell in 5 minutes" | Structured rubric; predictive validity tracking |
| Medical | "I researched online; I know what I have" | Diagnostic test; second opinion |
→ Primary sources: references/sources.md
[D] = designed upfront | [O] = observed in real use. [O] entries are more valuable.
| Fake move | Reality |
|---|---|
| [D] "I'm confident, therefore I know what I'm doing" | Confidence in novices is the signature of the effect. The metacognitive resources that would correct it are absent. |
| [D] "I've been doing this for years" | Years without feedback produce confident incompetence, not expertise. |
| [D] "I read three books on this" | Reading is exposure, not skill. Self-assessment after reading is consistently inflated. |
| [D] "My peers tell me I'm good" | Peers in your cohort have similar skill distributions; cross-cohort expert eval is more diagnostic. |
| [D] "How hard could it be?" | Applied to an untried domain, reliably precedes falsified overconfidence. |
| [D] "I can self-evaluate" | Kruger-Dunning: you cannot self-evaluate without the underlying skill. External measurement required. |
| → Add [O] entries here after each real use — paste the actual failure pattern | What went wrong and why |
Part of deciqAI Knowledge Skills — open-source thinking skills that make rigor executable for AI agents. Built by deciqAI · https://deciqai.com · Contributions welcome — see the template at the repo root.