Install
openclaw skills install @deciqai/goodharts-lawActivate when: our KPI is going up but the real outcome isn't improving; people seem to be gaming the metric; we're about to tie bonuses or promotions to a number; an algorithm is producing results nobody intended; a test or audit system is being designed. Do NOT activate when: the metric IS the goal with no proxy gap; measurement is purely descriptive with zero stakes attached.
openclaw skills install @deciqai/goodharts-lawGoodhart's Law: when a metric controls behavior, people optimize the metric rather than the underlying goal. Formulated by economist Charles Goodhart (1975) on UK monetary policy; sharpened by Marilyn Strathern (1997): "When a measure becomes a target, it ceases to be a good measure." Four failure mechanisms (Manheim & Garrabrant 2018): Regressional, Extremal, Causal, Adversarial. Countermeasure is always multi-metric + audit + rotation.
Composes with feedback-loops, principal-agent, okr-goal-setting, survivorship-bias.
Not when: metric and goal are identical; stakes too low for gaming; metric is purely descriptive with no reward/punishment; question is which metric to use, not whether the measurement-reward system is sound.
In Coach mode, respond one step at a time. Each [WAIT] is a hard stop — output only that step's question, then stop.
[WAIT — do not advance until user responds]
[WAIT — do not advance until user responds]
[WAIT — do not advance until user responds]
Step 1 — State metric and goal: metric being targeted / underlying goal / current proxy-goal correlation / who is measured / stakes.
Step 2 — Predict the gaming: list ≥3 ways to game the metric with minimum effort on the goal. If you can't list 3, you haven't thought hard enough.
Step 3 — Categorize mechanism:
| Mechanism | Test |
|---|---|
| Regressional | Is there noise that optimization will push into? |
| Extremal | Does metric-goal correlation break at extremes? |
| Causal | Is the metric a symptom, not a cause? |
| Adversarial | Will agents actively game with intelligence? |
Step 4 — Choose countermeasure: Regressional → constrain range. Extremal → paired constraint metrics. Causal → closer-to-causation metric + direct audit. Adversarial → multi-metric + randomized audits + rotation. Step 5 — Design the system: primary metric / constraint metric(s) / audit mechanism (sampled direct goal observation) / rotation schedule / separation of measure-for-control from measure-for-diagnosis / gaming-detection threshold. Step 6 — Schedule re-evaluation: independent goal measurement (how/when/who) / drift threshold / retirement criteria / owner.
# Goodhart-Robust Design: <metric>
Metric: | Underlying goal: | Correlation: | Who measured: | Stakes:
Gaming vectors (≥3):
Mechanism: Regressional / Extremal / Causal / Adversarial
Primary metric: | Constraint metric(s): | Audit: | Rotation: | Separation: | Gaming threshold:
Goal measurement (independent): | Drift threshold: | Retirement criteria: | Owner:
→ Method in Action: Goodhart 1975 (M3) and Strathern 1997 (RAE)
| Domain | Common gaming | Defense |
|---|---|---|
| Sales quotas | Sandbagging, channel stuffing, end-of-quarter discounts | Multi-period averaging; quality metrics; clawback |
| Hospital wait targets | Ambulance parking, patient reclassification | Outcome audits; paired metrics; randomized inspection |
| Standardized testing | Teaching to test, curriculum narrowing | Sample-based assessment; multi-measure; reduce single-test stakes |
| Algorithmic engagement | Clickbait, outrage, misinformation | Multi-objective optimization; quality + harm constraints |
→ Primary sources: references/sources.md
| Fake move | Reality |
|---|---|
| [D] "If you can't measure it, you can't manage it" | Often false. Judgment, trust, and direct observation are also valid management tools. |
| [D] "Our metric is well-defined; it won't be gamed" | Precision invites precise gaming. Basel II capital ratios were well-defined — extensively gamed. |
| [D] "Our people wouldn't game the metric" | Goodhart's law is structural; individual virtue is insufficient in aggregate. |
| [D] "We just need a better metric" | Often the issue is any single metric under pressure; fix is multi-metric + audit. |
| [D] "We've used this metric for years" | Long use = more time for gaming to mature. Tenure is a warning, not an endorsement. |
| → Add [O] entries here after each real use — paste the actual failure pattern | What went wrong and why |
[D] = designed upfront | [O] = observed in real use. [O] entries are more valuable.
Part of deciqAI Knowledge Skills — open-source thinking skills that make rigor executable for AI agents. Built by deciqAI · https://deciqai.com · Contributions welcome — see the template at the repo root.