Install
openclaw skills install control-mirrorAudits agent and system architecture through a control theory lens for stability, feedback, noise, delay, oscillation, error control, adaptive behavior, and...
openclaw skills install control-mirrorControl Mirror turns the core ideas of Qian Xuesen's Engineering Cybernetics into a practical architecture review skill.
It is not a generic architecture review checklist. It treats a system as a controlled dynamic system and asks one brutal question:
Is this system becoming more controllable, stable, observable, and adaptive — or merely more complicated?
Use it as three tools at once:
Use this skill when the user wants to review, diagnose, or evolve any system architecture, especially:
Do not use this as a generic “is the code clean?” review. Use it when the key question is system behavior over time.
Do not start by asking “how many features does the system have?”
Start with:
If you cannot map these, the review is still too superficial.
Map the user's architecture into this structure:
| Cybernetics concept | Architecture equivalent |
|---|---|
| Reference input | User goal, SLA, product target, policy, task objective |
| Controller | Scheduler, orchestrator, agent, workflow engine, governance layer |
| Controlled object | Codebase, service, team process, model provider, external environment |
| Actuator | Tool calls, deployments, writes, API calls, workflow transitions |
| Sensor | Tests, logs, metrics, review, cost reports, human feedback, evals |
| Feedback | Signals that change later routing, budget, memory, workflow, or policy |
| Noise | Irrelevant context, stale memory, bad retrieval, flaky logs, misleading metrics |
| Delay | Async queues, slow tools, stale state sync, delayed human review |
| Error | Difference between target and actual state |
| Damping | Rate limits, compression, budget caps, confirmations, backoff, gates |
| Saturation | Token limits, API limits, budget limits, human attention limits |
| Stability boundary | Conditions where automation must pause, degrade, or ask for confirmation |
Produce a concise control-loop map:
Target → Controller → Actuator → Controlled object → Sensor → Feedback → Controller
Then identify:
Look for the highest-risk 3-5 patterns only. Do not dump a huge checklist.
Symptoms:
Control diagnosis: the system lacks effective feedback.
Symptoms:
Control diagnosis: error is amplified instead of damped.
Symptoms:
Control diagnosis: feedback exists, but damping and convergence criteria are weak.
Symptoms:
Control diagnosis: the feedback loop has harmful latency.
Symptoms:
Control diagnosis: the sensor layer lacks filtering.
Symptoms:
Control diagnosis: there is no bounded-error mechanism.
Symptoms:
Control diagnosis: adaptation is not closed-loop.
Symptoms:
Control diagnosis: constraints are not part of the controller.
Only count an architectural strength if it improves controllability.
Good strengths include:
Do not praise “many modules”, “many agents”, or “complex workflows” unless they improve stability.
Use this 0-5 scale:
| Level | Name | Meaning |
|---|---|---|
| 0 | Open loop | Executes without reliable feedback |
| 1 | Human-corrected loop | Humans catch and correct most errors |
| 2 | Verified loop | Tests/reviews/metrics gate some actions |
| 3 | Damped closed loop | Budgets, gates, retries, and fallbacks reduce instability |
| 4 | Adaptive closed loop | Historical outcomes tune future routing/policy/budget |
| 5 | Self-evolving controlled system | The system improves its own procedures while preventing drift and pollution |
Give a level with one sentence of evidence.
Prioritize only 1-3 actions.
Use this priority logic:
When the user wants a rigorous review, include this table:
| Dimension | Score / 5 | What to check |
|---|---|---|
| Feedback completeness | Does output affect future decisions? | |
| Stability / convergence | Does the system stop oscillating and converge? | |
| Noise filtering | Are irrelevant/stale signals filtered before decisions? | |
| Delay handling | Are stale/late signals detected and handled? | |
| Error control | Are small errors bounded before they cascade? | |
| Damping / resource control | Are cost/token/API/human limits part of control? | |
| Observability | Can humans see why decisions were made? | |
| Adaptation | Do historical outcomes tune future behavior? | |
| Safety boundary | Does automation pause on irreversible/high-risk actions? |
Then summarize:
Control maturity level: L0-L5
Strongest control loop: ...
Weakest control loop: ...
Highest-risk instability: ...
Next P0 action: ...
Use this default structure:
One concise paragraph or diagram mapping the system into controller / controlled object / feedback / noise / delay / constraints.
List the most important 3-5 problems.
For each:
Only list strengths that improve controllability, stability, observability, or adaptation.
Give 1-3 prioritized actions:
Include when the review is non-trivial or when the user asks for scoring.
Check:
Check:
Check:
Check:
Check:
Use this section only when reviewing AgentOS, multi-agent platforms, OpenClaw-like systems, LLM orchestration, or autonomous workflow engines.
User goal / task queue
→ execution kernel / orchestrator
→ complexity scoring + routing policy
→ model / agent / tool execution
→ tests / review / cost / logs / user feedback
→ memory / metrics / SOP candidates
→ next routing, workflow, budget, or policy decision
| Dimension | Score / 5 | Deduct when... |
|---|---|---|
| Default entrypoint unity | APIs/tools bypass the kernel/orchestrator | |
| Model routing adaptiveness | routing is keyword-only or ignores history/cost/health | |
| Token damping | noisy outputs enter context uncompressed | |
| Workflow gate strength | steps can advance without required evidence | |
| Memory pollution resistance | temporary noise is written into long-term memory | |
| Feedback-to-policy loop | failures/costs are logged but do not affect future decisions | |
| Explainability | decisions lack request id, factors, or human-readable reason | |
| Safety boundary | high-risk automation has no pause/confirm path |
Common AgentOS instability patterns:
Avoid these mistakes:
A good Control Mirror review must be:
If the output does not help the user make the system more controllable, stable, observable, or adaptive, it failed.