Install
openclaw skills install obsidian-harnessProduction-grade Agent Harness combining execution discipline (Superpower), knowledge compounding (CE), and product thinking (Gstack) into a single adaptive...
openclaw skills install obsidian-harnessA unified engineering harness that combines execution discipline, knowledge compounding, and product thinking. Born from 45万字 of real-world AI textbook writing + 9 production incidents.
Agent = Model + Harness. The model provides capability; the harness provides discipline.
Three layers, one workflow:
Before starting any task, assess complexity. This determines which workflow steps to run.
🟢 Simple (bug fix, config change, small tweak)
🟡 Medium (new feature, module, integration)
🔴 Complex (architecture change, multi-module, new system)
When unsure, start at 🟡. Upgrade to 🔴 if you discover hidden complexity. Never downgrade mid-task.
Before writing any code, answer these questions. If any answer is "no" or uncertain, pause and discuss with the user.
Output: A one-paragraph problem statement that the user confirms before proceeding.
Define what you're building before you build it:
Break the spec into atomic tasks:
Execute tasks incrementally. After each task:
Critical rules:
Every deliverable must have evidence, not just "looks good":
| Deliverable type | Required evidence |
|---|---|
| Code change | Tests pass (show output) |
| Config change | Restart + verify (show status) |
| File generation | wc -l + grep key content |
| API integration | Show actual response |
| Documentation | Spot-check 3 claims for accuracy |
Self-review from 5 dimensions:
Pre-ship checklist:
After completing any task (regardless of complexity), spend 30 seconds on:
Only record specific, actionable lessons. Not generic advice like "be more careful".
Good: "Bedrock throttles at >2 concurrent requests to the same model. Use model rotation or serial execution." Bad: "Remember to handle API limits properly."
When you catch yourself thinking any of these, stop and follow the rebuttal:
| Your excuse | Why it's wrong | Do this instead |
|---|---|---|
| "Too simple to need tests" | 40% of P0 incidents come from "too simple" code | Write the test. It takes 2 minutes. |
| "I already checked, looks fine" | Reading ≠ verifying | Run it. ls, wc -l, grep, actual execution. |
| "I'll write tests after the feature is complete" | You won't. Test debt only grows. | Write the test NOW, before moving on. |
| "This old code looks unused, I'll delete it" | Chesterton's Fence: understand before removing | git blame first. Ask why it exists. |
| "It should work" | "Should" is not evidence | Provide logs, output, or data. |
| "Let me refactor this while I'm here" | Scope creep. You weren't asked to refactor. | Do only what was requested. File a separate TODO for the refactor. |
| "I'll handle errors later" | Error handling IS the feature in production | Handle errors now. Happy path without error handling is a prototype. |
| "The context is too long, I'll summarize and skip details" | Skipping details = skipping correctness | Checkpoint to file, compact context, continue with full fidelity. |
When delegating to subagents:
Concurrency limits:
2? Serialize or distribute across regions/models
Task delegation rules:
Failure handling:
sessions_history for the actual error, don't guessFor important deliverables, use an independent verifier:
Protect progress against crashes:
ls shows what's done, not model memorySee references/checkpoint-patterns.md for detailed patterns.
🟢 Simple: Edit → Verify → Done
🟡 Medium: Plan → Build → Test → Review → Done
🔴 Complex: Challenge → Spec → Plan → Build → Test → Review → Ship → Compound