Install
openclaw skills install autooptimise

Autonomously optimise any OpenClaw skill using a benchmark-driven experiment loop. It scores skill outputs 0-10 across 4 dimensions, identifies the lowest-scoring pattern, proposes a targeted SKILL.md change, re-tests, and keeps or discards the change based on measured improvement.

Use when asked to: optimise my [skill] skill, run autooptimise on [skill], benchmark my [skill] skill, improve my skill overnight.

Autonomous benchmark-driven skill optimisation for OpenClaw. Inspired by Andrej Karpathy's autoresearch: the same modify → test → score → keep/discard loop, applied to agent skill quality instead of GPU training.
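The modify → test → score → keep/discard loop described above can be sketched in a few lines of Python. This is only an illustration of the control flow, not the skill's actual implementation: `optimise`, `propose`, and `score` are hypothetical names, and in the real skill the proposal and scoring steps would be carried out by an LLM following runner/run_experiment.md.

```python
from typing import Callable


def optimise(
    skill_text: str,
    propose: Callable[[str], str],   # hypothetical: returns a modified SKILL.md
    score: Callable[[str], float],   # hypothetical: returns a 0-10 benchmark score
    rounds: int = 5,
) -> tuple[str, float, list[dict]]:
    """Keep a proposed change only when it beats the current best score."""
    best_text = skill_text
    best_score = score(best_text)    # baseline measurement before any change
    log = []
    for i in range(rounds):
        candidate = propose(best_text)           # modify
        candidate_score = score(candidate)       # test + score
        kept = candidate_score > best_score      # keep/discard decision
        log.append({"round": i + 1, "score": candidate_score, "kept": kept})
        if kept:
            best_text, best_score = candidate, candidate_score
    return best_text, best_score, log
```

Requiring a strict improvement (`>` rather than `>=`) is one possible design choice: it discards changes that merely tie the baseline, so the skill file does not grow without measured benefit.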
- "optimise my weather skill"
- "run autooptimise on [skill-name]"
- "benchmark my [skill-name] skill"
- "improve my skill overnight"

| File | Purpose |
|---|---|
| benchmark/tasks.json | Test task suite (prompts + expected qualities) |
| benchmark/scorer.md | LLM judge scoring rubric |
| runner/run_experiment.md | Autonomous loop instructions (load this next) |
| runner/experiment_log.md | Auto-created run log (gitignored) |
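The table says the task suite holds prompts plus expected qualities, but the actual schema of benchmark/tasks.json is not shown here. As a rough sketch, assuming the file is a JSON array of objects with hypothetical `prompt` and `expected_qualities` fields, a loader that fails early on malformed tasks might look like this:

```python
import json


def load_tasks(raw: str) -> list[dict]:
    """Parse a task suite and check the ASSUMED fields on every task."""
    tasks = json.loads(raw)
    if not isinstance(tasks, list):
        raise ValueError("task suite must be a JSON array")
    for i, task in enumerate(tasks):
        if not isinstance(task, dict):
            raise ValueError(f"task {i} is not an object")
        # 'prompt' and 'expected_qualities' are illustrative field names,
        # not confirmed by the skill's documentation.
        prompt = task.get("prompt")
        if not isinstance(prompt, str) or not prompt.strip():
            raise ValueError(f"task {i} has no prompt")
        if not isinstance(task.get("expected_qualities"), list):
            raise ValueError(f"task {i} has no expected_qualities list")
    return tasks
```

Validating the suite before a run means a typo in the benchmark shows up as an error rather than as a misleadingly low score.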
runner/run_experiment.md contains the full loop instructions.

Use the best available LLM judge model (prefer a strong reasoning model). Score each task 0–10 on each of the four rubric dimensions.

Full rubric: benchmark/scorer.md
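To make the scoring step concrete, here is a minimal sketch of how per-task judge scores could be rolled up into an overall score and a weakest dimension to target next. The dimension names are left to the caller because the rubric's four dimensions live in benchmark/scorer.md; `summarise` is a hypothetical helper, and treating the lowest-mean dimension as the "lowest-scoring pattern" is an assumption rather than the skill's documented method.

```python
from statistics import mean


def summarise(scores: dict[str, dict[str, float]]) -> tuple[float, str]:
    """scores maps task id -> {dimension name -> 0-10 judge score}.

    Returns (overall mean of per-task means, name of the weakest dimension).
    """
    by_dim: dict[str, list[float]] = {}
    for task_id, dims in scores.items():
        for name, value in dims.items():
            if not 0 <= value <= 10:
                raise ValueError(f"{task_id}/{name}: score {value} outside 0-10")
            by_dim.setdefault(name, []).append(value)
    # Average each task across its dimensions, then average across tasks.
    overall = mean([mean(list(dims.values())) for dims in scores.values()])
    # The dimension with the lowest mean is the candidate for the next change.
    weakest = min(by_dim, key=lambda name: mean(by_dim[name]))
    return overall, weakest
```

Called with two tasks scored on placeholder dimensions, for example `summarise({"t1": {"a": 8, "b": 4}, "t2": {"a": 6, "b": 2}})`, it reports dimension `b` as the weakest.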
benchmark/tasks.json or benchmark/scorer.md during a run.

runner/experiment_log.md.