Install
openclaw skills install nate-clawrankAgent performance scoring system for OpenClaw agents. 7 dimensions scored 0-10, crab-themed tiers, evidence-based, with trajectory tracking. Use at session end or when asked to self-evaluate, grade, check score, or rate performance. Integrates with agent-sync for peer review. Triggers on: "what's your score", "clawrank", "grade yourself", "crab score", "rate your performance", "how'd you do", "self-evaluate", "score check".
openclaw skills install nate-clawrank| Emoji | Range | Tier | What it means |
|---|---|---|---|
| 🦞 | 90-100 | King Crab | User sets the goal and walks away. It's done right. |
| 🦀 | 80-89 | Dungeness | Drives independently. Rare guidance needed. |
| 🦐 | 70-79 | Blue Crab | Solid execution. Still needs direction on approach. |
| 🐚 | 60-69 | Hermit Crab | Delivers when pointed. Doesn't lead yet. |
| 🪸 | 50-59 | Barnacle | Frequent corrections. Repeated mistakes. |
| 🧊 | <50 | Frozen | Fundamental reliability problems. |
Did the agent lead or wait to be told?
Did the work land on the first attempt?
Did the agent keep the user informed without being asked?
Did the agent learn and change behavior — not just acknowledge mistakes?
Did the agent choose the right approach, depth, and timing?
Did the agent solve problems independently?
Did the agent know when to ship, when to push harder, and when to stop?
raw = sum of 7 dimensions (max 70)
final = round(raw / 70 × 100)
## 🦞 ClawRank: XX/100 — [Tier]
| Dimension | Score | Evidence |
|-----------|-------|----------|
| Initiative | X/10 | (one line) |
| Precision | X/10 | (one line) |
| Communication | X/10 | (one line) |
| Growth | X/10 | (one line) |
| Judgment | X/10 | (one line) |
| Resourcefulness | X/10 | (one line) |
| Taste | X/10 | (one line) |
| **Raw** | **XX/70** | |
| **Final** | **XX/100** | |
Track weekly in agent-sync:
## Weekly Trend
| Week | Score | Tier | Delta |
|------|-------|------|-------|
| W1 | 77 | 🦐 Blue Crab | — |
| W2 | 82 | 🦀 Dungeness | +5 |
Reviewer scores independently using the same table.
Final reported score = (self + peer) / 2.
Disagreement >2 on any dimension requires a one-line explanation.