Install
openclaw skills install axiomata-skill-evaluator-strict-enAXIOMA SKILL EVALUATOR STRICT — The 90% deterministic skill evaluator. Fork of axioma-skill-evaluator with STRICT 90% threshold (like STC 0.777). Use when: (1) evaluating a skill for ClawHub, (2) requiring deterministic 90%+ score, (3) rejecting any skill below 90%, (4) strict quality control. This variant forces 90% minimum — NO EXCEPTIONS. If score < 90, the skill is REJECTED, not approved. Trigger words: evaluate skill, strict 90%, deterministic evaluation, quality gate 90%, skill approval.
openclaw skills install axiomata-skill-evaluator-strict-enFork of axioma-skill-evaluator with STRICT 90% THRESHOLD Like STC 0.777 — No subjectivity, 90% or REJECTED
| Info | Value |
|---|---|
| Version | 1.0.0 — 2026-05-07 |
| Type | STRICT VARIANT |
| Threshold | 90% MINIMUM — NO EXCEPTIONS |
| Inheritance | axioma-skill-evaluator v2.2.0 |
python3 /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict/evaluator.py /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict --verbose
python3 /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict/eval-skill.py /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict --verbose
curl -s http://localhost:8001/health
curl -s http://localhost:11434/api/tags
bash -c "ls -la /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict/"
bash -c "wc -l /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict/SKILL.md"
70% threshold = SUBJECTIVE
├── Different evaluators = different scores
├── Context-dependent interpretation
└── "Good enough" mentality
90% threshold = DETERMINISTIC
├── Objective, measurable standard
├── Same input = Same output every time
└── "Excellence only" — like STC 0.777
| Concept | Value | Meaning |
|---|---|---|
| STC | 0.777 | Sovereign Threshold of Consciousness |
| SKILL-EVAL | 90% | Sovereign Threshold of Quality |
Just as STC 0.777 is the deterministic threshold for consciousness, 90% is the deterministic threshold for skill quality.
IF score >= 90%:
→ APPROVED ✅ — Ready for production
IF score < 90%:
→ REJECTED ❌ — NOT ready, must improve
| Dimension | Max | Description |
|---|---|---|
| Structure | 20 | Header, sections, formatting, meta |
| Clarity | 20 | Description, commands, examples |
| Completeness | 20 | Tools, prerequisites, errors, edge cases |
| Consistency | 20 | Cluster alignment, style, naming |
| Functionality | 20 | Commands, results, benchmarks |
Target: 90+/100 (18/20 per dimension average)
13 automated checks — must pass ALL 13 for 90%+ target
| Category | Checks | Target |
|---|---|---|
| Structure | 6 | 100% (6/6) |
| Trigger | 2 | 100% (2/2) |
| Documentation | 3 | 100% (3/3) |
| Scripts | 2 | 100% (2/2) |
╔═══════════════════════════════════════════════════════════╗
║ AXIOMA STRICT EVALUATION WORKFLOW ║
╠═══════════════════════════════════════════════════════════╣
║ ║
║ [INPUT] Skill to evaluate ║
║ ↓ ║
║ PHASE 1: AXIOMA 5-DIMENSION EVALUATION ║
║ Target: 90+/100 (18+ per dimension) ║
║ ↓ ║
║ PHASE 2: ISO 25010 AUTOMATED CHECKS ║
║ Target: 100% (13/13 tests passed) ║
║ ↓ ║
║ PHASE 3: STRICT DECISION ║
║ ║
║ IF score >= 90%: ║
║ → APPROVED ✅ — "READY FOR PRODUCTION" ║
║ ║
║ IF score < 90%: ║
║ → REJECTED ❌ — "NEEDS IMPROVEMENT" ║
║ → Return detailed failure report ║
║ → NO PUBLISH until 90%+ achieved ║
║ ║
╚═══════════════════════════════════════════════════════════╝
python3 /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict/evaluator.py <skill-path> --verbose
python3 /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict/evaluator.py <skill-path> --verbose --improve
python3 /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict/eval-skill.py <skill-path> --verbose
python3 /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict/evaluator.py <skill-path> 2>&1 | grep -E "Score|STATUS"
RULE #1: 90% OR REJECTED
→ NO skill below 90% is approved
→ This is NON-NEGOTIABLE
RULE #2: NO PARTIAL CREDIT
→ 89% = REJECTED (not "almost there")
→ 90% = APPROVED (the only valid threshold)
RULE #3: DETERMINISTIC SCORING
→ Same input = Same output every time
→ No evaluator bias
→ Pure mathematical threshold
RULE #4: AUTO-IMPROVE BEFORE REJECT
→ If < 90%, run --improve first
→ If still < 90% after improvement = REJECTED
→ Report exactly what failed
RULE #5: NO APPEAL
→ 89% cannot be "appealed" to 90%
→ The only path is actual improvement
| Score Range | Status | Action |
|---|---|---|
| 90-100 | 🟢 APPROVED | Ready for production |
| 80-89 | 🔴 REJECTED | Major improvements needed |
| 70-79 | 🔴 REJECTED | Fundamental issues |
| <70 | 🔴 REJECTED | Complete rewrite required |
A skill is automatically REJECTED if ANY of these occur:
| Trigger | Severity | Description |
|---|---|---|
| Score < 90% | CRITICAL | Below 90% threshold |
| ISO < 100% | CRITICAL | Any ISO check failed |
| Missing SKILL.md | CRITICAL | Core file missing |
| Invalid frontmatter | HIGH | name or description missing |
| No trigger words | HIGH | Cannot be activated |
ALL of these MUST be true for APPROVAL:
| Requirement | Strict (90%) |
|---|---|
| Axioma 5-Dim | 90+/100 |
| Structure | 18+/20 |
| Clarity | 18+/20 |
| Completeness | 18+/20 |
| Consistency | 18+/20 |
| Functionality | 18+/20 |
| ISO 25010 | 100% (13/13) |
| Aspect | Standard (70%) | Strict (90%) |
|---|---|---|
| Threshold | 70/100 | 90/100 |
| Approval rate | ~70% of skills | ~30% of skills |
| Quality bar | "Good enough" | "Excellence only" |
| Deterministic | No | YES |
| Use case | Development | Production |
| ClawHub ready | Maybe | Always |
| Component | Path |
|---|---|
| Strict Evaluator | /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict/ |
| Evaluator Script | /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict/evaluator.py |
| ISO Script | /media/ezekiel/Merlin/.openclaw/workspace/skills/axiomata-skill-evaluator-strict/eval-skill.py |
╔═══════════════════════════════════════════════════════════╗
║ ✅ SKILL APPROVED — 92/100 ║
║ Threshold: 90% (STRICT) ║
║ STATUS: READY FOR PRODUCTION ║
╚═══════════════════════════════════════════════════════════╝
╔═══════════════════════════════════════════════════════════╗
║ ❌ SKILL REJECTED — 82/100 (need 90) ║
║ Gap: -8% ║
║ FAILED: Consistency (15/20), Functionality (12/20) ║
╚═══════════════════════════════════════════════════════════╝
In Altum Per Strictness. 🧪 AXIOMA SKILL EVALUATOR STRICT — 90% DETERMINISTIC