Axioma Skill Evaluator

Advanced skill evaluation for OpenClaw agents. Use when: (1) evaluating a skill before publishing, (2) improving a skill based on evaluation results, (3) checking skill quality with automated + manual analysis, (4) any skill audit or quality check. Combines dual evaluation systems: Axioma 5-dimension framework (100 max) with ISO 25010 international framework (25 criteria, 100 max). Features: colorful terminal output, dual evaluation, 25-criteria rubric, self-contained bundled scripts.

Install

openclaw skills install axioma-skill-evaluator

AXIOMA SKILL EVALUATOR 🧙‍♂️

Advanced Skill Evaluation: Dual System (Automated + Manual)

| Info | Value |
|------|-------|
| Version | 2.1.0 (2026-05-07) |
| Status | OPERATIONAL |

1. PURPOSE AND SCOPE

Objective

Provide comprehensive skill evaluation using dual systems:

  • Axioma System (5 dimensions, 100 max) — colorful, fast
  • ISO 25010 System (25 criteria, 100 max) — international standard

When to Use

| Trigger | Action |
|---------|--------|
| Before publishing a skill | Run both evaluations |
| Improving a skill | Get both automated + manual scores |
| Quality audit | Use 25-criteria rubric |
| Pre-publication check | Run all checks |

2. BUNDLED TOOLS

evaluator.py (Axioma System)

# Run Axioma 5-dimension evaluation
python3 evaluator.py <skill-path> --verbose --improve

eval-skill.py (ISO 25010 System)

# Run automated ISO 25010 checks
python3 eval-skill.py <skill-path> --verbose

# JSON output
python3 eval-skill.py <skill-path> --json

3. AXIOMA EVALUATION SYSTEM

Quick Start

python3 evaluator.py <skill-path> --verbose --improve

5 Dimensions (100 max)

| Dimension | Weight | Focus |
|-----------|--------|-------|
| Structure | 20% | Header, sections, formatting, meta |
| Clarity | 20% | Description, instructions, examples |
| Completeness | 20% | Tools, prerequisites, errors, edge cases |
| Consistency | 20% | Style, naming, integration |
| Functionality | 20% | Commands work, expected results |
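As a rough illustration of how five equally weighted dimensions add up to a 100-point total, here is a minimal sketch. The names `DIMENSIONS`, `MAX_PER_DIMENSION`, and `total_score` are hypothetical and are not taken from evaluator.py, which may compute its scores differently.

```python
# Hypothetical sketch; not the actual logic of evaluator.py.
DIMENSIONS = ("structure", "clarity", "completeness", "consistency", "functionality")
MAX_PER_DIMENSION = 20  # five dimensions at 20% each give a 100-point scale


def total_score(scores: dict) -> int:
    """Validate the five dimension scores and sum them into a 0-100 total."""
    missing = [name for name in DIMENSIONS if name not in scores]
    if missing:
        raise ValueError(f"missing dimensions: {missing}")
    for name in DIMENSIONS:
        value = scores[name]
        if not 0 <= value <= MAX_PER_DIMENSION:
            raise ValueError(f"{name} must be between 0 and {MAX_PER_DIMENSION}, got {value}")
    return sum(scores[name] for name in DIMENSIONS)


example = {
    "structure": 18,
    "clarity": 16,
    "completeness": 14,
    "consistency": 17,
    "functionality": 15,
}
print(total_score(example))  # 80
```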

Output Format

╔═══════════════════════════════════════════════════════════╗
║ 📊 SKILL EVALUATION REPORT — [Skill Name]              ║
║ Score: XX/100 [STATUS]                                  ║
╠═══════════════════════════════════════════════════════════╣
║ STRUCTURE:     XX/20 ████████████░░░░ XX%                ║
║ CLARITY:      XX/20 ████████████░░░░ XX%                ║
║ COMPLETENESS: XX/20 ████████████░░░░ XX%                ║
║ CONSISTENCY:  XX/20 ████████████░░░░ XX%                ║
║ FUNCTIONALITY: XX/20 ████████████░░░░ XX%                ║
╠═══════════════════════════════════════════════════════════╣
║ STATUS: ✅ APPROVED (score >= 70%)                      ║
╚═══════════════════════════════════════════════════════════╝
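The progress bars in the report can be reproduced with a few lines of Python. This is only a sketch under assumptions: the 16-cell width is read off the sample above, and `render_bar` and `render_row` are hypothetical helpers, not functions exposed by evaluator.py.

```python
# Hypothetical sketch of the bar rendering; evaluator.py may differ.
FILLED, EMPTY = "█", "░"


def render_bar(score: int, max_score: int = 20, width: int = 16) -> str:
    """Return a fixed-width bar whose filled part is proportional to the score."""
    if max_score <= 0 or not 0 <= score <= max_score:
        raise ValueError("score must lie between 0 and max_score")
    filled = score * width // max_score  # integer floor, so the bar never overstates
    return FILLED * filled + EMPTY * (width - filled)


def render_row(label: str, score: int, max_score: int = 20) -> str:
    """Format one dimension line: label, score, bar, and percentage."""
    name = label.upper() + ":"
    percent = 100 * score // max_score
    return f"{name:<15}{score:>2}/{max_score} {render_bar(score, max_score)} {percent}%"


print(render_row("structure", 15))
```

A score of 15/20 fills 12 of the 16 cells, which matches the bar shown in the sample report.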

Thresholds

| Score | Status | Action |
|-------|--------|--------|
| 90-100 | 🟢 EXCELLENT | Ready for production |
| 70-89 | 🟡 GOOD | Publishable, minor notes |
| 50-69 | 🟠 NEEDS_WORK | Fix before publishing |
| <50 | 🔴 POOR | Major rework needed |
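The threshold bands translate directly into a lookup function. The sketch below assumes the boundaries are inclusive at the lower end (90, 70, 50), as the table suggests; `status_for` is a hypothetical name.

```python
# Hypothetical sketch of the threshold mapping described in the table.
def status_for(score: float) -> str:
    """Map a 0-100 score to the status label from the thresholds table."""
    if not 0 <= score <= 100:
        raise ValueError(f"score must be between 0 and 100, got {score}")
    if score >= 90:
        return "EXCELLENT"
    if score >= 70:
        return "GOOD"
    if score >= 50:
        return "NEEDS_WORK"
    return "POOR"


for sample in (95, 70, 69, 12):
    print(sample, status_for(sample))
```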

4. ISO 25010 EVALUATION SYSTEM

Automated Checks (eval-skill.py)

Runs 13 automated checks, including:

  • File structure validation
  • Frontmatter YAML parsing
  • Description quality (65+ words, trigger contexts)
  • Script syntax validation
  • Credential scanning
  • Dependency audit

Target: 90%+ (at least 12 of 13 checks passed)
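To make the description-quality check concrete, here is a minimal sketch of what such a check could look like. It assumes that "trigger contexts" means the description contains a "Use when" clause; `check_description` is a hypothetical function, and the real eval-skill.py may apply different rules.

```python
# Hypothetical sketch of a description-quality check; eval-skill.py may differ.
def check_description(description: str, min_words: int = 65) -> list:
    """Return a list of problems found in a skill description (empty if none)."""
    problems = []
    word_count = len(description.split())
    if word_count < min_words:
        problems.append(
            f"description has {word_count} words, expected at least {min_words}"
        )
    # Assumption: a trigger context is signalled by a "Use when" clause.
    if "use when" not in description.lower():
        problems.append('description has no "Use when:" trigger clause')
    return problems


print(check_description("Evaluates skills."))
```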

Manual Assessment (25 Criteria)

| Category | Framework | Max | Criteria |
|----------|-----------|-----|----------|
| 1. Functional Suitability | ISO 25010 | 12 | Completeness, Correctness, Appropriateness |
| 2. Reliability | ISO 25010 | 12 | Fault Tolerance, Error Reporting, Recoverability |
| 3. Performance | ISO 25010 | 8 | Token Cost, Execution Efficiency |
| 4. Usability (AI) | Shneiderman | 12 | Learnability, Consistency, Feedback |
| 5. Usability (Human) | Tognazzini | 8 | Discoverability, Forgiveness |
| 6. Security | ISO 25010 | 12 | Credentials, Input Validation, Data Safety |
| 7. Maintainability | ISO 25010 | 12 | Modularity, Modifiability, Testability |
| 8. Agent-Specific | Novel | 24 | Trigger Precision, Progressive Disclosure, Composability |
| TOTAL | | 100 | |
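When scoring the rubric by hand, it helps to check each category against its maximum before totalling. The sketch below encodes the category maxima from the table; `CATEGORY_MAX` and `manual_total` are hypothetical names, and the sample scores are made up for illustration.

```python
# Hypothetical helper for totalling the manual rubric; maxima come from the table.
CATEGORY_MAX = {
    "Functional Suitability": 12,
    "Reliability": 12,
    "Performance": 8,
    "Usability (AI)": 12,
    "Usability (Human)": 8,
    "Security": 12,
    "Maintainability": 12,
    "Agent-Specific": 24,
}


def manual_total(scores: dict) -> int:
    """Sum category scores after checking each one against its maximum."""
    unknown = set(scores) - set(CATEGORY_MAX)
    if unknown:
        raise ValueError(f"unknown categories: {sorted(unknown)}")
    total = 0
    for category, maximum in CATEGORY_MAX.items():
        value = scores.get(category, 0)  # unscored categories count as 0
        if not 0 <= value <= maximum:
            raise ValueError(f"{category} must be between 0 and {maximum}, got {value}")
        total += value
    return total


# Illustrative scores only, not results from a real evaluation.
sample_scores = {
    "Functional Suitability": 10,
    "Reliability": 9,
    "Performance": 6,
    "Usability (AI)": 10,
    "Usability (Human)": 7,
    "Security": 11,
    "Maintainability": 9,
    "Agent-Specific": 20,
}
print(manual_total(sample_scores))  # 82
```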

5. COMPLETE EVALUATION WORKFLOW

1. AUTOMATED: python3 eval-skill.py <path> --verbose
   → Target: 90%+ structural score
   ↓
2. AXIOMA: python3 evaluator.py <path> --verbose --improve
   → Target: 70+ score
   ↓
3. MANUAL: Score 25 criteria rubric
   → Target: 80+ score
   ↓
4. FIX: Issues from all three sources
   ↓
5. RE-EVALUATE: Until all targets met
   ↓
6. PUBLISH: To ClawHub
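The three targets in the workflow can be combined into a single publish gate. This is a sketch under assumptions: the function name `ready_to_publish` is hypothetical, and the inputs are entered by hand because the structure of the scripts' output is not documented here.

```python
# Hypothetical publish gate combining the three targets from the workflow.
def ready_to_publish(checks_passed: int, axioma_score: float, manual_score: float,
                     total_checks: int = 13) -> bool:
    """Return True only when all three workflow targets are met."""
    automated_ok = checks_passed / total_checks >= 0.90  # step 1: 90%+ of checks
    axioma_ok = axioma_score >= 70                       # step 2: 70+ Axioma score
    manual_ok = manual_score >= 80                       # step 3: 80+ manual rubric
    return automated_ok and axioma_ok and manual_ok


print(ready_to_publish(12, 74, 82))  # True
print(ready_to_publish(11, 95, 90))  # False: 11/13 is below 90%
```

With 13 checks, 12 passes is about 92% and clears the bar, while 11 passes is about 85% and does not.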

6. ERROR HANDLING

Common Issues

| Issue | Cause | Solution |
|-------|-------|----------|
| No frontmatter | YAML not at start | Add --- at start of SKILL.md |
| Poor description | Missing triggers | Add "Use when:" clauses |
| Empty directories | Unused folders | Remove or populate |
| Name mismatch | Directory ≠ frontmatter | Rename to match |
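Two of these issues, missing frontmatter and a name mismatch, are easy to detect before running the evaluator. The following is a minimal sketch that assumes the frontmatter carries a flat `name:` key; the helper names are hypothetical and the parsing is deliberately simpler than a full YAML parser.

```python
# Hypothetical pre-check for frontmatter presence and directory/name agreement.
import re
from pathlib import Path


def frontmatter_block(text: str):
    """Return the raw text between the first two '---' lines, or None."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return None  # frontmatter must start on the very first line
    for index in range(1, len(lines)):
        if lines[index].strip() == "---":
            return "\n".join(lines[1:index])
    return None  # opening delimiter without a closing one


def declared_name(frontmatter: str):
    """Extract the value of a top-level 'name:' key, or None if absent."""
    match = re.search(r"^name:[ \t]*(.+?)[ \t]*$", frontmatter, flags=re.MULTILINE)
    return match.group(1).strip("'\"") if match else None


def name_matches_directory(skill_dir: str, skill_md_text: str) -> bool:
    """Check that the frontmatter name equals the skill directory name."""
    block = frontmatter_block(skill_md_text)
    return block is not None and declared_name(block) == Path(skill_dir).name


sample = "---\nname: my-skill\ndescription: Example.\n---\n\nBody text\n"
print(name_matches_directory("skills/my-skill", sample))  # True
```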

Security Issues

| Issue | Severity | Action |
|-------|----------|--------|
| Hardcoded credentials | CRITICAL | Remove immediately |
| Missing input validation | HIGH | Add validation |
| No error handling | MEDIUM | Add try/catch blocks |
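For the CRITICAL case, a simple line-based scan can catch the most obvious hardcoded credentials. The pattern below is an illustrative assumption, not the rule set used by eval-skill.py: it flags assignments of a quoted literal of eight or more characters to a name containing api_key, secret, password, or token.

```python
# Hypothetical credential scan; the pattern is illustrative, not exhaustive.
import re

SECRET_PATTERN = re.compile(
    r"(api[_-]?key|secret|password|token)\s*[:=]\s*['\"]([^'\"]{8,})['\"]",
    re.IGNORECASE,
)


def scan_for_credentials(text: str) -> list:
    """Return (line_number, matched_name) pairs for suspicious assignments."""
    findings = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        match = SECRET_PATTERN.search(line)
        if match:
            findings.append((line_number, match.group(1)))
    return findings


sample_source = (
    "import os\n"
    'API_KEY = "sk-test-1234567890"\n'      # flagged: literal assigned to API_KEY
    'token = os.environ["TOKEN"]\n'         # not flagged: read from the environment
)
print(scan_for_credentials(sample_source))  # [(2, 'API_KEY')]
```

Reading secrets from the environment, as on the third line of the sample, is the usual way to clear this finding.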

7. EDGE CASES

| Case | Input | Expected Output |
|------|-------|-----------------|
| Empty SKILL.md | Empty file | Error message, suggest template |
| Very long SKILL.md | >500 lines | Warning, recommend split |
| Missing description | No frontmatter | Fail with instructions |
| No scripts | No scripts/ dir | Pass, document as standalone |
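The edge cases in the table can be expressed as a small classifier, which also doubles as a test fixture when checking the evaluator's behaviour. This is a sketch only: `classify_skill_md` and its message strings are hypothetical and do not reproduce the evaluator's actual output.

```python
# Hypothetical classifier mirroring the edge-case table.
def classify_skill_md(text: str, has_scripts_dir: bool, max_lines: int = 500) -> list:
    """Return (level, message) tuples for the edge cases listed in the table."""
    if not text.strip():
        # An empty file makes every other check meaningless, so stop here.
        return [("error", "SKILL.md is empty; start from a template")]
    findings = []
    if len(text.splitlines()) > max_lines:
        findings.append(("warning", f"SKILL.md exceeds {max_lines} lines; consider splitting it"))
    if not text.lstrip().startswith("---"):
        findings.append(("error", "no frontmatter found; add name and description"))
    if not has_scripts_dir:
        findings.append(("info", "no scripts/ directory; treat as a standalone skill"))
    return findings


print(classify_skill_md("", has_scripts_dir=True))
```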

8. DEPENDENCIES

| Dependency | Purpose | Required |
|------------|---------|----------|
| Python 3.6+ | Script execution | Yes |
| PyYAML | Frontmatter parsing | Optional |
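Because PyYAML is optional, a script that parses frontmatter needs a fallback for when the import fails. One common pattern is sketched below; it is an assumption about how this could be handled, not a description of the bundled scripts. The fallback understands only flat `key: value` lines.

```python
# Hypothetical optional-dependency pattern for frontmatter parsing.
try:
    import yaml  # PyYAML, used when available
except ImportError:
    yaml = None


def parse_frontmatter(block: str) -> dict:
    """Parse a frontmatter block with PyYAML if installed, else a flat fallback."""
    if yaml is not None:
        data = yaml.safe_load(block)
        return data if isinstance(data, dict) else {}
    # Fallback: handles only simple "key: value" lines, no nesting or lists.
    result = {}
    for line in block.splitlines():
        if ":" in line and not line.lstrip().startswith("#"):
            key, _, value = line.partition(":")
            result[key.strip()] = value.strip().strip("'\"")
    return result


print(parse_frontmatter("name: demo\ndescription: A test skill"))
```

Both code paths return the same dictionary for simple string values like these; nested YAML is parsed correctly only when PyYAML is installed.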

In Altum Per Quality. 🧙‍♂️ Axioma Skill Evaluator v2.1