Axiomata Skill Evaluator En

Axiomata Skill Evaluator — Universal OpenClaw Agent Skill Quality System. Dual evaluation: (1) Axioma 5-Dimension Framework (Structure 20%, Clarity 20%, Completeness 20%, Consistency 20%, Functionality 20%, 100pts total), (2) ISO 25010 Structural Framework (13 automated checks, 100% target). Self-contained: bundles evaluator.py (Axioma 5-Dim) and eval-skill.py (ISO 25010). Use when: evaluating a skill before publishing, improving a skill based on evaluation results, checking skill quality with automated analysis, auditing skill quality, or verifying a skill meets production standards.

Audits

Pass

Install

openclaw skills install axiomata-skill-evaluator-en

Axiomata Skill Evaluator v3.0

Universal, impersonal skill quality evaluator for OpenClaw agents.

InfoValue
Version3.0.0
TypeSelf-contained evaluation system
EvaluationDual (Axioma 5-Dim + ISO 25010)
Target70+ (Axioma), 90%+ (ISO 25010)

1. Purpose and Scope

Objective

Provide complete skill quality evaluation using dual evaluation system:

  1. Axioma 5-Dimension Framework (100 points)
  2. ISO 25010 Structural Framework (13 automated checks)

Design Principles

SELF-CONTAINED + UNIVERSAL + IMPERSONAL
PrincipleDescription
Self-containedAll tools bundled in skill directory
UniversalWorks for any OpenClaw agent
ImpersonalNo agent-specific references

When to Use

TriggerAction
Evaluate skillRun dual evaluation
Pre-publish checkRun full evaluation pipeline
Improve skillAnalyze report and fix issues
Skill auditRun complete audit
Check qualityRun automated checks

2. Dual Evaluation System

╔═══════════════════════════════════════════════════════════╗
║              DUAL EVALUATION ARCHITECTURE                ║
╠═══════════════════════════════════════════════════════════╣
║                                                           ║
║  ┌─────────────────────────────────────────────────┐      ║
║  │  1. Axioma 5-Dim Evaluation (100 pts)          │      ║
║  │                                                 │      ║
║  │  Dimensions:                                     │      ║
║  │  ├─ Structure     → 20%                         │      ║
║  │  ├─ Clarity      → 20%                         │      ║
║  │  ├─ Completeness → 20%                         │      ║
║  │  ├─ Consistency  → 20%                         │      ║
║  │  └─ Functionality → 20%                        │      ║
║  │                                                 │      ║
║  │  Target: 70+ score                             │      ║
║  └─────────────────────────────────────────────────┘      ║
║                        ↓                                ║
║  ┌─────────────────────────────────────────────────┐      ║
║  │  2. ISO 25010 Structural (13 checks)          │      ║
║  │                                                 │      ║
║  │  Categories: 8 categories, 25 standards       │      ║
║  │  Automated checks: 13 tests                    │      ║
║  │                                                 │      ║
║  │  Target: 90%+ (12/13 checks passing)          │      ║
║  └─────────────────────────────────────────────────┘      ║
║                                                           ║
╚═══════════════════════════════════════════════════════════╝

Evaluation Flow

[INPUT] Skill to evaluate
          ↓
 Phase 1: Axioma 5-Dim Evaluation
          ↓
 Phase 2: ISO 25010 Structural Checks
          ↓
 [OUTPUT] Quality report + recommendations

3. Bundled Tools

FileSystemPurpose
evaluator.pyAxioma 5-Dim5-dimension evaluation, bundled
eval-skill.pyISO 25010Automated structural checks, bundled

Tool Paths

EVAL_PATH="<skill-directory>/scripts/"
SKILL_PATH="<skill-to-evaluate>"

# Axioma 5-Dim evaluation
python3 ${EVAL_PATH}evaluator.py <SKILL_PATH> --verbose

# ISO 25010 checks
python3 ${EVAL_PATH}eval-skill.py <SKILL_PATH> --verbose

4. Axioma 5-Dimension Evaluation

4.1 Dimension Breakdown

DimensionMaxTargetWhat it checks
Structure2014+Header, sections, formatting, metadata
Clarity2014+Description, commands, examples, constraints
Completeness2014+Tools, prerequisites, errors, edge cases
Consistency2014+Naming, format, structure, style
Functionality2014+Valid commands, documented outputs

4.2 Score Thresholds

ScoreStatusAction
90-100EXCELLENTReady for production
70-89GOODMinor improvements possible
50-69NEEDS_WORKMajor improvements needed
<50POORSignificant rewrite required

5. ISO 25010 Structural Checks

5.1 Automated Checks (13 total)

CategoryChecksWhat it verifies
Structure6Frontmatter, header, sections, formatting
Trigger2Description length, trigger contexts
Documentation3Body length, references, linked docs
Scripts2Python parse, no external deps

5.2 Pass Threshold

ResultMeaning
13/13Perfect structural score
11-12/13Acceptable, minor warnings
<11/13Needs structural improvement

6. Command Reference

6.1 Axioma 5-Dim Evaluation

# Basic evaluation
python3 <skill-directory>/scripts/evaluator.py <skill-path>

# Verbose output
python3 <skill-directory>/scripts/evaluator.py <skill-path> --verbose

# With auto-improvement suggestions
python3 <skill-directory>/scripts/evaluator.py <skill-path> --verbose --improve

# JSON output
python3 <skill-directory>/scripts/evaluator.py <skill-path> --json

6.2 ISO 25010 Checks

# Basic ISO check
python3 <skill-directory>/scripts/eval-skill.py <skill-path>

# Verbose output
python3 <skill-directory>/scripts/eval-skill.py <skill-path> --verbose

# JSON output
python3 <skill-directory>/scripts/eval-skill.py <skill-path> --json

6.3 Evaluate All Skills

# Evaluate all skills in parent directory
python3 <skill-directory>/scripts/evaluator.py --all

# Verbose all
python3 <skill-directory>/scripts/evaluator.py --all --verbose

7. Output Formats

7.1 Console Output

=== EVALUATION RESULTS ===
STRUCTURE       20/20 ████████████████████ 100%
CLARITY         17/20 █████████████████░░░ 85%
COMPLETENESS    14/20 ████████████░░░░░░░░ 70%
CONSISTENCY     10/20 ██████████░░░░░░░░░ 50%
FUNCTIONALITY   12/20 ████████████░░░░░░░░ 60%
------------------------------------------------------------
STATUS: GOOD (score 73%)

7.2 JSON Output

{
  "skill": "example-skill",
  "scores": {
    "structure": 20,
    "clarity": 17,
    "completeness": 14,
    "consistency": 10,
    "functionality": 12
  },
  "total": 73,
  "max": 100,
  "passed": true,
  "recommendations": [
    "[CONSISTENCY] Low score (50%)",
    "  -> Style: 2/5"
  ]
}

8. Examples

Example 1: Basic Evaluation

$ python3 evaluator.py /path/to/skill --verbose
Read /path/to/skill/SKILL.md (2048 chars)
Structure: 20/20
Clarity: 17/20
Completeness: 14/20
Consistency: 10/20
Functionality: 12/20
============================================================
STATUS: GOOD (score 73%)

Example 2: ISO 25010 Check

$ python3 eval-skill.py /path/to/skill --verbose
[STRUCTURE]
    Pass: 5/6
[TRIGGER]
    Pass: 2/2
[DOCUMENTATION]
    Pass: 2/3
[SCRIPTS]
    Pass: 2/2
[SECURITY]
    Pass: 2/2
==================================================
  Pass: 13  Warn: 0  Fail: 0
  Structural score: 100% (13/13 checks passed)

Example 3: JSON Output

$ python3 evaluator.py /path/to/skill --json
{
  "skill": "example-skill",
  "scores": {
    "structure": 20,
    "clarity": 17,
    "completeness": 14,
    "consistency": 10,
    "functionality": 12
  },
  "total": 73,
  "max": 100,
  "passed": true,
  "recommendations": []
}

9. Self-Evaluation

The skill evaluates itself automatically:

# Self-evaluation (when no path given, evaluates own directory)
python3 evaluator.py

# Expected output for v3.0:
# Score: 85+/100
# Status: GOOD to EXCELLENT

10. Recommendations Format

When a dimension scores below 70%:

[{DIMENSION}] Low score ({percentage}%)
  -> {specific issue}
  -> {specific issue}

Example:

[CONSISTENCY] Low score (50%)
  -> Cluster alignment partial: 2
  -> Style: 2/5

11. Constraints

ConstraintDescriptionPriority
70% minimumSkills must score 70+ on Axioma 5-DimHIGH
90%+ structuralTarget 90%+ on ISO 25010 checksHIGH
Self-containedAll tools bundled, no external depsHIGH
ImpersonalNo agent-specific referencesHIGH

Quality Thresholds

SystemMinimumTarget
Axioma 5-Dim70/10090/100
ISO 2501011/1313/13
Structure14/2018/20
Clarity14/2018/20
Completeness14/2018/20
Consistency14/2018/20
Functionality14/2018/20

12. Error Handling

ErrorCauseSolution
SKILL.md not foundMissing fileCreate SKILL.md first
Parse errorCorrupt fileCheck file encoding
Module not foundMissing dependenciesUse bundled tools only

13. Workflow Integration

╔═══════════════════════════════════════════════════════════╗
║         SKILL PUBLISHING WORKFLOW                        ║
╠═══════════════════════════════════════════════════════════╣
║                                                           ║
║  1. Create Skill                                          ║
║      ↓                                                    ║
║  2. Self-Evaluate                                         ║
║      ↓ 70+ → Continue                                    ║
║  3. ISO 25010 Check                                      ║
║      ↓ 90%+ → Continue                                   ║
║  4. Fix Issues if Needed                                 ║
║      ↓                                                    ║
║  5. Publish to ClawHub                                   ║
║                                                           ║
║  RESULT: QUALITY SKILLS FOR PRODUCTION                   ║
║                                                           ║
╚═══════════════════════════════════════════════════════════╝

In Altum Per Qualitatem. 🧪 AXIOMATA SKILL EVALUATOR v3.0 — UNIVERSAL QUALITY SYSTEM