Axioma Skill Evaluator Strict EN
Pass. Audited by ClawScan on May 10, 2026.
Overview
This appears to be a local skill-quality evaluator, but users should treat its 90% approval score as advisory and verify the local commands before running it.
Before installing: verify the complete scripts in your environment; run evaluations only on intended skill directories; be cautious with --improve; and treat the 90% threshold as a heuristic quality gate, not a guarantee that a skill is safe or production-ready.
Findings (3)
Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.
Running the skill executes local Python code against a chosen skill directory and may affect that directory if the improvement mode writes changes.
The documented workflow runs local evaluator code and includes an --improve mode. This is aligned with the evaluator's purpose, but users should understand whether --improve edits files before using it.
python3 .../evaluator.py <skill-path> --verbose --improve
Run it only on a skill path you intentionally select, review or back up files before using --improve, and avoid broad modes unless you understand their scope.
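One way to follow that advice is to snapshot the skill directory before any --improve run, so changes can be diffed or reverted afterward. A minimal sketch, assuming a local Python workflow; the backup naming scheme and the evaluator invocation shown in the `__main__` block are illustrative, not part of the skill:

```python
import shutil
from datetime import datetime
from pathlib import Path


def snapshot(skill_path: str) -> Path:
    """Copy the skill directory aside so an --improve run can be reverted."""
    src = Path(skill_path).resolve()
    stamp = datetime.now().strftime("%Y%m%d-%H%M%S")
    backup = src.parent / f"{src.name}.bak-{stamp}"
    shutil.copytree(src, backup)  # full pre-run copy to diff or restore later
    return backup


if __name__ == "__main__":
    import subprocess
    import sys

    skill = sys.argv[1]  # a skill path you intentionally selected
    print("Backup written to", snapshot(skill))
    # Only after the backup exists, run the mode that may write changes.
    subprocess.run(
        [sys.executable, "evaluator.py", skill, "--verbose", "--improve"],
        check=True,
    )
```

After the run, a plain directory diff against the backup shows exactly what --improve changed.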
The tool may fail, scan an unexpected directory, or write reports to an unexpected local path if those hard-coded paths are used.
The evaluator contains hard-coded, user-specific local paths. This is not malicious by itself, but it is a portability and scope issue for users outside that environment.
SKILL_DIR = Path("/media/ezekiel/Morgana/skills")
REPORTS_DIR = Path("/media/ezekiel/Morgana/skills/SKILL_EVALUATOR/reports")
Prefer explicit skill paths, inspect the full script before broad runs, and adjust hard-coded paths for your own environment.
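If you adapt the script to your own machine, one common pattern is to let environment variables override the hard-coded defaults rather than editing paths in place. A hedged sketch: the `SKILL_DIR`/`REPORTS_DIR` defaults are taken from the script, but the `SKILL_EVAL_SKILLS` and `SKILL_EVAL_REPORTS` variable names are hypothetical:

```python
import os
from pathlib import Path


def skills_dir() -> Path:
    """Resolve the skills root, preferring an environment override."""
    # SKILL_EVAL_SKILLS is a hypothetical override; the literal path
    # is the script's original hard-coded default.
    return Path(os.environ.get("SKILL_EVAL_SKILLS", "/media/ezekiel/Morgana/skills"))


def reports_dir() -> Path:
    """Resolve the reports directory relative to the skills root by default."""
    default = skills_dir() / "SKILL_EVALUATOR" / "reports"
    return Path(os.environ.get("SKILL_EVAL_REPORTS", str(default)))
```

This keeps the original behavior when no variables are set, while letting other users point the evaluator at their own directories without touching the source.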
A user or agent could over-trust a numeric score and treat it as final approval.
The skill uses strong approval and production-readiness language for a deterministic score. That can be useful as a quality gate, but it should not be mistaken for a full security or human review.
IF score >= 90%: → APPROVED ✅ — Ready for production ... NO PUBLISH until 90%+ achieved
Use the score as one input, not as a replacement for security review, functional testing, and human approval.
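To illustrate that principle, a release decision can combine the numeric score with independent checks so that a high score alone never approves a skill. The 90% threshold comes from the skill's own rule; the other gate names here are hypothetical:

```python
def release_ready(
    score: float,
    security_review_passed: bool,
    tests_passed: bool,
    human_approved: bool,
) -> bool:
    """Treat the evaluator score as necessary but not sufficient:
    every independent gate must also pass before release."""
    return (
        score >= 90.0
        and security_review_passed
        and tests_passed
        and human_approved
    )
```

Under this scheme a 95% score with a failed security review still blocks release, which is the intended reading of the score as one input rather than final approval.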
