Axioma Skill Evaluator Strict EN

AdvisoryAudited by Static analysis on May 8, 2026.

Overview

No suspicious patterns detected.

Findings (0)

Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.

What this means

Running the skill executes local Python code against a chosen skill directory and may affect that directory if the improvement mode writes changes.

Why it was flagged

The documented workflow runs local evaluator code and includes an --improve mode. This is aligned with the evaluator purpose, but users should understand whether --improve edits files before using it.

Skill content
python3 .../evaluator.py <skill-path> --verbose --improve
Recommendation

Run it only on a skill path you intentionally select, review or back up files before using --improve, and avoid broad modes unless you understand their scope.

What this means

The tool may fail, scan an unexpected directory, or write reports to an unexpected local path if those hard-coded paths are used.

Why it was flagged

The evaluator contains hard-coded, user-specific local paths. This is not malicious by itself, but it is a portability and scope issue for users outside that environment.

Skill content
SKILL_DIR = Path("/media/ezekiel/Morgana/skills")
REPORTS_DIR = Path("/media/ezekiel/Morgana/skills/SKILL_EVALUATOR/reports")
Recommendation

Prefer explicit skill paths, inspect the full script before broad runs, and adjust hard-coded paths for your own environment.

What this means

A user or agent could over-trust a numeric score and treat it as final approval.

Why it was flagged

The skill uses strong approval and production-readiness language for a deterministic score. That can be useful as a quality gate, but it should not be mistaken for a full security or human review.

Skill content
IF score >= 90%: → APPROVED ✅ — Ready for production ... NO PUBLISH until 90%+ achieved
Recommendation

Use the score as one input, not as a replacement for security review, functional testing, and human approval.