Axioma Skill Evaluator Strict EN

AdvisoryAudited by Static analysis on May 8, 2026.

Overview

No suspicious patterns detected.

Findings (0)

Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.

NoteHigh Confidence

ASI02: Tool Misuse and Exploitation

What this means

Running the skill executes local Python code against a chosen skill directory and may affect that directory if the improvement mode writes changes.

Why it was flagged

The documented workflow runs local evaluator code and includes an --improve mode. This is aligned with the evaluator purpose, but users should understand whether --improve edits files before using it.

Skill content

python3 .../evaluator.py <skill-path> --verbose --improve

Recommendation

Run it only on a skill path you intentionally select, review or back up files before using --improve, and avoid broad modes unless you understand their scope.

NoteHigh Confidence

ASI04: Agentic Supply Chain Vulnerabilities

What this means

The tool may fail, scan an unexpected directory, or write reports to an unexpected local path if those hard-coded paths are used.

Why it was flagged

The evaluator contains hard-coded, user-specific local paths. This is not malicious by itself, but it is a portability and scope issue for users outside that environment.

Skill content

SKILL_DIR = Path("/media/ezekiel/Morgana/skills")
REPORTS_DIR = Path("/media/ezekiel/Morgana/skills/SKILL_EVALUATOR/reports")

Recommendation

Prefer explicit skill paths, inspect the full script before broad runs, and adjust hard-coded paths for your own environment.

NoteHigh Confidence

ASI09: Human-Agent Trust Exploitation

What this means

A user or agent could over-trust a numeric score and treat it as final approval.

Why it was flagged

The skill uses strong approval and production-readiness language for a deterministic score. That can be useful as a quality gate, but it should not be mistaken for a full security or human review.

Skill content

IF score >= 90%: → APPROVED ✅ — Ready for production ... NO PUBLISH until 90%+ achieved

Recommendation

Use the score as one input, not as a replacement for security review, functional testing, and human approval.