Install
openclaw skills install skill-scorer对任何 SKILL.md(或 skill 文件夹)进行质量评估和打分,基于行业最佳实践,生成 8 维度 100 分制的结构化质检报告,精准定位问题并提供可执行的优化建议。当用户要求评审、审计、评分、检测、质检任何 skill 时使用——哪怕只是说「这个 skill 写得怎么样?」也会触发。也支持:skill质检、skill评分、检测skill。 | Evaluate and score any SKILL.md (or skill folder) against industry best practices. Generates a structured quality report with a 100-point score across 8 dimensions, pinpoints issues, and provides actionable optimization suggestions. Use this skill whenever the user asks to review, audit, evaluate, grade, score, lint, or quality-check a skill — even if they just say 'is this skill any good?' or 'help me improve this skill'. Also triggers on: 'skill review', 'rate my skill'.
openclaw skills install skill-scorerA meta-skill that evaluates the quality of other skills. Given a SKILL.md file (or a complete skill folder), it performs a systematic audit across 8 dimensions, assigns a score out of 100, identifies issues by severity, and generates actionable optimization suggestions.
This skill synthesizes quality criteria from Anthropic's official skill authoring best practices, the Skill Engineering Standard (v1.4.3), and community-tested patterns from production skill ecosystems.
User provides a skill and asks any of:
Do NOT activate for: creating a new skill from scratch → use skill-creator. This skill is for evaluation, not generation.
Determine what the user has provided:
| Input | Action |
|---|---|
Single SKILL.md file | Evaluate that file |
Skill folder (with references/) | Evaluate all files, cross-reference consistency |
| URL / GitHub link | Fetch and evaluate |
| Pasted markdown content | Treat as SKILL.md |
If the user has not provided a skill → ask: "请提供要评估的 SKILL.md 文件或 skill 文件夹路径。"
Input validation — before proceeding to Step 1, verify the input is actually a skill:
| Check | Condition | Action |
|---|---|---|
| Binary / garbled content | File is not valid text, or text is unreadable gibberish | STOP. Report: "This file does not appear to be a valid SKILL.md — it contains binary or unreadable content. Please provide a markdown-based skill file." Do NOT attempt to score. |
| No skill markers at all | Text is valid but contains zero skill indicators (no YAML frontmatter ---, no markdown headings resembling skill sections, no workflow/instructions) | STOP. Report: "This appears to be a {detected_type} file (e.g., Python script, JSON config, plain prose), not a SKILL.md. skill-scorer evaluates SKILL.md files only." Do NOT force-fit 8 dimensions onto non-skill content. |
| Partial skill structure | Has some skill-like elements (e.g., YAML frontmatter exists but body is minimal, or has headings but no workflow) | PROCEED with caveats. Evaluate normally, but note in the report header: "⚠️ This file has incomplete skill structure — scores reflect what is present." Score missing sections as 0 in relevant dimensions rather than guessing. |
Extract and inventory:
name, description, version, compatibility)references/, scripts/, assets/)Read references/rubric.md for the complete scoring rubric.
Evaluate the skill across these 8 dimensions (each scored 0-100, then weighted):
| # | Dimension | Weight | What It Measures |
|---|---|---|---|
| 1 | Metadata & Triggering | 15% | Name clarity, description quality, trigger coverage |
| 2 | Structure & Architecture | 15% | File organization, section order, progressive disclosure |
| 3 | Instruction Clarity | 15% | Actionability, conciseness, examples, tone |
| 4 | Workflow & Logic | 15% | Step completeness, parameter handling, validation |
| 5 | Error Handling | 10% | Fallbacks, edge cases, failure recovery |
| 6 | Context Efficiency | 10% | Token budget, redundancy, information density |
| 7 | Portability & Compatibility | 10% | Self-containment, cross-platform support |
| 8 | Safety & Robustness | 10% | No injection risk, no hallucination traps, identity lock |
For each issue found, classify severity:
| Severity | Meaning | Score Impact |
|---|---|---|
| 🔴 Critical | Skill will malfunction or not trigger | -10 to -15 per issue |
| 🟡 Warning | Skill works but suboptimally | -3 to -8 per issue |
| 🟢 Suggestion | Nice-to-have improvement | -1 to -2 per issue |
Read references/report-template.md for the output format.
The report includes:
After presenting the report, ask:
--- separator, then the complete report in English. Never mix languages within a section. Both versions must contain identical scores, issues, and suggestions — only the language differs.| File | Purpose | When to read |
|---|---|---|
| references/rubric.md | Detailed scoring criteria for all 8 dimensions | Step 2: scoring |
| references/report-template.md | Output format and report structure | Step 4: generating report |
| references/anti-patterns.md | Common skill mistakes and how to detect them | Step 3: finding issues |