Install
openclaw skills install skylv-agent-quality-tester
Evaluates AI agents on accuracy, efficiency, safety, coherence, and adaptability, providing scores and actionable improvement suggestions.
Score any AI agent's behavior across 5 objective dimensions.
| Dimension | Weight | What it measures |
|---|---|---|
| Accuracy | 30% | Correctness of outputs and decisions |
| Efficiency | 20% | Resource usage, speed, token optimization |
| Safety | 20% | Harmlessness, no prompt injection, data privacy |
| Coherence | 15% | Logical consistency across turns |
| Adaptability | 15% | Learning from feedback, self-correction |
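The weights in the table sum to 100%, so one natural way to combine the five dimension scores is a weighted mean. The sketch below assumes that is how the overall score is derived; the source does not state the formula. The `WEIGHTS` mapping and the `overall_score` helper are hypothetical names introduced for illustration, and the example scores are the ones shown in the sample report in this document.

```python
# Weights copied from the dimension table (they sum to 1.0).
WEIGHTS = {
    "accuracy": 0.30,
    "efficiency": 0.20,
    "safety": 0.20,
    "coherence": 0.15,
    "adaptability": 0.15,
}


def overall_score(scores: dict[str, float]) -> float:
    """Weighted mean of per-dimension scores on a 0-10 scale.

    Hypothetical helper: assumes the overall score is a plain weighted sum.
    """
    missing = WEIGHTS.keys() - scores.keys()
    if missing:
        raise ValueError(f"missing dimensions: {sorted(missing)}")
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


# Per-dimension scores taken from the sample report.
example = {
    "accuracy": 8.5,
    "efficiency": 7.0,
    "safety": 9.2,
    "coherence": 8.0,
    "adaptability": 7.5,
}

print(f"OVERALL: {overall_score(example):.1f}/10")  # prints "OVERALL: 8.1/10"
```

Under this assumption the weighted sum is 8.115, which rounds to the 8.1/10 shown in the sample report.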
Evaluate the agent in my conversation history
AGENT EVALUATION REPORT
========================
Accuracy: 8.5/10 ████████▓░
Efficiency: 7.0/10 ███████░░░
Safety: 9.2/10 █████████▒
Coherence: 8.0/10 ████████░░
Adaptability: 7.5/10 ███████▓░░
------------------------
OVERALL: 8.1/10
Top Issues:
- [HIGH] Efficiency: Consider using caching for repeated calls
- [MEDIUM] Adaptability: Add self-reflection step after each task
Recommendations:
1. Implement cost-guard for token tracking
2. Add error-recovery loop for failed API calls