Llm Judge

PassAudited by VirusTotal on May 9, 2026.

Findings (1)

The llm-judge skill implements a structured, multi-phase agentic workflow to evaluate and rank multiple code repositories against a requirements specification. It utilizes a Phase 1 fact-gathering stage (repo-agent.md) and a Phase 2 scoring stage (judge-agents.md) with defined rubrics (scoring-rubrics.md) and JSON schemas (fact-schema.md). While the skill executes shell commands to run test suites (e.g., pytest, npm test) and git operations, these actions are strictly aligned with its stated purpose of code analysis and quality assessment, with no evidence of malicious intent, data exfiltration, or unauthorized persistence.