llm-benchmark-analyst

PassAudited by VirusTotal on May 14, 2026.

Findings (1)

The skill bundle is a highly structured and legitimate tool designed to enable an AI agent to perform LLM benchmark analysis. It includes a comprehensive reference list of global benchmarks (benchmark-source.md), detailed routing logic (core-dimensions.md), and a search playbook that emphasizes identity normalization and evidence-based reporting. The instructions in SKILL.md and search-playbook.md are strictly aligned with the stated purpose of model comparison and include sophisticated guidance on handling data defects and multimodal extraction for non-text leaderboards. No evidence of malicious intent, data exfiltration, or harmful prompt injection was found; in fact, the bundle includes references to security-focused benchmarks like SKILL-INJECT to evaluate agent robustness.