SOTA AI Model Tracker

WarnAudited by ClawScan on May 10, 2026.

Overview

The model-tracking purpose is coherent, but the package includes unrelated background-agent orchestration instructions and recommends persistent daily updates to agent instruction files.

The SOTA tracker’s core data lookup purpose appears legitimate, but install it carefully: remove or ignore the unrelated Cyrus automation instructions, do not enable daily edits to Claude/agents files unless you want persistent behavior changes, and verify the GitHub source before running dependency installs or timers.

Findings (4)

Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.

What this means

If an agent follows these instructions, it may create or delegate background work through a separate system rather than only answering model-tracking questions.

Why it was flagged

This file is packaged with the skill and instructs an agent to hand work to an external/background orchestration system, which is unrelated to SOTA model tracking and could cause autonomous activity outside the user’s intended task.

Skill content
Delegate to Cyrus - execution happens automatically
Recommendation

Remove maintainer-specific Cyrus/Linear orchestration instructions from the distributed skill, or clearly isolate them from user-facing agent instructions and require explicit user approval before any delegation.

What this means

Future Claude sessions could rely on automatically refreshed model recommendations or instructions without the user noticing the change each time.

Why it was flagged

The skill recommends writing externally updated SOTA data into a persistent Claude instruction/context file, so scraped or compromised data could influence future agent sessions.

Skill content
This embeds a compact SOTA summary directly in your `~/.claude/CLAUDE.md` file.
Recommendation

Only enable persistent agent-file updates if you trust the data source; prefer a clearly marked block, backups, easy rollback, and explicit opt-in before enabling daily timers.

What this means

Users may run code or install dependencies from a repository/branch that is not pinned or verified by the registry metadata.

Why it was flagged

The registry metadata does not declare a source or homepage, while the documentation points users toward manual GitHub clone/install workflows. That is not malicious by itself, but it leaves provenance verification to the user.

Skill content
Source: unknown; Homepage: none
Recommendation

Verify the GitHub repository owner, review the code and dependency files, and preferably pin to a trusted commit before running setup commands.

What this means

Other devices on the network may be able to query the tracker API if firewall rules permit it.

Why it was flagged

Binding the REST API to 0.0.0.0 is a disclosed, user-directed server command, but it can expose the API beyond localhost if the machine or network allows inbound access.

Skill content
uvicorn rest_api:app --host 0.0.0.0 --port 8000
Recommendation

Use 127.0.0.1 for local-only use, and add authentication or firewall restrictions before exposing the API on a network.