SOTA Tracker (Claw)

Security checks across malware telemetry and agentic risk

Overview

This appears to be a legitimate AI model tracker, but it should be reviewed because it can repeatedly rewrite agent instruction files and includes reduced-safety model recommendations without strong user controls.

Review before installing. Use it only if you want a SOTA model recommendation tool, keep the API local or behind a firewall, pin and audit dependencies, and do not enable cron/systemd updates to CLAUDE.md or agents.md until you have reviewed the exact file path, backup behavior, and how to disable it. Consider filtering or disabling uncensored-model recommendations unless that is an explicit user-approved requirement.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection

Findings (18)

Intent-Code Divergence

Medium

Confidence: 96% confidence
Finding: The script behavior is destructive in a way the docstring does not disclose: `main()` unconditionally deletes the existing SQLite database before recreating it. That mismatch can lead operators or automation to run it in production or against valuable local data, causing irreversible data loss.

Missing User Warnings

Medium

Confidence: 95% confidence
Finding: The skill recommends setting up a script to automatically update the user's ~/.claude/CLAUDE.md on a recurring schedule, but it does not clearly warn that this repeatedly modifies a user-owned configuration/context file. Because CLAUDE.md can materially affect agent behavior and may contain custom instructions, regular unattended edits can overwrite or drift existing content in ways the user did not explicitly review.

Missing User Warnings

Medium

Confidence: 95% confidence
Finding: The skill provides commands to automate recurring updates to agents.md without clearly warning that this may alter existing prompts, instructions, or manually maintained agent configuration. In agent ecosystems, silent or scheduled prompt-file modification is security-relevant because it can change future model behavior persistently and without fresh user review.

Vague Triggers

Medium

Confidence: 87% confidence
Finding: The example companion skill description is broad enough to auto-activate across many ordinary AI-model conversations, which can inject hidden behavioral constraints into unrelated sessions. In this context, the risk is increased because the skill mandates tool use and hard-coded model-selection behavior, so overly broad triggering can silently steer recommendations without clear user intent.

Natural-Language Policy Violations

Medium

Confidence: 94% confidence
Finding: The embedded statement that the user 'prefers uncensored models' hard-codes a sensitive preference into the skill without runtime consent or verification. This can bias outputs toward reduced-safety model recommendations and misrepresent user intent, especially if the skill activates automatically in general AI-model discussions.

Natural-Language Policy Violations

Medium

Confidence: 80% confidence
Finding: This entry promotes an uncensored variant in model metadata without any indication of user opt-in, gating, policy constraints, or safety warnings. In a catalog consumed by agents or selection logic, that can normalize or silently route users toward less restricted models, increasing the chance of harmful or policy-violating outputs.

Natural-Language Policy Violations

Medium

Confidence: 86% confidence
Finding: This local model entry advertises an uncensored variant as an available option without any surrounding control language or consent mechanism. If this JSON feeds recommendation, UI, or automated tooling, users or downstream agents may be steered to unsafe models by default or without understanding the risk tradeoff.

Natural-Language Policy Violations

High

Confidence: 95% confidence
Finding: This entry goes beyond neutral documentation and explicitly markets 'uncensored' behavior as a strength and use case. That framing encourages selection of guardrail-free models for unrestricted interactions, which materially raises the likelihood of abuse-enabling, harmful, or noncompliant outputs in an agent ecosystem.

Natural-Language Policy Violations

High

Confidence: 97% confidence
Finding: Describing an uncensored model as providing capability 'without restrictions' is an affirmative endorsement of removed safety controls. In a model registry, this can bias automated or human selection toward less safe systems and undermine platform-level protections intended to prevent harmful generation.

Natural-Language Policy Violations

High

Confidence: 98% confidence
Finding: Advertising 'No refusals on complex topics' for an uncensored reasoning model directly signals that safety refusals have been stripped and frames that as desirable functionality. In the context of an agent skill that catalogs state-of-the-art models, this is especially dangerous because it can guide downstream systems toward models more likely to generate harmful, disallowed, or abuse-facilitating content.

Natural-Language Policy Violations

Medium

Confidence: 91% confidence
Finding: The server instructions and recommendation logic explicitly support prioritizing 'uncensored' model variants as a user preference, which can steer downstream agents toward models with fewer safety controls. In a model-recommendation skill, this increases the chance of facilitating harmful or policy-violating outputs, especially because the feature is embedded in normal recommendation flows rather than tightly constrained administrative settings.

Natural-Language Policy Violations

Medium

Confidence: 94% confidence
Finding: The get_best_in_class tool advertises categories such as 'Best uncensored' and 'no restrictions,' normalizing selection of less-restricted models as a desirable feature. Because this skill's purpose is to guide model choice, these labels directly encourage use of models that may bypass content safeguards, making misuse easier and more likely.

Missing User Warnings

Low

Confidence: 89% confidence
Finding: The code persists a hardware profile to disk and, when no profile name is supplied, uses the machine hostname as the profile identifier. Hostnames and hardware characteristics can be system-identifying metadata, so storing them without explicit user disclosure or consent creates a privacy issue, especially on shared systems or when the data directory may later be collected, synced, or exfiltrated.

Known Vulnerable Dependency: fastmcp — 8 advisory(ies): CVE-2025-69196 (FastMCP OAuth Proxy token reuse across MCP servers); GHSA-c2jp-c369-7pvx (FastMCP Auth Integration Allows for Confused Deputy Account Takeover); CVE-2025-64340 (FastMCP has a Command Injection vulnerability - Gemini CLI) +5 more

Critical

Category: Supply Chain
Confidence: 98% confidence
Finding: fastmcp

Known Vulnerable Dependency: aiohttp — 10 advisory(ies): CVE-2024-52303 (aiohttp has a memory leak when middleware is enabled when requesting a resource ); CVE-2026-34514 (AIOHTTP has CRLF injection through multipart part content type header constructi); CVE-2026-34517 (AIOHTTP has late size enforcement for non-file multipart fields causes memory Do) +7 more

High

Category: Supply Chain
Confidence: 93% confidence
Finding: aiohttp

Known Vulnerable Dependency: python-dotenv — 1 advisory(ies): CVE-2026-28684 (python-dotenv: Symlink following in set_key allows arbitrary file overwrite via )

Low

Category: Supply Chain
Confidence: 77% confidence
Finding: python-dotenv

Known Vulnerable Dependency: fastapi — 3 advisory(ies): CVE-2021-32677 (Cross-Site Request Forgery (CSRF) in FastAPI); CVE-2021-32677 (FastAPI is a web framework for building APIs with Python 3.6+ based on standard ); CVE-2024-24762 (FastAPI is a web framework for building APIs with Python 3.8+ based on standard )

High

Category: Supply Chain
Confidence: 90% confidence
Finding: fastapi

Known Vulnerable Dependency: uvicorn — 4 advisory(ies): CVE-2020-7694 (Log injection in uvicorn); CVE-2020-7695 (HTTP response splitting in uvicorn); CVE-2020-7694 (This affects all versions of package uvicorn. The request logger provided by the) +1 more

High

Category: Supply Chain
Confidence: 91% confidence
Finding: uvicorn

VirusTotal

63/63 vendors flagged this skill as clean.

View on VirusTotal