token-slim

Security checks across malware telemetry and agentic risk

Overview

This appears to be a real token-cleanup skill, but it can persistently change future agent behavior and optionally install/download tokenizer components, so it needs user review before installation.

Install only if you are comfortable letting the skill read workspace context files and modify agent configuration. Run scans in dry-run mode first, review the exact config blocks before accepting them, avoid batch mode unless you trust every proposed change, and skip the tiktoken installer in restricted environments unless you accept pip installation and external cache downloads.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection

Findings (19)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 97% confidence
Finding: The skill document describes capabilities to read/write workspace files, invoke Python scripts, access environment-dependent paths, and perform network-backed package installation/cache fetches, but it does not declare permissions or present a clear least-privilege model. In an agent runtime, hidden or undeclared capabilities reduce operator visibility and can lead to unintended file modification or outbound network access when the skill is triggered.

Tp4

High

Category: MCP Tool Poisoning
Confidence: 95% confidence
Finding: The stated purpose is token optimization, but the documented behavior expands into package installation, remote cache retrieval, persistent local state, and scanning of skill files. That mismatch is dangerous because users may invoke a seemingly harmless cleanup skill without realizing it can execute networked install flows and create or modify additional filesystem state.

Description-Behavior Mismatch

Medium

Confidence: 92% confidence
Finding: A workspace-cleanup skill should primarily operate locally, but this one instructs the agent to install packages and fetch external resources from the internet. Introducing outbound network operations increases supply-chain and data-exposure risk, especially in constrained agent environments where users may not expect internet access from a maintenance task.

Context-Inappropriate Capability

Medium

Confidence: 91% confidence
Finding: The skill includes package installation from multiple mirrors and remote vocabulary downloads that are not essential to its baseline cleanup role. This broadens the attack surface through dependency compromise, mirror trust issues, and unreviewed external content retrieval, all under the banner of a low-risk optimization utility.

Description-Behavior Mismatch

Medium

Confidence: 95% confidence
Finding: The onboarding flow goes beyond temporary workspace cleanup and requires persistent edits to the workspace agent configuration that change future behavior every session. This is dangerous because a cleanup/optimization skill is using a one-time maintenance action to install long-lived behavioral policy, creating hidden persistence and changing how the agent responds later without clear scope limitation.

Intent-Code Divergence

Medium

Confidence: 91% confidence
Finding: Step 3 says backups are optional and only created if the user confirms, while the Undo Mechanism later requires backups before any modification. This inconsistency is dangerous because an operator may follow the weaker instruction and irreversibly modify or delete workspace content, especially when the same document also instructs immediate deletion of files like BOOTSTRAP.md.

Description-Behavior Mismatch

Medium

Confidence: 89% confidence
Finding: The installer reaches out to external package indexes and an OpenAI-hosted blob to download code and model assets, which expands the skill's trust boundary beyond simple workspace token optimization. In an agent/runtime context, this creates supply-chain and unexpected network-exfiltration risk because running the skill causes third-party package installation and remote fetches that are not strictly necessary for safe local analysis.

Context-Inappropriate Capability

Medium

Confidence: 92% confidence
Finding: This code performs subprocess-driven pip installation from multiple remote indexes, including mirrors, which materially increases supply-chain exposure. In agent environments, an installer that can mutate the Python environment is more dangerous than the stated token-cleanup purpose suggests, because compromise of an index, mirror, or dependency chain could lead to arbitrary code execution during install.

Context-Inappropriate Capability

Medium

Confidence: 96% confidence
Finding: The scanner can trigger outbound network access when tiktoken initializes and downloads tokenizer data into the cache. In an agent skill meant to scan a local workspace, unexpected network egress can violate offline assumptions, leak metadata such as timing/IP/environment usage, and create nondeterministic behavior in restricted environments.

Intent-Code Divergence

Medium

Confidence: 98% confidence
Finding: The help text says --dry-run will not modify files, but startup still preloads tiktoken, creates cache directories, and may fetch tokenizer data over the network before scanning. This breaks the user's safety expectation for preview mode and can cause filesystem changes or network activity in environments where dry-run is relied on for non-invasive inspection.

Vague Triggers

Medium

Confidence: 90% confidence
Finding: The trigger list includes broad, natural follow-up phrases such as 'scan again', 'find more savings', and 'what else can I optimize', which can easily appear in ordinary conversation. In an agent runtime that auto-activates skills based on phrase matching, this increases the chance of unintended invocation, causing the agent to read workspace files or initiate cleanup workflows without the user explicitly requesting this skill.

Vague Triggers

Medium

Confidence: 86% confidence
Finding: The example phrase 'save tokens' / '帮我省 Token' is short, generic, and likely to overlap with normal user requests about efficiency rather than a deliberate request to activate this specific skill. In environments where skills are selected heuristically, this can trigger the skill unexpectedly and expand the agent's behavior to scanning workspace context files, which may expose or alter project state beyond the user's immediate intent.

Vague Triggers

Medium

Confidence: 90% confidence
Finding: The trigger phrases are broad everyday language such as requests to save or optimize tokens, which can cause accidental invocation in unrelated conversations. In agent systems, unintended activation is dangerous because it can lead to workspace scanning, prompts to modify files, or execution of helper scripts without the user intentionally selecting this skill.

Vague Triggers

Medium

Confidence: 88% confidence
Finding: The re-scan examples are similarly ambiguous and may react to casual follow-up phrases like 'scan again' or 'what else can I optimize' without clear scope. While less severe than initial activation, this can still cause repeated file inspection or pressure the user toward further modifications they did not explicitly request.

Natural-Language Policy Violations

Medium

Confidence: 89% confidence
Finding: The skill requires inserting a Chinese-only persistent session-discipline block into workspace configuration, which can alter future agent behavior while being unreadable or less reviewable for some users. The danger comes less from the language itself and more from coupling unreadable persistent config changes with mandatory installation, reducing informed consent and increasing the chance of unnoticed behavioral persistence.

Natural-Language Policy Violations

Medium

Confidence: 88% confidence
Finding: The token-habits template forces Chinese-language persistent configuration and confirmations, including a default 'brutal mode' behavior for future sessions. This is dangerous because it combines opaque or less-reviewable text with persistent response-policy changes, making it easier to alter agent behavior without fully informed user approval.

Vague Triggers

Medium

Confidence: 84% confidence
Finding: The re-scan mode is triggered by broad natural-language phrases such as 'scan again' or 'what else can I optimize', which can be matched in contexts unrelated to intentional workspace modification. In this skill, that ambiguity matters because the workflow can progress from scanning into file-moving or batch execution, increasing the chance of unintended actions from casual or adversarially planted text.

Ssd 3

Medium

Confidence: 93% confidence
Finding: The instruction to record key decisions into memory files each session creates persistent storage of user-provided information by default. This is dangerous because it may retain sensitive decisions, operational context, or private data beyond the current session without granular consent, increasing exposure if the workspace is shared, synced, or later accessed by other tools or agents.

Ssd 3

Medium

Confidence: 94% confidence
Finding: The skill instructs the agent to make workspace-wide persistent behavior changes that apply every session without obtaining clear per-session or per-setting consent. In the context of an optimization skill, this is more dangerous because users may expect temporary cleanup assistance, not installation of lasting operating policy that affects future agent conduct.

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal