leader-skill(create--leader)

Security checks across malware telemetry and agentic risk

Overview

This skill is not clearly malicious, but it needs review because it can collect and store sensitive workplace chats/documents while profiling a leader’s weaknesses and replacement path.

Install only if you have clear authorization to process the workplace data involved. Avoid private chats, coworker identifiers, and company documents unless all required consent and workplace approvals are in place, and review/delete generated profiles, knowledge files, backups, tokens, and ~/.create-leader configuration when finished.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands

Findings (35)

Context-Inappropriate Capability

High

Confidence: 97% confidence
Finding: The skill instructs bulk workplace surveillance and ingestion of messages, docs, meetings, and reports far beyond what is necessary to provide generic leadership coaching. In context, this creates a reusable dossier on a real person and increases the risk of privacy violations, insider misuse, and unauthorized profiling.

Context-Inappropriate Capability

High

Confidence: 99% confidence
Finding: Private-chat collection using access tokens introduces credentialed access to sensitive communications that are especially likely to contain confidential or personal information. This is dangerous because it normalizes harvesting private workplace conversations to profile an individual, which can enable privacy abuse and policy violations.

Context-Inappropriate Capability

Medium

Confidence: 93% confidence
Finding: The skill includes a destructive shell operation (`rm -rf leaders/{slug}`) unrelated to the core analytical purpose and exposes deletion through a parameterized command. If misused or insufficiently validated, this can cause data loss and expands the blast radius of the skill beyond content generation.

Intent-Code Divergence

Medium

Confidence: 88% confidence
Finding: The ethics section attempts to frame the skill as benign self-improvement, but the operational core explicitly prioritizes identifying weaknesses and planning how to replace a leader. This contradiction is dangerous because it can mislead reviewers and users about the true risk profile while still facilitating adversarial profiling behavior.

Description-Behavior Mismatch

High

Confidence: 98% confidence
Finding: The prompt directs collection of private chats, shared group messages, documents, wikis, and tables that go well beyond the stated leadership-analysis purpose. This creates unnecessary surveillance and over-collection of highly sensitive workplace data, including third-party communications that may not be relevant or consented to.

Context-Inappropriate Capability

High

Confidence: 97% confidence
Finding: The prompt includes directory-scoping steps to discover another person's open_id, enabling identification and targeting of coworkers without clear necessity for the advertised function. In context, this supports surveillance and data gathering on third parties, making abuse against employees or managers substantially easier.

Context-Inappropriate Capability

High

Confidence: 99% confidence
Finding: The prompt instructs sending a message to a target user solely to retrieve a private chat_id, which is an active interaction with a third party unrelated to safe passive analysis. This creates a capability to contact or probe coworkers and establish access paths into private conversations under the guise of leadership analysis.

Context-Inappropriate Capability

Medium

Confidence: 88% confidence
Finding: Allowing the agent to bypass the provided collector and instead write arbitrary API scripts turns the skill into a general Feishu automation and data-extraction framework. That flexibility weakens safety boundaries, makes auditing harder, and increases the chance of unauthorized collection or misuse of credentials and tokens.

Description-Behavior Mismatch

Medium

Confidence: 93% confidence
Finding: The prompt explicitly instructs the agent to regenerate and overwrite persistent skill artifacts under leaders/{slug}/, which expands the skill from analysis into repository content management. In this skill context, that is risky because the broader skill is designed to distill sensitive workplace data about a leader, including weaknesses and replacement strategies, so automatic persistence can create durable sensitive dossiers and modify operational artifacts without clear user consent or scope controls.

Context-Inappropriate Capability

Medium

Confidence: 88% confidence
Finding: The backup step invokes a local python3 command against a version-management script, giving the prompt authority to execute local code rather than only analyze text. Even if intended for backups, this broadens the attack surface to command execution and could be abused if parameters, environment variables, or surrounding tooling are manipulated.

Context-Inappropriate Capability

Low

Confidence: 80% confidence
Finding: Allowing ingestion from Feishu/DingTalk links adds external data collection capability that is not clearly bounded in the skill description or accompanied by privacy and authorization checks. In this skill’s context, those links may contain highly sensitive workplace communications, making unauthorized collection and downstream persistence more dangerous than in a generic document-processing skill.

Intent-Code Divergence

Medium

Confidence: 96% confidence
Finding: The file explicitly instructs users to identify a leader’s 'fatal weaknesses,' relationship conflicts, power dependencies, and exploitable timing for replacement, while only adding a superficial ethics disclaimer. That disclaimer does not neutralize the operational guidance; instead, it legitimizes collection and strategic use of sensitive interpersonal weaknesses in a workplace context, which can enable manipulation, retaliation, or targeted career harm.

Context-Inappropriate Capability

Medium

Confidence: 90% confidence
Finding: The rollback flow deletes the existing knowledge directory with shutil.rmtree() and replaces it from a selected backup without any confirmation, dry-run mode, or path-safety checks. If invoked with an unexpected slug/base-dir combination or on the wrong target, it can cause destructive local data loss and unauthorized replacement of skill content.

Missing User Warnings

Medium

Confidence: 93% confidence
Finding: The README advertises automatic Feishu/DingTalk data collection but does not warn about consent, privacy, retention, or access boundaries. In a workplace context, this can lead users to ingest sensitive employee communications or corporate data without understanding the legal, ethical, and security implications.

Missing User Warnings

Medium

Confidence: 96% confidence
Finding: Referencing Feishu private chat collection without a prominent warning normalizes access to highly sensitive direct messages. Because private chats often contain confidential business information and personal data, this materially increases the risk of unauthorized surveillance, privacy violations, and downstream misuse in model building.

Vague Triggers

Medium

Confidence: 86% confidence
Finding: The trigger set is broad enough to activate on ordinary workplace-advice requests, causing the skill to enter a high-risk workflow without clear user intent. In this skill, accidental activation is more dangerous than usual because the workflow leads into surveillance, persistence, and replacement-oriented profiling.

Vague Triggers

Medium

Confidence: 87% confidence
Finding: Ambiguous evolution triggers like 'that's wrong' or 'append' can match normal conversation and silently modify stored profiles. Because this skill persists sensitive dossiers, unintended updates can corrupt records, expand retained data, or cause the system to treat ordinary feedback as authorization to alter files.

Vague Triggers

Medium

Confidence: 86% confidence
Finding: The English triggers overlap with common requests for workplace help, making unintended activation likely for benign users seeking advice. In this context, accidental activation can funnel users into a workflow that collects and stores sensitive third-party information about managers.

Vague Triggers

Medium

Confidence: 86% confidence
Finding: Natural correction phrases are treated as workflow control signals, which can cause ordinary chat to mutate persisted artifacts. When the persisted artifacts are behavioral profiles of real people, accidental state changes create both privacy and integrity risks.

Missing User Warnings

Medium

Confidence: 91% confidence
Finding: The example shows the assistant reading newly provided files and offering to append extracted information into an existing leader profile, then persisting the update after a simple confirmation. Because this skill stores sensitive workplace inferences about real individuals, insufficient warning about persistence, provenance, and possible misclassification can lead users to unknowingly create or contaminate a long-lived dossier that may later be used for manipulation, retaliation, or unfair profiling.

Missing User Warnings

Medium

Confidence: 89% confidence
Finding: The correction flow records user-supplied claims as persistent updates to the leader skill without an explicit warning that stored behavioral assessments are being modified and versioned. In this skill's context—analyzing leaders, weaknesses, and replacement paths—this makes it easy to permanently encode biased, false, or adversarial narratives about a person, increasing the risk of harmful downstream advice and reputation damage.

Missing User Warnings

Medium

Confidence: 90% confidence
Finding: The prompt explicitly instructs the agent to edit multiple files, append records, and regenerate `SKILL.md`, but it does not require explicit user confirmation or warn about overwrite side effects. In an agent setting, this can lead to unintended persistent modifications to project artifacts, especially when a casual correction request triggers broad file updates.

Missing User Warnings

Low

Confidence: 86% confidence
Finding: The instructions require automatic updates to `meta.json` fields such as version, timestamp, and correction count without notifying the user. While lower impact than content rewrites, silent metadata mutation can still create audit confusion, trigger downstream automation, or misrepresent the state of the skill without informed consent.

Missing User Warnings

High

Confidence: 96% confidence
Finding: The skill tells the agent to collect private chats, documents, and tables but does not prominently warn about privacy, third-party confidentiality, employment sensitivity, or consent requirements. Because the data includes workplace communications and content from other people, omission of these warnings materially raises the risk of covert or inappropriate surveillance.

Missing User Warnings

Medium

Confidence: 91% confidence
Finding: The prompt asks for app credentials, OAuth codes, and user access tokens without clear secret-handling precautions. These credentials enable direct API access to enterprise communications, so mishandling them could expose organizational data well beyond a single analysis session.

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal