yan-learning-engine

Security checks across malware telemetry and agentic risk

Overview

This skill is openly designed to run on a schedule and perform publishing, PRs, posts, and local agent-file changes without asking first.

Install only if you intentionally want an hourly autonomous agent, and first rewrite it so publishing, PRs/issues, public comments, sharing, persistent memory/config edits, and workspace cleanup require explicit approval. Use draft-only defaults, narrow account permissions, logging, and a clear way to disable the cron job.

SkillSpector

By NVIDIA

Vulnerability Patterns

Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access

Findings (14)

Intent-Code Divergence

High

Confidence: 98% confidence
Finding: This section explicitly instructs the agent to stop asking for confirmation and to execute first, then report afterward. That weakens approval boundaries and can cause unauthorized publishing, code changes, or external actions that a cautious agent should only perform with explicit user consent.

Intent-Code Divergence

High

Confidence: 99% confidence
Finding: The checklist operationalizes the unsafe policy by directing immediate publication, repository updates, PR creation, bug fixes, and community interaction without approval. In an agent skill, this creates a concrete path to unreviewed external actions and broadens the chance of accidental or policy-violating changes.

Context-Inappropriate Capability

High

Confidence: 99% confidence
Finding: The skill explicitly instructs the agent to act and publish autonomously without requiring user approval or bounded scope. That creates a real risk of unauthorized external actions such as repository changes, submissions, or public postings, which can affect external systems and data beyond a local learning-tracker purpose.

Context-Inappropriate Capability

High

Confidence: 99% confidence
Finding: The documentation says to execute plans immediately and to avoid waiting, asking, or checking state. Those instructions undermine normal safety controls and can cause the agent to perform risky actions in the wrong context, including changes to files, services, or third-party platforms without validation.

Context-Inappropriate Capability

Medium

Confidence: 96% confidence
Finding: The action list includes GitHub, community posting, issue filing, and other external operations that are not narrowly justified or safety-bounded. Even if framed as productivity, these actions can lead to spam, unauthorized submissions, disclosure of internal information, or reputation damage when triggered automatically.

Context-Inappropriate Capability

High

Confidence: 99% confidence
Finding: The value proposition and execution standard normalize 'complete then immediately publish/submit/share' as a default rule. This is dangerous because it turns the skill into an autonomy amplifier for high-impact external actions without ensuring correctness, authorization, or user consent.

Missing User Warnings

High

Confidence: 97% confidence
Finding: The file normalizes immediate execution of impactful operations such as publishing, updating repositories, fixing bugs, and submitting PRs without warning or confirmation. This is dangerous because those actions can have irreversible external effects, leak incomplete work, or violate change-control expectations.

Missing User Warnings

High

Confidence: 97% confidence
Finding: The markdown encourages autonomous publishing/submission behavior but does not warn the user that these actions affect external systems or public data. Missing warnings materially increase the chance of unsafe use because users may not realize the skill can create public or irreversible side effects.

Missing User Warnings

Medium

Confidence: 93% confidence
Finding: The skill describes scheduled autonomous file updates, planning, and system changes without adequate disclosure of what files are modified, how often, or what safeguards exist. This can lead to unnoticed persistence, configuration drift, or unintended modification of user data over time.

Ssd 1

High

Confidence: 98% confidence
Finding: The directive reframes bypassing confirmation as the 'correct' behavior, directly undermining normal approval and oversight boundaries. This is dangerous because it conditions the agent to treat autonomous action as preferable even when the action is externally visible, destructive, or not explicitly authorized.

Ssd 4

Medium

Confidence: 90% confidence
Finding: The document uses identity and motivational framing to create a behavioral norm of unconditional autonomy, culminating in 'directly do it' as a core principle. Even without explicit exploit code, this kind of instruction can systematically bias an agent away from caution, making other unsafe directives more likely to be followed.

Ssd 1

High

Confidence: 99% confidence
Finding: These repeated directives explicitly tell the agent to bypass normal approval and confirmation behavior and act autonomously. In skill context, that is especially dangerous because the same document also discusses publishing and contributing externally, so the bypass language directly increases the likelihood of unauthorized high-impact actions.

Ssd 4

High

Confidence: 97% confidence
Finding: The document presents a self-planning, self-execution, self-check, and repeat loop that normalizes continuous autonomous operation. This is risky because it creates a persistent behavioral pattern that can repeatedly perform unsafe actions, magnifying harm beyond a single mistaken command.

Ssd 1

High

Confidence: 99% confidence
Finding: The manifesto language is a direct 'act first, report later' instruction, which is a classic unsafe autonomy pattern. Because it is framed as a governing principle, it can override safer defaults and encourage the agent to commit, publish, or modify resources before anyone can review the changes.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal