cross-ref

Security checks across malware telemetry and agentic risk

Overview

The skill has a legitimate repository-maintenance purpose, but it includes bulk GitHub write actions with automation framed around avoiding abuse detection.

Review before installing if you will run it on repositories where your GitHub account can write. Use plan mode first, inspect every proposed comment/label/close action manually, prefer least-privileged credentials, and avoid using the bulk posting behavior to disguise automation or bypass platform protections.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Findings (11)

Lp3

Medium
Category
MCP Least Privilege
Confidence
93% confidence
Finding
The skill clearly instructs reading and writing workspace files and invoking shell/gh-based workflows, but no explicit permission declaration is present. That creates a capability/expectation gap: a caller or platform may treat the skill as low-risk while it can access environment data and persist artifacts, which is dangerous if reused in broader contexts or granted implicit trust.

Description-Behavior Mismatch

Medium
Confidence
82% confidence
Finding
Although the skill says it starts in report-only mode, it also contains fully specified write operations for commenting, labeling, and closing PRs/issues. In agent settings where user intent is inferred loosely or state is mishandled, these embedded action paths increase the chance of accidental repository modification despite the supposed dry-run default.

Vague Triggers

Medium
Confidence
88% confidence
Finding
The README says the skill activates automatically from broad natural-language cues like 'find related PRs' or 'link issues and PRs,' which increases the chance of unintended invocation on the wrong repository or with unsafe defaults. In a tool that can later perform bulk commenting, labeling, or closing actions, ambiguous activation materially raises the risk of accidental high-impact operations.

Natural-Language Policy Violations

High
Confidence
98% confidence
Finding
The README explicitly states that posting behavior is designed 'to not trigger GitHub's abuse detection,' which is an instruction to evade platform safety controls rather than simply comply with rate limits. Evasion guidance is dangerous because it enables higher-volume automated actions while reducing visibility to protective systems intended to stop spam or abusive automation.

Vague Triggers

Medium
Confidence
76% confidence
Finding
The trigger language is broad enough to match many routine repository-maintenance requests, which can cause the skill to activate in contexts where the user did not intend large-scale analysis or potential follow-on write actions. Because this skill can eventually comment, label, and close items, overbroad invocation increases the risk of surprising or excessive repository operations.

Missing User Warnings

Medium
Confidence
84% confidence
Finding
The script performs write actions against GitHub by posting comments without any interactive confirmation, dry-run gate, or equivalent explicit approval check in the script itself. In a skill designed to operate across many PRs/issues, this increases the risk of unintended mass commenting, repository spam, and irreversible workflow disruption if upstream approval data or inputs are wrong.

Natural-Language Policy Violations

High
Confidence
96% confidence
Finding
The script explicitly describes making API activity appear 'organic-looking' through jitter, breathing pauses, and backoff, which indicates intent to disguise automation rather than merely rate-limit safely. In the context of bulk GitHub commenting, this can help evade platform anti-abuse detection and enables stealthy large-scale manipulation or spam campaigns.

Ssd 4

Medium
Confidence
95% confidence
Finding
Framing anti-detection mechanics as ordinary rate limiting normalizes stealthy automation and may encourage operators to believe the behavior is acceptable. In context, the skill is capable of mass commenting and repository hygiene actions, so hiding automation patterns increases the potential for spam, manipulative repo activity, or policy-violating bot behavior.

Ssd 4

Medium
Confidence
96% confidence
Finding
The rate-limit section gives concrete tactics—jitter, breathing pauses, and backoff tuned around detection patterns—for making automated posting appear less detectable. That goes beyond responsible throttling and provides operational guidance for evading anti-abuse systems, which could be repurposed for spam or large-scale manipulative commenting across repositories.

Ssd 4

Medium
Confidence
94% confidence
Finding
The staged jitter and 'breathing pauses' are not just normal retry behavior; they are designed to mimic human usage patterns during bulk comment posting. That makes the automation more dangerous in this skill context because the skill can act across many PRs/issues, turning repository maintenance tooling into a mechanism for covert mass outreach, spam, or abuse-evasion.

Unrestricted Tool Access

Medium
Category
Excessive Agency
Content
- The **complete** PR index (compact, ~60KB) — for duplicate detection
- The existing references map (so it skips already-linked items)

**Spawn subagents using the Task tool:**

```
For each batch B of {batch_size} PRs:
Confidence
88% confidence
Finding
tool:*

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal