simplify-and-harden

Security checks across malware telemetry and agentic risk

Overview

This is mostly a bounded code-review helper, but it also instructs agents to create durable learning records and promote rules into future agent instruction files without a clear approval gate.

Install only if you want an agent to perform automatic post-task cleanup and simple hardening edits. Before enabling the learning loop, require explicit approval for any writes to `.learnings/LEARNINGS.md`, `AGENTS.md`, `CLAUDE.md`, `.github/copilot-instructions.md`, or similar persistent agent-instruction files.

SkillSpector

By NVIDIA

Vulnerability Patterns

Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access

Findings (2)

Context-Inappropriate Capability

Medium

Confidence: 95% confidence
Finding: The skill defines a cross-task learning loop that logs findings and promotes them into persistent agent context files, which extends behavior beyond a bounded post-completion review of the current diff. This creates a durable prompt-modification pathway that can amplify mistaken or adversarially influenced findings across future tasks, increasing the blast radius from one review run to many subsequent agent sessions.

Intent-Code Divergence

High

Confidence: 98% confidence
Finding: The documentation establishes a hard scope lock that the agent must only touch code modified in the current task, but later instructs writing promotion-ready patterns into agent context/system prompt files. That contradiction undermines the primary safety boundary and can authorize edits to high-leverage control files outside the task diff, enabling persistent behavior changes and policy drift across unrelated work.

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal