Self Assessment

Security checks across malware telemetry and agentic risk

Overview

This skill is a task self-assessment guide, but it also tells agents to install new skills and rewrite persistent identity information without clear user approval.

Install only if you are comfortable with an agent reading its local identity and memory context, modifying its own profile, and potentially adding more skills later. A safer version would keep the self-assessment output but require explicit user approval before any skill installation or IDENTITY.md change.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (3)

Description-Behavior Mismatch

Medium
Confidence
94% confidence
Finding
The skill’s stated purpose is pre-task self-assessment, but it also instructs the agent to perform post-task profile maintenance and modify persistent state. That scope expansion is dangerous because it normalizes unrelated side effects, increasing the chance that a task-gating skill becomes a vehicle for unauthorized self-modification or persistence changes.

Context-Inappropriate Capability

High
Confidence
98% confidence
Finding
Automatically searching for and installing new skills during self-assessment is a high-risk capability expansion unrelated to merely deciding whether to accept a task. This can lead to untrusted code or prompt content being introduced into the agent’s workspace before the task is even accepted, creating a supply-chain and privilege-escalation path.

Context-Inappropriate Capability

Medium
Confidence
95% confidence
Finding
Directing the agent to update IDENTITY.md after every task exceeds the skill’s declared purpose and causes persistent self-modification. Persistent profile changes can be abused to bias future task acceptance decisions, poison memory/identity context, or smuggle attacker-influenced instructions into later runs.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal