REFINE: The Self-Evolving Agent

Security checks across malware telemetry and agentic risk

Overview

This local diagnostics skill mostly matches its purpose, but it overstates privacy protections while saving user-provided feedback, error messages, and context to disk.

Review before installing if you handle sensitive prompts, credentials, personal data, or private customer logs. Use only short non-sensitive labels and metadata, avoid passing raw prompts or full error text, periodically inspect or delete refine_memory.json, and review any generated System Prompt Patch before applying it.

SkillSpector

By NVIDIA

Vulnerability Patterns

MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code

Findings (2)

Intent-Code Divergence

Medium

Confidence: 95% confidence
Finding: The sanitizer blocks sensitive content based primarily on key names and type/length checks, but it does not inspect values for secrets, prompts, PII, or stack-trace-like content when those values appear under innocuous keys. Because this skill is explicitly designed for session diagnostics and persistent memory, callers are likely to pass rich context and error text, making the mismatch between documentation and actual behavior a real confidentiality risk.

Intent-Code Divergence

Medium

Confidence: 96% confidence
Finding: The module-level documentation states that raw prompts, credentials, and PII are not written, but the implementation can still persist such material if it is supplied as a value under an unblocked key name such as 'note', 'details', or 'message'. In a logging and memory skill, this creates a realistic risk of sensitive data retention on disk while misleading operators into believing storage is safe.

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal