Knot Agent Eval

Security checks across malware telemetry and agentic risk

Overview

This is a coherent Knot RAG evaluation skill that uses user-provided credentials and writes expected evaluation artifacts, but users should treat those outputs as sensitive.

Install and use this only if you are authorized to evaluate the target Knot agent and knowledge base. Use environment-variable tokens, run it in a dedicated access-controlled working directory, keep generated JSON/cache/report files out of version control, and redact outputs before sharing them.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
  • Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Findings (2)

Missing User Warnings

Low
Confidence
82% confidence
Finding
The skill instructs the agent to generate and write multiple files such as eval_questions.json, eval_results.json, ai_review_results.json, and reports without an explicit user-facing warning at the point of modification. In an agent setting, silent file creation or overwrite can cause accidental data loss, workspace pollution, or unintended persistence of sensitive evaluation data.

Missing User Warnings

Medium
Confidence
94% confidence
Finding
The script persistently writes raw evaluation artifacts to disk, including model answers, source document metadata, conversation identifiers, chat URLs, and partial thinking traces, without minimization, masking, encryption, or an explicit consent/disclosure step. In this skill’s context, the data comes from enterprise knowledge-base evaluation flows, so the stored outputs may contain sensitive internal content and retrievable links that expand exposure if the files are copied, synced, or accessed by other users/processes.

VirusTotal

VirusTotal findings are pending for this skill version.

View on VirusTotal