IBT: Instinct + Behavior + Trust

Security checks across malware telemetry and agentic risk

Overview

This is an instruction-only agent behavior framework with disclosed preference memory, not code that installs, executes, exfiltrates, or modifies systems.

Safe to install as an instruction-only skill if you are comfortable with local preference memory. Review USER.md periodically, delete stale or personal entries, and avoid allowing implicit preferences to be saved unless the user has clearly agreed.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
  • Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Findings (4)

Intent-Code Divergence

High
Confidence
98% confidence
Finding
The skill’s permission model says implicit or learned preferences must not be written persistently without human consent, but the later preference-learning section instructs storing learned preferences in USER.md and applying them automatically. That contradiction can cause agents to persist behavioral inferences about users without clear consent, creating a privacy and policy-violation risk even if the data is framed as non-sensitive.

Missing User Warnings

Medium
Confidence
94% confidence
Finding
These examples normalize checking and retaining user preferences, including across interactions, without any disclosure, consent flow, retention boundary, or opt-out. That can lead implementers to build silent behavioral profiling or memory features that collect personal data beyond user expectations, creating privacy and compliance risk.

Missing User Warnings

Medium
Confidence
97% confidence
Finding
This example explicitly depicts implicit cross-session learning from user behavior over several sessions without informing the user that behavior is being observed and stored. Such silent persistence is privacy-sensitive and can expose users to unexpected profiling, data retention, and misuse of inferred preferences.

Natural-Language Policy Violations

Medium
Confidence
90% confidence
Finding
The instruction to use a default such as 'short-first on Telegram' applies channel-specific behavior without user opt-in or reliable channel detection. This can lead agents to make unwarranted assumptions about user preferences and context, which is especially risky in a skill that also encourages learning and reusing preferences.

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal