AgentGuard

Security checks across malware telemetry and agentic risk

Overview

AgentGuard appears to be a legitimate security tool, but it asks for broad local and system access that users should review before installing.

Install only if you want a security auditor with broad visibility into local agent state, installed skills, workspaces, credential-directory metadata, environment names, network/cron status, and Web3 action context. Review generated reports before sharing them, verify the external AgentGuard package/version, and avoid enabling auto-scan or daily patrols unless you want ongoing local monitoring.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Rogue AgentSelf-Modification, Session Persistence
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
YARA SignaturesMalware Match, Webshell Match, Cryptominer Match
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection

Findings (8)

Tp4

High

Category: MCP Tool Poisoning
Confidence: 88% confidence
Finding: The skill description presents it as a general security guard, but the body also grants itself trust-registry modification, audit-log writing, browser-opening, and report generation behaviors that materially expand its authority. This gap can mislead users into invoking a skill with broader side effects than expected, increasing the chance of unintended state changes or disclosure through generated reports and browser actions.

Missing User Warnings

Low

Confidence: 84% confidence
Finding: The README says the report opens automatically in the browser and includes shareable social-media content, but it does not clearly warn users that the generated report or image may contain sensitive security posture information. In a security tool, this omission can lead users to unintentionally expose environment details, scores, or findings by opening or sharing the output without an explicit privacy notice or confirmation step.

Natural-Language Policy Violations

Low

Confidence: 79% confidence
Finding: The README advertises prewritten shareable copy for X, Telegram, and WhatsApp in multiple languages without stating that sharing is strictly opt-in. In the context of a security assessment skill, encouraging easy social sharing without explicit consent language increases the risk of accidental disclosure of internal security status or other sensitive findings.

Vague Triggers

Medium

Confidence: 83% confidence
Finding: The skill is advertised with very broad, loosely bounded security functions, which increases the likelihood it will be selected for generic requests and then gain access to sensitive files, env data, and shell tooling. Overbroad scope is risky because it normalizes powerful operations under a vague 'security' banner, reducing user scrutiny.

Missing User Warnings

Medium

Confidence: 92% confidence
Finding: The skill instructs writing HTML reports and appending audit-log entries, but these persistent writes are not clearly disclosed in the user-facing description. Undisclosed persistence can store sensitive scan output or operational history on disk, which may later be accessed by other local processes or users.

Natural-Language Policy Violations

Medium

Confidence: 95% confidence
Finding: The policy explicitly says phishing and address-security check failures are silently skipped when GoPlus is unavailable, while still allowing decisions to proceed on weaker policy-only logic. In a security guard skill, this creates fail-open behavior and can cause users or downstream agents to trust approvals that were made without the advertised external threat intelligence, especially for Web3 actions where phishing and malicious-address detection are core safeguards.

Session Persistence

Medium

Category: Rogue Agent
Content: ### How to Set 1. Read `$ARGUMENTS` to get the desired level 2. Write the config to `~/.agentguard/config.json`: ```json {"level": "balanced"}
Confidence: 81% confidence
Finding: Write the config to `~/.agentguard/config.json`: ```json {"level": "balanced"} ``` 3. Confirm the change to the user If no level is specified, read and display the current config. --- # Reporting

YARA rule 'backdoor_persistence': Backdoor persistence with malicious payloads (shell commands, SSH key injection, hidden root users) [malware]

High

Category: YARA Match
Content: access: read-write reason: "Read/write audit log (audit.jsonl) and protection level config (config.json)" user-invocable: true allowed-tools: Read, Write, Grep, Glob, Bash(node *trust-cli.ts *) Bash(node *action-cli.ts *) Bash(*checkup-report.js) Bash(echo *checkup-report.js) Bash(cat *checkup-report.js) Bash(openclaw *) Bash(ss *) Bash(lsof *) Bash(ufw *) Bash(iptables *) Bash(crontab *) Bash(systemctl list-timers *) Bash(find *) Bash(stat *) Bash(env) Bash(sha256sum *) Bash(node *) Bash(cd *) argument-hint: "[scan|action|patrol|trust|report|config|checkup] [args...]" ---
Confidence: 86% confidence
Finding: crontab *) Bash(systemctl list-timers *) Bash(find *) Bash(stat *) Bash(env) Bash(sha256sum *) Bash(node *) Bash; crontab -l 2>/dev/null` for suspicious entries containing `curl|bash`, `wget

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal