Security audit

Autoresearch.Bak

Security checks across malware telemetry and agentic risk

Overview

This is a legitimate experiment-running skill, but it can autonomously edit code, run shell commands, and hard-reset git state until the user stops it.

Install only if you intend to let an agent run experiments in a clean disposable branch, worktree, or clone. Before running it, recreate or inspect `autoresearch.config.md`, approve exact target files and commands, set a hard time or experiment limit, require a clean `git status`, and do not allow automatic hard resets in a repository with unrelated work.

SkillSpector

By NVIDIA

Vulnerability Patterns

Tool MisuseTool Parameter Abuse, Chaining Abuse, Unsafe Defaults
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access

Findings (5)

Vague Triggers

Medium

Confidence: 88% confidence
Finding: The trigger list includes broad phrases like "optimize" and "find the best config," which can activate this skill for ordinary requests that do not imply consent for autonomous code modification, shell execution, branching, or repeated experiments. In this skill, accidental invocation is more dangerous because the behavior includes persistent file edits, command execution, and looping actions.

Missing User Warnings

Medium

Confidence: 95% confidence
Finding: The skill instructs the agent to write files, run arbitrary experiment commands, create branches, and perform destructive git operations, but it does not require an upfront safety warning or explicit user confirmation immediately before those actions. Because the skill is user-invocable and defaults into setup/run behavior, a user may trigger high-impact local actions without understanding the consequences.

Tool Parameter Abuse

High

Category: Tool Misuse
Content: # If the run crashed or timed out: # - Read the error from run.log # - Record as crash in results.tsv # - Revert: git reset --hard HEAD~1 # - Diagnose and try a different approach ```
Confidence: 97% confidence
Finding: git reset --hard

Tool Parameter Abuse

High

Category: Tool Misuse
Content: → Log: "KEEP: <description> (<metric>: <old> → <new>)" ELIF metric equal or worse: → DISCARD: git reset --hard HEAD~1 → Log: "DISCARD: <description> (<metric>: <value> vs best <best>)" ELIF crashed or timed out:
Confidence: 98% confidence
Finding: git reset --hard

Tool Parameter Abuse

High

Category: Tool Misuse
Content: → Log: "DISCARD: <description> (<metric>: <value> vs best <best>)" ELIF crashed or timed out: → CRASH: git reset --hard HEAD~1 → Log: "CRASH: <description> (error: <brief error>)" ```
Confidence: 98% confidence
Finding: git reset --hard

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal

Static analysis

No suspicious patterns detected.