Forge 🔨 Repair-Inspect Loop

Security checks across malware telemetry and agentic risk

Overview

Forge is a coherent code-repair helper, but it grants broad automated repair authority and can run project-local code without a clear approval gate.

Install only if you are comfortable with an automated repair workflow modifying a repository. Use it in a clean branch or disposable checkout, define protected-files.txt for sensitive files, review generated task files and diffs before accepting results, disable or avoid automatic commits unless explicitly desired, and do not run summary/doc-sync behavior on untrusted repositories unless you are willing to execute that repository's scripts.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • Behavioral ASTexec() Call, eval() Call, Dynamic Import
  • MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Findings (7)

subprocess module call

Medium
Category
Dangerous Code Execution
Content
checker = workdir / "scripts" / "tools" / "doc-sync-checker.py"
    if checker.exists():
        try:
            result = subprocess.run(
                [sys.executable, str(checker), "--json"],
                capture_output=True, text=True, cwd=str(workdir), timeout=30
            )
Confidence
99% confidence
Finding
result = subprocess.run( [sys.executable, str(checker), "--json"], capture_output=True, text=True, cwd=str(workdir), timeout=30 )

Lp3

Medium
Category
MCP Least Privilege
Confidence
93% confidence
Finding
The skill clearly instructs the agent to execute shell commands and perform project file reads/writes, but the metadata declares no permissions or capability boundaries. This creates a trust and enforcement gap: a caller may assume the skill is documentation-only or low-privilege when it actually orchestrates code modification, state persistence, and git-affecting actions.

Description-Behavior Mismatch

Medium
Confidence
92% confidence
Finding
The skill writes learned 'universal' repair patterns to a shared location under `FORGE_DIR/reflections`, outside the target workdir. That creates a cross-project persistence channel that can exfiltrate sensitive project details into future sessions and lets one repository poison prompts used when repairing another repository.

Context-Inappropriate Capability

High
Confidence
99% confidence
Finding
This code explicitly discovers and executes `scripts/tools/doc-sync-checker.py` from the repository being analyzed. Because the repository content is potentially adversarial and the skill's purpose is orchestration rather than trusted plugin execution, this introduces unnecessary arbitrary code execution risk during routine summary/check operations.

Vague Triggers

Medium
Confidence
91% confidence
Finding
The trigger conditions are broad enough to activate the repair loop on ordinary user phrasing such as 'fix X' or on generic review findings, which can cause the skill to begin autonomous code modification unexpectedly. In this skill context, that is more dangerous because the protocol also permits repeated automated repair cycles and downstream commits, increasing the chance of unintended repository changes from ambiguous requests.

Missing User Warnings

High
Confidence
96% confidence
Finding
The protocol states that a PASS result leads to an automatic commit, but it does not require a clear user warning or fresh consent before modifying repository state. In a code-repair orchestrator, this is especially risky because commits create durable changes, may trigger CI/CD or downstream automation, and can persist incorrect or unsafe modifications without the user's awareness.

Missing User Warnings

High
Confidence
97% confidence
Finding
The protocol promises automatic FAIL-to-repair looping 'without asking the user,' which allows repeated code changes across multiple rounds without renewed consent or visibility. In this skill, that materially increases risk because autonomous iteration can amplify mistakes, broaden the blast radius of a bad repair strategy, and accumulate repository changes before a human notices.

VirusTotal

63/63 vendors flagged this skill as clean.

View on VirusTotal