CodeSmith

Security checks across malware telemetry and agentic risk

Overview

CodeSmith is a disclosed coding-agent configuration package with real autonomy features, but the artifacts are coherent, instruction-only, and include clear review and safety boundaries.

Install only if you want a more autonomous coding partner. Before enabling cron jobs or ACP dispatch, review the files, keep GitHub and hosting credentials least-privilege, add crons one at a time, verify LOCKDOWN works, and require human approval for production deploys, main-branch merges, public posts, and external messages.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • Rogue AgentSelf-Modification, Session Persistence
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Findings (4)

Vague Triggers

Medium
Confidence
89% confidence
Finding
The skill states it is applied automatically by another skill during a scheduled ingestion cycle, but it does not define strong activation constraints, approval gates, or scope limits. In a code-execution and CI/CD context, broad automatic ingestion increases the chance that operational guidance, cron recommendations, or autonomy-enabling configuration is adopted without deliberate human review, which can expand privileges or change agent behavior unexpectedly.

Autonomous Decision Making

Medium
Category
Excessive Agency
Content
[HUMAN_NAME] communicates via voice-to-text most of the time. This means: verbose messages, stream-of-consciousness, sometimes multiple ideas run together, occasional transcription errors. The content is always real — the format is variable.

What I learned to do: pull the task from the noise without asking "did you mean X?" Voice-to-text users have already explained what they want — they don't want to explain it again in a different format. Parse the signal. Act on it. Clarify only when the ambiguity would cause real problems.

**The clarification rule:**
Confidence
86% confidence
Finding
without asking

Scope Creep

Low
Category
Excessive Agency
Content
4. **Assumptions presented as facts.** "I assumed X" presented honestly is fine. "X is true" when X is an assumption destroys credibility when X turns out to be wrong.

5. **Scope creep without notice.** Working beyond the stated task without saying so. Even if the extra work was valuable — especially if the extra work was valuable — saying "I also did Y while I was there" after the fact is better than doing it silently.

---
Confidence
72% confidence
Finding
beyond the stated

Session Persistence

Medium
Category
Rogue Agent
Content
"sessionTarget": "isolated",
      "payload": {
        "kind": "agentTurn",
        "message": "LOCKDOWN CHECK FIRST: if ~/.openclaw/workspace/LOCKDOWN.md exists, halt immediately and alert [HUMAN_NAME].\n\nYou are [AGENT_NAME]. Evening Planning Session (8:30 PM).\n\nCheck [YOUR_CHANNEL] for any context [HUMAN_NAME] dropped during the day.\nReview what happened today.\n\nPick ONE focus for tonight's 11 PM work session:\n- Something [HUMAN_NAME] can't do manually but you can\n- An audit of what we have (bugs, broken links, stale data)\n- A feature or prototype to build\n- Something that will create genuine value in the morning\n\nDO NOT plan: live deploys, social posts, public pushes to main.\nDO: branches, internal tools, audits, prototypes, specs.\n\nWrite plan to: ~/workspace/memory/overnight-plan-[today's date].md\n\nFormat:\n1. What and why it matters\n2. Specific steps\n3. Success definition\n4. Files/repos needed\n\nPost 2-sentence summary to [YOUR_CHANNEL]."
      },
      "delivery": {
        "mode": "announce",
Confidence
88% confidence
Finding
create genuine value in the morning\n\nDO NOT plan: live deploys, social posts, public pushes to main.\nDO: branches, internal tools, audits, prototypes, specs.\n\nWrite plan to: ~/workspace/memory/ov

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal