ClawLock

Security checks across malware telemetry and agentic risk

Overview

ClawLock appears to be a real security tool, but it asks the agent to install/update external code and replace its own skill file during normal scan flows, so it needs review before installation.

Install only if you trust both the ClawLock PyPI package and the GitHub repository, not just this skill text. Prefer offline/report-only scans first, decline in-band self-updates unless you have reviewed the package and skill-file diff, and run red-team tests only against endpoints you own or are authorized to test.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
  • Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Findings (6)

Intent-Code Divergence

High
Confidence
96% confidence
Finding
The document makes contradictory safety claims: it says the skill cannot perform real attacks or verify exploitability, yet earlier instructions explicitly enable red-team testing that can send attack payloads to a target endpoint. This can mislead users and operators about the operational risk, consent requirements, and network effects of using the skill.

Intent-Code Divergence

Medium
Confidence
88% confidence
Finding
The drift-detection section claims it only reads metadata, permissions, and hashes, but the same workflow also says it scans file contents for prompt-injection and encoding-obfuscation patterns. This inconsistency creates a privacy/transparency issue because users may consent under a narrower data-access model than what the skill actually describes.

Description-Behavior Mismatch

Medium
Confidence
95% confidence
Finding
The skill's documented behavior goes beyond passive scanning and includes self-update actions that install packages and replace local skill files from remote sources. That expands the trust boundary from analysis into code/data modification and creates supply-chain risk if the update source, transport, or version selection is compromised.

Description-Behavior Mismatch

Medium
Confidence
96% confidence
Finding
The file claims static-analysis-style safety boundaries, but later instructs the agent to perform hardening edits, backups, rollback state management, and watch-mode monitoring. This mismatch can cause users or host systems to grant the skill more trust than warranted, while the actual workflow includes persistent local changes and ongoing monitoring behavior.

Intent-Code Divergence

Medium
Confidence
88% confidence
Finding
The document states the tool cannot execute real attacks, yet it advertises red-team and jailbreak testing against endpoints. Even if framed as security testing, this is active interaction with targets and may trigger harmful or unauthorized behavior if used against production or third-party systems.

Intent-Code Divergence

Medium
Confidence
97% confidence
Finding
The file says the LLM only explains tool output, but the hardening workflow explicitly authorizes the LLM to locate files, propose edits, back up originals, and apply changes. This inconsistency weakens user understanding of what the skill may do and increases the chance of unintended config modification.

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal