Codex Deep Search

Security checks across malware telemetry and agentic risk

Overview

This deep-search skill does the advertised research work, but it also sends research details to Telegram and triggers a local authenticated wake hook in ways that are under-scoped for general installation.

Review before installing. Use it only if you intentionally want background Codex searches, Telegram result delivery, and local OpenClaw hook signaling. Replace or remove the hardcoded Telegram group, avoid using it for confidential research unless local-only mode is enforced, and audit the hook-token behavior and hardcoded paths first.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Rogue AgentSelf-Modification, Session Persistence
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration

Findings (8)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 91% confidence
Finding: The skill invokes shell commands and background processes but does not declare corresponding permissions, which weakens the trust boundary and hides its actual execution capabilities from reviewers and policy controls. In a skill that accepts user-provided prompts and launches a script, undeclared shell access increases the chance of unreviewed command execution paths or abuse through surrounding tooling.

Tp4

High

Category: MCP Tool Poisoning
Confidence: 97% confidence
Finding: The documented purpose is deep search, but the skill also references Telegram callbacks, reading local configuration for a hook token, and authenticated POSTs to a local wake endpoint. That hidden behavior materially expands data flow and privilege use beyond user expectations, creating risks of covert exfiltration, internal service interaction, and secret handling without transparent disclosure or consent.

Context-Inappropriate Capability

Medium

Confidence: 95% confidence
Finding: The script performs Telegram notifications and a local wake-hook callback even though the stated skill purpose is deep web search. This expands the skill's behavior into out-of-band messaging and inter-process signaling, which can leak user queries, output paths, and summaries to other channels without being inherent to search functionality.

Description-Behavior Mismatch

Medium

Confidence: 94% confidence
Finding: The implementation includes side-channel communication capabilities beyond the manifest's described deep-search behavior. Hidden or undocumented messaging/callback behavior is dangerous because users may assume the skill only searches and writes a report locally, while it also signals external/local services with task data.

Vague Triggers

Medium

Confidence: 89% confidence
Finding: The trigger phrases include broad everyday language such as Chinese phrases meaning 'search in detail' or 'help me look this up,' which can cause the skill to activate in routine conversations where the user did not intend this higher-privilege workflow. Because the skill can spawn background tasks and send results externally, accidental invocation increases the chance of unnecessary data handling and unintended outbound communication.

Missing User Warnings

Medium

Confidence: 97% confidence
Finding: The script sends the search prompt, output path, and a result summary to Telegram, which can expose sensitive research topics or derived content to a third-party messaging channel. Because there is no in-file warning, consent flow, or content redaction, users may unknowingly leak confidential information through normal use of the skill.

Missing User Warnings

Medium

Confidence: 90% confidence
Finding: The script silently reads a bearer token from local configuration and uses it to authenticate a callback to a local HTTP hook. Even though the destination is localhost, this is still an undisclosed privileged action that can trigger other components or workflows, potentially causing unintended automation or data propagation.

Session Persistence

Medium

Category: Rogue Agent
Content: ### Dispatch Mode (recommended — background + callback) ```bash nohup bash /home/ubuntu/clawd/skills/codex-deep-search/scripts/search.sh \ --prompt "Your research query" \ --task-name "notebooklm-research" \ --telegram-group "-5006066016" \
Confidence: 86% confidence
Finding: nohup

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal