Back to skill

Security audit

humanize

Security checks across malware telemetry and agentic risk

Overview

This skill is a disclosed Chinese copy-rewriting tool that installs a local runtime, uses model backends, and stores run artifacts as part of its stated workflow.

Install only if you are comfortable with a local Python runtime, dependency/model downloads, run folders containing your drafts and outputs, and use of your configured CoPaw or local OpenAI-compatible model backend. Avoid submitting confidential customer, legal, or internal text unless your active model and local filesystem retention are acceptable.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • Behavioral ASTexec() Call, eval() Call, Dynamic Import
Findings (21)

subprocess module call

Medium
Category
Dangerous Code Execution
Content
run_dir = output_root / case_id
    if run_dir.exists():
        subprocess.run(["rm", "-rf", str(run_dir)], check=True)

    cmd = [
        python_bin,
Confidence
91% confidence
Finding
subprocess.run(["rm", "-rf", str(run_dir)], check=True)

subprocess module call

Medium
Category
Dangerous Code Execution
Content
check=True,
        env=_subprocess_env(),
    )
    subprocess.run(
        [
            str(runtime_python()),
            "-m",
Confidence
84% confidence
Finding
subprocess.run( [ str(runtime_python()), "-m", "pip", "install", "--disable-pip-version-check", "-r", st

subprocess module call

Medium
Category
Dangerous Code Execution
Content
def run_command(command: Iterable[str], *, env: dict[str, str] | None = None) -> None:
    subprocess.run(list(command), check=True, env=_subprocess_env(env))
Confidence
89% confidence
Finding
subprocess.run(list(command), check=True, env=_subprocess_env(env))

Description-Behavior Mismatch

High
Confidence
91% confidence
Finding
This script implements a generic baseline-versus-challenger scoring pipeline that is materially broader than the declared purpose of a Chinese copy humanization skill. Capability mismatch is dangerous because it can hide undeclared evaluation or optimization workflows inside a benign-looking skill package, reducing transparency and making it easier to repurpose the skill for covert ranking or policy-bypass tuning.

Context-Inappropriate Capability

Medium
Confidence
82% confidence
Finding
The automated keep/discard decision logic enables systematic comparison and selection of outputs, which is not justified by the stated end-user function of rewriting Chinese copy to sound more natural. In this context, the undisclosed automation increases risk because it provides hidden optimization infrastructure that could be used to iteratively tune prompts or outputs beyond the user-visible purpose of the skill.

Context-Inappropriate Capability

Medium
Confidence
89% confidence
Finding
The installer copies the repository into the user's CoPaw skill pool and workspace, then edits the local workspace manifest to register and optionally enable the skill. That behavior is outside the humanization skill's declared purpose and creates persistent local state changes, which increases risk because installing the skill also grants it a foothold in the agent environment and may surprise users or bypass normal review expectations.

Context-Inappropriate Capability

Medium
Confidence
95% confidence
Finding
For a text-humanization skill, shell-level recursive deletion is unnecessary and expands the skill's capabilities beyond its stated purpose. Even in a regression script, this creates avoidable destructive filesystem power that could delete arbitrary directories if an operator passes an unsafe output root or if path handling is later weakened.

Description-Behavior Mismatch

Medium
Confidence
95% confidence
Finding
The skill advertises text humanization, but this module creates a runtime, installs packages, and downloads model artifacts from Hugging Face. That mismatch increases risk because users may grant trust appropriate for a copywriting tool while the code performs software installation and network retrieval on the host.

Context-Inappropriate Capability

High
Confidence
94% confidence
Finding
The module probes host interpreters and includes a general pattern of launching local processes, which is not tightly aligned with the stated purpose of rewriting Chinese copy. In this skill context, unjustified execution capability is more dangerous because it expands the attack surface far beyond content transformation.

Vague Triggers

Medium
Confidence
77% confidence
Finding
The invocation guidance is broad enough that ordinary user phrasing about making text sound more human could trigger the skill automatically. Because this skill performs shell execution, local file writes, and possibly model bootstrap/network access, over-broad activation raises the risk of unintended tool use and data handling without a clear user request for those operations.

Vague Triggers

Medium
Confidence
84% confidence
Finding
The loose fallback examples allow very ambiguous natural-language requests to count as valid activation input. In context, that means the agent may infer permission to launch the full local optimization pipeline, persist user drafts, and reveal process artifacts when the user may have wanted only a simple rewrite.

Missing User Warnings

Medium
Confidence
83% confidence
Finding
The code sends prompts and model payloads over HTTP to a configurable endpoint without any in-code disclosure, consent, or trust-boundary checks. In this skill context, user-provided drafts may contain sensitive communications content, so silently transmitting them to a service selected by environment or log-derived URL increases privacy and data exfiltration risk.

Missing User Warnings

Medium
Confidence
88% confidence
Finding
This path forwards full system and user prompts to another script via a subprocess, again without visible disclosure or minimization. Although the bridge is local, it expands the data exposure surface to another process and potentially another provider selected by the bridge, which matters because this skill processes user communication text that may include confidential business or personal content.

Missing User Warnings

Medium
Confidence
89% confidence
Finding
The script persists the raw user brief, parsed payload, session plan, and original text to disk by default under the runs directory, with no minimization, redaction, consent check, or retention control. Because this skill is designed for human communication tasks, inputs are likely to contain sensitive personal, business, customer-service, or internal communication data, so local disclosure risk is materially increased if the filesystem is shared, backed up, synced, or later exposed.

Missing User Warnings

Medium
Confidence
90% confidence
Finding
The script persistently writes user briefs, parsed payloads, source text, model generations, scoring details, and trace artifacts to the run directory. If those inputs contain sensitive business, customer, or personal data, the tool creates an undeclared local data-retention surface that can expose information to other users, backups, logs, or later processes.

Missing User Warnings

Medium
Confidence
91% confidence
Finding
The code sends the task, original/source text, constraints, and rewrite content to a model backend via call_chat without any visible notice or consent mechanism in this file. In a skill intended for humanizing Chinese communications, that content may routinely include customer messages, internal business updates, complaint handling, recruiting emails, and other sensitive text, making undisclosed third-party transmission materially risky.

Missing User Warnings

Low
Confidence
85% confidence
Finding
The output payload includes backend metadata such as llm_base_url, llm_model, and backend error details. While not usually critical alone, exposing internal service endpoints or configuration details can aid reconnaissance, leak provider relationships, or reveal internal infrastructure to downstream consumers who do not need that information.

Missing User Warnings

Medium
Confidence
92% confidence
Finding
This runtime bootstrap installs packages and later enables model download without any visible warning or consent in the file. Silent software installation and network activity are risky because they can surprise users, bypass informed trust decisions, and increase exposure to supply-chain compromise.

Missing User Warnings

Medium
Confidence
87% confidence
Finding
The force-reinstall path deletes the runtime virtual environment recursively without a visible warning or confirmation. While the target path is scoped under the skill runtime directory, destructive file operations still warrant explicit safeguards to prevent accidental data loss or misuse if path assumptions ever break.

Missing User Warnings

Medium
Confidence
92% confidence
Finding
The code persists a rolling history that includes the full task text plus previews of baseline and challenger outputs to a JSON file on disk. In a copywriting skill, those fields can contain sensitive business communications, personal data, customer messages, or confidential drafts, so local persistence creates an unnecessary data-retention and privacy exposure if the runtime directory is accessible, backed up, or reused across sessions.

Ssd 3

Medium
Confidence
96% confidence
Finding
The skill mandates always revealing the full optimization process and relaying user-visible output verbatim, including baseline text, candidate drafts, scores, failure tags, and traces. For rewrite tasks, this can unnecessarily expose sensitive user-provided message content and internal session metadata, violating data minimization and potentially disclosing confidential drafts or context to the end user or downstream logs.

VirusTotal

62/62 vendors flagged this skill as clean.

View on VirusTotal

Static analysis

No suspicious patterns detected.