违规词·发帖前合规检查

Security checks across malware telemetry and agentic risk

Overview

The checker’s core wordlist function is legitimate, but it can send draft text to DeepSeek and expose an unauthenticated local API without clear disclosure in the main skill instructions.

Install only after reviewing the privacy tradeoff. To keep drafts local, do not set DEEPSEEK_API_KEY and remove it from generic env files before use. Avoid running api_server.py on shared or public networks unless you add localhost binding and authentication.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
Taint TrackingDirect Taint Flow, Variable-Mediated Taint Flow, Credential Exfiltration Chain
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection

Findings (8)

Tainted flow: 'req' from os.environ.get (line 142, credential/environment) → urllib.request.urlopen (network output)

Critical

Category: Data Flow
Content: headers={'Authorization': f'Bearer {key}', 'Content-Type': 'application/json'}, method='POST', ) with urllib.request.urlopen(req, timeout=30) as r: out = json.loads(r.read().decode()) return out['choices'][0]['message']['content'].strip() except Exception:
Confidence: 98% confidence
Finding: with urllib.request.urlopen(req, timeout=30) as r:

Lp3

Medium

Category: MCP Least Privilege
Confidence: 91% confidence
Finding: The skill declares only benign local checking behavior, but detected capabilities include environment access, file read/write, and network use without any corresponding permission disclosure. In a compliance-checking context, this is dangerous because users may submit sensitive draft content expecting local-only processing, while undeclared network and file capabilities can enable silent exfiltration, persistence, or broader system interaction.

Tp4

High

Category: MCP Tool Poisoning
Confidence: 97% confidence
Finding: The documented behavior says the skill performs local comparison against built-in wordlists, but the actual behavior reportedly sends user content and hits to an external DeepSeek service, exposes an HTTP server, and overstates continuous updates. This mismatch undermines informed consent and can cause users to disclose sensitive or unpublished content to third parties or remote interfaces they were never told about.

Context-Inappropriate Capability

Medium

Confidence: 96% confidence
Finding: The skill description suggests a local compliance checker, but the implementation additionally reads credentials and sends user content to a third-party API for suggestions. That mismatch is security-relevant because users may trust the tool with draft ad copy or sensitive text without realizing it leaves the host.

Natural-Language Policy Violations

Medium

Confidence: 91% confidence
Finding: This wordlist enforces a large, hard-coded Chinese-language censorship/moderation policy without any visible locale selection, jurisdiction scoping, or user-configurable policy controls. In a compliance-checking skill, that can cause overblocking, viewpoint-based suppression, and inappropriate moderation of users outside the intended region or regulatory context, especially because the list mixes sexual terms, politics, names, websites, and ordinary strings like place names or numbers.

Natural-Language Policy Violations

High

Confidence: 97% confidence
Finding: This is a true policy and safety issue: the wordlist explicitly includes ethnically derogatory slurs targeting protected groups, which can normalize or operationalize hateful language handling in a way that may enable discriminatory moderation, logging, or downstream misuse. In this skill’s context, a compliance/sensitive-word checker is intended to screen content, which makes the presence of such slurs somewhat expected, but it is still dangerous because the list is broad, uncurated, and includes hateful terms without clear safeguards, provenance, or category boundaries.

Missing User Warnings

Medium

Confidence: 97% confidence
Finding: Potentially sensitive user text is transmitted to a third-party model API without any user-facing warning, consent, masking, or disclosure. In the context of a compliance-check skill, submitted content may include unpublished marketing copy, regulated claims, or other confidential material, making silent transmission especially risky.

Missing User Warnings

Low

Confidence: 89% confidence
Finding: The script silently searches for credentials in environment variables and local .env-like files, which expands its access to secrets beyond explicit user input. While it only targets a named API key, undisclosed credential harvesting behavior is risky and violates least surprise for a tool that appears to be a local text checker.

VirusTotal

63/63 vendors flagged this skill as clean.

View on VirusTotal