Security audit

Alibabacloud Waf Rule Management

Security checks across malware telemetry and agentic risk

Overview

The skill is mostly a WAF diagnostic helper, but it contradicts its read-only promise by requiring local CLI configuration changes, plugin updates/installs, and shipping an unrelated script that can rewrite a skill file.

Install only if you are comfortable with an agent using Alibaba Cloud credentials to query WAF/SLS data and changing local Alibaba Cloud CLI/plugin state. Prefer a constrained RAM user, review plugin update/install commands before running them, avoid storing sensitive rule or log JSON in shared `/tmp` paths, and remove the unrelated translation script from runtime packages if possible.

SkillSpector

By NVIDIA

Vulnerability Patterns

Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
System Prompt LeakageDirect Leakage, Indirect Extraction, Tool-Based Exfiltration
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands

Findings (12)

Tp4

High

Category: MCP Tool Poisoning
Confidence: 95% confidence
Finding: This is a mismatch because, while the main rule_matcher.py script is broadly consistent with read-only WAF rule-effectiveness diagnosis, the included translate_to_english.py script performs an unrelated write operation on a local skill file. That behavior is not represented in the declared description and violates the stated framing of human-readable, read-only diagnostic guidance. Additionally, the described SLS-querying capability is not present in the provided code; the code consumes provided JSON rather than querying logs itself.

Description-Behavior Mismatch

High

Confidence: 98% confidence
Finding: The skill is advertised as read-only and text-only, but it instructs the agent to run `aliyun configure ai-mode enable`, set user-agent, update plugins, and later disable ai-mode. These are local state-changing operations that alter CLI configuration and environment behavior, violating least surprise and potentially affecting other workflows on the host.

Intent-Code Divergence

High

Confidence: 98% confidence
Finding: The file explicitly promises no write operations, but later requires plugin updates/installation and CLI configuration changes. This contradiction is dangerous because operators may trust the 'read-only' claim and approve execution that unexpectedly mutates the local system and toolchain.

Intent-Code Divergence

High

Confidence: 97% confidence
Finding: The skill bans script/JSON generation and claims text-only output, yet later depends on executing `python scripts/rule_matcher.py` with `rule.json` and `log.json` inputs. This creates a hidden expansion of capability from guidance into local data preparation and code execution, which can mislead reviewers and users about what the skill actually does.

Description-Behavior Mismatch

Medium

Confidence: 96% confidence
Finding: The documented workflow includes running a local Python diagnosis engine over generated files, which exceeds 'human-readable guidance only' and adds code-execution risk on the host. Even if the script is intended for diagnostics, local execution broadens the trust boundary and can expose users to unintended side effects or supply-chain risk from bundled scripts.

Context-Inappropriate Capability

Medium

Confidence: 95% confidence
Finding: Plugin installation/update capability is not necessary for a nominally read-only WAF diagnostic guidance assistant, yet the skill instructs the agent to update plugins. This introduces software-change and supply-chain risk on the host, and may affect unrelated CLI behavior beyond the immediate task.

Context-Inappropriate Capability

Medium

Confidence: 95% confidence
Finding: Executing `python scripts/rule_matcher.py` goes beyond the skill's stated text-only purpose and introduces arbitrary local script execution. In a security-sensitive environment, bundled scripts must be treated as code with full review requirements, not as harmless documentation.

Description-Behavior Mismatch

High

Confidence: 97% confidence
Finding: The script opens the skill's SKILL.md for writing and overwrites its contents, which contradicts the stated read-only behavior of the skill. Even though it appears intended for maintenance/translation, this creates an undocumented local file-modification capability that could alter agent behavior or documentation without user awareness.

Context-Inappropriate Capability

Medium

Confidence: 90% confidence
Finding: This script implements documentation rewriting against a local SKILL.md file, which is unrelated to the advertised WAF diagnostic function. Extra capabilities that modify local skill assets expand the attack surface and can be abused to tamper with prompts, instructions, or operator-visible documentation.

Missing User Warnings

Medium

Confidence: 87% confidence
Finding: The skill directs plugin update/installation without clearly warning the user that this changes local software state and may download code from external sources. Lack of user-impact disclosure increases the chance of unintended host modification in environments that expected read-only diagnostics.

Missing User Warnings

Medium

Confidence: 87% confidence
Finding: The document instructs users to save WAF rule configuration JSON and log entries to predictable paths under /tmp, which is a shared temporary directory on many Unix-like systems. Those files may contain sensitive traffic metadata, rule logic, IPs, headers, or request details, and can be read, overwritten, or retained longer than intended by other local users or processes.

Direct Prompt Extraction

High

Category: System Prompt Leakage
Content: ### 2. NO Script Generation - ❌ NEVER generate configuration scripts (bash, python, etc.) - ❌ NEVER output rule JSON examples - ❌ NEVER write executable CLI commands for configuration - ❌ NEVER save files to outputs directory or any location
Confidence: 26% confidence
Finding: output rule

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal