Alicloud Security Content Moderation Green

Security checks across malware telemetry and agentic risk

Overview

This skill is a disclosed Alibaba Cloud Content Moderation helper that uses normal cloud credentials and local output files, with no evidence of hidden exfiltration or unsafe automatic behavior.

Install only if you intend to let the agent inspect or change Alibaba Cloud Content Moderation resources. Configure least-privilege Alibaba Cloud credentials, confirm region/resource IDs and every create/update/modify/set action before execution, and review or delete generated output files if they contain internal cloud details.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Findings (3)

Lp3

Medium
Category
MCP Least Privilege
Confidence
84% confidence
Finding
The skill instructs use of environment credentials, networked API access, and local file writes, but does not declare permissions that would let a caller or platform understand and constrain those capabilities. This creates a transparency and containment problem: users may invoke a skill that can access sensitive cloud credentials and write artifacts without an explicit permission boundary.

Tp4

High
Category
MCP Tool Poisoning
Confidence
90% confidence
Finding
The declared purpose says the skill manages Alibaba Cloud Green moderation resources and policies, but the described executable path focuses on metadata discovery and saving API inventory artifacts locally rather than actually performing the claimed resource operations. This mismatch is dangerous because it can mislead users and orchestration systems about what the skill really does, expanding trust and invocation scope for behavior that is not accurately disclosed.

Vague Triggers

Medium
Confidence
79% confidence
Finding
The invocation description is broad enough to match many generic cloud-management tasks, which can cause the skill to be selected outside a narrowly intended context. Because the skill also involves credential use, network access, and potential mutation workflows, overly broad routing increases the chance of unintended execution against sensitive cloud environments.

VirusTotal

61/61 vendors flagged this skill as clean.

View on VirusTotal