MiniMax Vision Captcha

WarnAudited by ClawScan on May 10, 2026.

Overview

The skill is mostly aligned with its stated vision-analysis purpose, but its helper script builds and runs a shell command from unescaped user-controlled input, which can allow unintended command execution.

Review this skill before installing. If you use it, provide explicit image paths, avoid sending sensitive screenshots, and do not use CAPTCHA/slider analysis to bypass third-party protections without authorization. The helper script should be fixed to avoid shell command injection before use.

Findings (5)

Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.

What this means

A malicious or malformed prompt/path could cause the agent environment to run unintended local commands.

Why it was flagged

The shell command is assembled from command-line prompt and image path values without escaping or argument separation, so crafted input could break out of the quoted fields and run arbitrary shell commands.

Skill content
const cmd = `mcporter call minimax-coding-plan.understand_image prompt="${prompt}" image_source="${imagePath}"`;
const result = execSync(cmd, { encoding: 'utf-8', timeout: 30000 });
Recommendation

Replace execSync with execFileSync or spawn using an argument array, validate image paths, and avoid passing untrusted strings through a shell.

What this means

Using it on third-party sites may bypass verification checks, violate terms of service, or trigger anti-abuse responses.

Why it was flagged

The skill explicitly supports CAPTCHA and slider-challenge analysis. This is disclosed and central to the skill, but it is a sensitive automation capability that can be misused against anti-bot or account-protection controls.

Skill content
使用MiniMax MCP的视觉理解能力识别图片中的内容,解决验证码、滑块分析等问题。... 场景1:微信滑块验证码 ... 获取滑块位置描述
Recommendation

Use CAPTCHA/slider analysis only in authorized testing, accessibility, or owned-account workflows, and avoid automating protected third-party services without permission.

What this means

Sensitive information visible in screenshots may be shared with the configured MiniMax MCP/provider.

Why it was flagged

The workflow sends screenshots or image files to a MiniMax MCP/provider for analysis. This is expected for the skill, but screenshots can contain private page content.

Skill content
mcporter call minimax-coding-plan.understand_image prompt="描述图片内容" image_source="/path/to/screenshot.png"
Recommendation

Review screenshots before analysis, avoid sending private pages unless necessary, and confirm the MiniMax MCP configuration and data handling are acceptable.

What this means

The latest browser screenshot may not be the image the user intended to analyze and could contain private information.

Why it was flagged

If no image path is supplied, the script selects the newest browser screenshot from the OpenClaw media directory and then sends it to the MiniMax MCP call.

Skill content
const mediaDir = '/root/.openclaw/media/browser'; ... imagePath = path.join(mediaDir, files[0].name); ... const result = execSync(cmd, { encoding: 'utf-8', timeout: 30000 });
Recommendation

Prefer supplying an explicit image path and add a confirmation step before sending automatically selected screenshots.

What this means

The skill may fail or behave differently depending on whatever mcporter/MiniMax MCP setup already exists in the environment.

Why it was flagged

The marketplace file declares a dependency on mcporter, while the registry requirements show no required binaries and there is no install spec. This is not malicious, but it means the runtime dependency and provenance are not fully captured in the registry metadata.

Skill content
"dependencies": {
    "node": ["mcporter"]
  }
Recommendation

Document the required mcporter and MiniMax MCP versions/configuration in the registry requirements or install instructions.