Codex Imggen

Other

通过 Codex CLI 生成图片,支持尺寸控制和批量模式。依赖:codex (npm)。代理默认启用。用 generate.sh 生成单张,batch_generate.sh 批量生成游戏 UI 素材,gen_size.sh 指定尺寸。所有输出保存至 ~/.codex/generated_images/。

Install

openclaw skills install codex-imggen

Codex 图片生成

⚠️ 美术必读 — 尺寸限制:Codex CLI 最大输出约 1.57MP(1254×1254 级别),所有尺寸都会被卡在这个上限内。4K/FHD/2K 都会降级,不会真的输出那些尺寸。详见下方安全尺寸表。

尺寸限制速查

用途推荐尺寸实际输出风险
方图素材(安全)1K / 1:1 / square1254×1254✅ 无缩放
横版大图(安全)1536x10241536×1024✅ 无缩放(唯一精确大尺寸)
竖版素材(安全)9:16941×1672✅ 无缩放
横版4:3(安全)4:31448×1086✅ 无缩放
16:9 横版2K / FHD / 16:91672×941⚠️ 实际只有 HD 级
4K / 1080p / 720p❌ 勿用全部降级至 1672×941❌ 严重缩水
小图如 512×512❌ 勿用膨胀至 1254×1254❌ 严重放大

结论:游戏 UI 素材推荐用 1K(方图)或 1536x1024(横版),不要用 FHD/4K/720p 等标识。

快速参考

任务命令
单张图片bash ~/.openclaw/skills/codex-imggen/scripts/generate.sh "<prompt>" [output_dir]
指定尺寸bash ~/.openclaw/skills/codex-imggen/scripts/gen_size.sh <size_key> "<subject>" [output_dir]
批量游戏 UIbash ~/.openclaw/skills/codex-imggen/scripts/batch_generate.sh "<style>" <count> [output_dir]

尺寸 Key(gen_size.sh)

Key实际输出比例MP精确度
1K / 1:1 / square1254×12541:1~1.57MP✅ ~1K(差 <3%)
2K / 16:9 / FHD1672×94116:9~1.57MP⚠️ ~2K(面积差 -22%)
2048x10801727×911~17:9~1.57MP✅ 2K 最佳匹配
4K / 3840x21601672×94116:9~1.57MP❌ 无法达到 4K
9:16941×16729:16~1.57MP✅ 竖版
4:31448×10864:3~1.57MP
1536x10241536×10243:2~1.57MP✅ 完全精确
720p / HD1672×94116:9~1.57MP⚠️ 等同 2K

核心约束:Codex CLI 所有输出被强制限制在 ~1.57MP 以内。4K / FHD / 2K / 720p 等大尺寸标识均会被降级处理,只有 1K1536x10249:164:3 能保持原始比例不缩放。

生成图片

单张图片

bash ~/.openclaw/skills/codex-imggen/scripts/generate.sh \
  "A fantasy sword with golden blade on white background (#ffffff), flat vector art style" \
  /tmp/my-images

指定尺寸

bash ~/.openclaw/skills/codex-imggen/scripts/gen_size.sh \
  "1K" \
  "A medieval castle on a hill at sunset" \
  /tmp/my-images

批量游戏 UI 素材(2026-05-21 实测)

bash ~/.openclaw/skills/codex-imggen/scripts/batch_generate.sh \
  "flat vector art style, white background (#ffffff), each icon approximately 256x256 pixels, consistent 2px gold (#d4af37) border, same lighting angle, centered subjects" \
  6 \
  /tmp/my-icons

批量生成会输出包含所有元素的一张合并大图(通常为 2x3 或 3x2 网格)。详见下方已验证的 Prompt 示例。

批量 Prompt 示例(2026-05-21 实测)

所有批量输出均为一张合并大图;使用 ImageMagick 或 batch_split.sh 裁剪拆分。

RPG Icons(3x2 网格,1024×1536)

Generate a set of 6 RPG game UI icons in a 3x2 grid (2 columns, 3 rows), all in the same flat vector art style with a white background (#ffffff). Each icon approximately 256x256 pixels in combined image. Icons: (1) sword with golden blade, (2) shield with silver surface and red cross, (3) red health potion bottle, (4) purple crystal gem, (5) golden 5-pointed star, (6) red heart. All icons share consistent 2px gold (#d4af37) border, consistent padding, same lighting angle, centered subjects.

Cyberpunk(3x2 网格,1024×1536)

Generate a set of 6 cyberpunk game UI elements in a 3x2 grid (2 columns, 3 rows), all in the same neon glow cyberpunk style. Each element approximately 256x256 pixels in combined image. Elements: (1) circuit board chip icon, (2) glowing cyan hexagon, (3) neon sword, (4) holographic shield, (5) power battery with lightning bolt, (6) digital eye/sensor. White background, neon colors (cyan #00ffff, magenta #ff00ff, yellow #ffff00), glow effects.

Pixel Art 道具(2x3 网格,1536×1024)

Generate a set of 6 pixel art game items in a 2x3 grid (3 columns, 2 rows), authentic retro pixel style. Each item approximately 256x256 pixels in combined image. Items: (1) red potion, (2) green herb/plant, (3) blue mana crystal, (4) golden coin, (5) brown wooden chest, (6) white feather. White background, pixel perfect, 16-bit era style.

Flat Minimal Icons(3x2 网格,1024×1536)

Generate a set of 6 flat minimalist game UI icons in a 3x2 grid (2 columns, 3 rows). Each icon approximately 256x256 pixels in combined image. Icons: (1) play button (triangle), (2) pause button (two bars), (3) settings cogwheel, (4) home button (house shape), (5) sound on (speaker with waves), (6) sound off (speaker with X). Style: ultra flat, solid colors, no outlines, modern mobile game aesthetic. Colors: flat blue (#4a90d9), flat white, flat gray on white background.

拆分批量输出

⚠️ Layout 注意

  • 1024×1536 图片 → 2列 × 3行 → 3x2 layout(每格约 512×512)
  • 1536×1024 图片 → 3列 × 2行 → 2x3 layout(每格约 512×341)
# ✅ 推荐:用 batch_split.py(背景透明 + 网格拆分 + alpha 抠图一气呵成)
python3 ~/.openclaw/skills/sprite-tools/scripts/batch_split.sh \
  /tmp/batch.png 3x2 /tmp/icons/

# layout 格式:rows x cols,第一个数是行数(纵向),第二个数是列数(横向)

尺寸精确度指南

可靠(可直接使用):

  • 1K / 1:1 / square → 1254×1254
  • 2048x1080 → 1727×911(2K 最佳匹配)
  • 1536x1024 → 1536×1024(完全精确!)
  • 2K / 16:9 / FHD → 1672×941
  • 9:16 → 941×1672
  • 4:3 → 1448×1086

不可靠(避免使用):

  • 4K / 3840x2160 → 上限 1672×941(面积损失 95%)
  • 直接请求像素尺寸如 1920x1080 → 降级至 1672×941
  • 小尺寸如 512x512 → 膨胀至 1254×1254(大幅放大)

需要精确尺寸:用语义描述生成,再用 ffmpeg 或 PIL 裁剪/缩放至目标尺寸。

输出位置

所有图片保存至 ~/.codex/generated_images/<session-id>/。脚本会自动复制最新一张到指定输出目录;未指定目录时可在该目录查找。

代理

代理(http://127.0.0.1:1080)默认启用。如代理地址不同,修改各脚本顶部的 PROXY 变量。

架构说明

Codex CLI → Plus 订阅 → ~1.57MP 固定输出 bucket
                                    (8 个 bucket:1:1, 16:9, 3:2, 4:3, 9:16, ~17:9)

需要 >1.57MP 分辨率:使用官方 gpt-image-2 API(planned as separate skill)