Skill flagged — suspicious patterns detected

ClawHub Security flagged this skill as suspicious. Review the scan results before using.

Gemini Image Gen

Generate and edit images via Google Gemini API. Supports Gemini native generation, Imagen 3, style presets, and batch generation with HTML gallery. Zero depe...

MIT-0 · Free to use, modify, and redistribute. No attribution required.
8 · 4.6k · 30 current installs · 30 all-time installs
by김덕환@iisweetheartii
MIT-0
Security Scan
VirusTotalVirusTotal
Suspicious
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
Name/description claim image generation and editing via Google Gemini/Imagen; the package requires python3 and GEMINI_API_KEY only, and the included script uses generativelanguage.googleapis.com endpoints to generate/edit images — this aligns with the stated purpose.
Instruction Scope
SKILL.md instructs setting GEMINI_API_KEY and running scripts/gen.py (or examples using subprocess). The instructions do not tell the agent to read unrelated files or leak data. HEARTBEAT.md and integration notes recommend periodically generating and optionally sharing images to social feeds — these are optional behaviors the user should be aware of (could cause external posting if combined with other skills).
Install Mechanism
There is no automated install that downloads arbitrary code; the repo contains a single Python script (scripts/gen.py) and doc files. No remote installers/URL downloads or archive extraction are present in the skill metadata. Running the included script will perform network calls to the declared API host.
Credentials
Only GEMINI_API_KEY is required and is used directly by the script to authenticate calls to generativelanguage.googleapis.com. The requested credential matches the skill's needs. Note: package docs mention that other complementary skills may reuse the same API key, which increases blast radius if the key is shared across skills.
Persistence & Privilege
always is false and the skill does not request special platform-wide persistence. The INSTALL.md suggests adding the key to shell profiles (user action) but the skill itself does not modify other skills or system configs.
Assessment
This skill appears to do what it says: a local Python script that calls Google Gemini/Imagen using GEMINI_API_KEY. Before installing/running: (1) review scripts/gen.py yourself (it runs HTTPS requests and writes files to a timestamped output directory under ~/Projects/tmp or ./tmp); (2) avoid pasting sensitive long-lived credentials into shared shell profiles if you use the same GEMINI_API_KEY across multiple skills—consider a dedicated API key with limited billing/quota; (3) be aware HEARTBEAT.md suggests periodic generation and 'sharing'—if you integrate this skill with social/posting skills, review those integrations so content or keys are not sent unintentionally; (4) ensure you trust the repository source before running the script (it performs network calls and will transmit any image passed to the edit feature to the API).

Like a lobster shell, security has layers — review code before you run it.

Current versionv1.3.1
Download zip
latestvk97dvkyn9gm0fmrmayvf0tdpzs818jbs

License

MIT-0
Free to use, modify, and redistribute. No attribution required.

Runtime requirements

🎨 Clawdis
Binspython3
EnvGEMINI_API_KEY
Primary envGEMINI_API_KEY

SKILL.md

Gemini Image Gen

Generate and edit images via the Google Gemini API using pure Python stdlib. Supports Gemini native generation + editing, Imagen 3 generation, batch runs, and an HTML gallery output.

Quick Start

export GEMINI_API_KEY="your-key-here"

# Default: Gemini native, 4 random prompts
python3 scripts/gen.py

# Custom prompt
python3 scripts/gen.py --prompt "a cyberpunk cat riding a neon motorcycle through Tokyo at night"

# Imagen 3 engine
python3 scripts/gen.py --engine imagen --count 4 --aspect 16:9

# Edit an existing image (Gemini engine only)
python3 scripts/gen.py --edit path/to/image.png --prompt "change the background to a sunset beach"

# Use a style preset
python3 scripts/gen.py --style watercolor --prompt "floating islands above a calm sea"

# List available styles
python3 scripts/gen.py --styles

Style Presets

StyleDescription
photoUltra-detailed photorealistic photography, 8K resolution, sharp focus
animeHigh-quality anime illustration, Studio Ghibli inspired, vibrant colors
watercolorDelicate watercolor painting on textured paper, soft edges, gentle color bleeding
cyberpunkNeon-lit cyberpunk scene, rain-soaked streets, holographic displays, Blade Runner aesthetic
minimalistClean minimalist design, geometric shapes, limited color palette, white space
oil-paintingClassical oil painting with visible brushstrokes, rich textures, Renaissance lighting
pixel-artDetailed pixel art, retro 16-bit style, crisp edges, nostalgic palette
sketchPencil sketch on cream paper, hatching and cross-hatching, artistic imperfections
3d-renderProfessional 3D render, ambient occlusion, global illumination, photorealistic materials
pop-artBold pop art style, Ben-Day dots, strong outlines, vibrant contrasting colors

Full CLI Reference

FlagDefaultDescription
--prompt(random)Text prompt. Omit for random creative prompts
--count4Number of images to generate
--enginegeminiEngine: gemini (native, supports edit) or imagen (Imagen 3)
--model(auto)Model override. Default: gemini-2.5-flash-image or imagen-3.0-generate-002
--editPath to input image for editing (Gemini engine only)
--aspect1:1Aspect ratio for Imagen: 1:1, 16:9, 9:16, 4:3, 3:4
--out-dir(auto)Output directory (default is a timestamped folder)
--styleStyle preset to prepend to the prompt
--stylesList available style presets and exit

Python Example

import subprocess

subprocess.run(
    [
        "python3",
        "scripts/gen.py",
        "--prompt",
        "a serene mountain landscape at golden hour",
        "--count",
        "4",
        "--style",
        "photo",
    ],
    check=True,
)

Troubleshooting

  • Missing API key: set GEMINI_API_KEY in your environment and retry.
  • Rate limits / 429 errors: wait a bit and retry, reduce --count, or switch engines.
  • Model errors: verify the model name, try the default model, or change engines.

Integration with Other Skills

  • AgentGram — Share your generated images on the AI agent social network! Create visual content and post it to your AgentGram feed.
  • agent-selfie — Focused on AI agent avatars and visual identity. Uses the same Gemini API key for personality-driven self-portraits.
  • opencode-omo — Run deterministic image-generation pipelines with Sisyphus workflows.

Changelog

  • v1.3.1: Added workflow integration guidance for opencode-omo.
  • v1.1.0: Added style presets, --style and --styles flags, expanded documentation.
  • v1.0.0: Initial release with Gemini native + Imagen 3 support, batch generation, and HTML gallery.

Repository

https://github.com/IISweetHeartII/gemini-image-gen

Files

6 total
Select a file
Select a file to preview.

Comments

Loading comments…