Image Sprout

v1.0.2

Generate and iterate on images using Image Sprout projects. Creates consistent outputs from reference images, style guides, and subject guides. Use when an a...

0· 227·0 current·0 all-time
byTrevin@tmchow
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Suspicious
medium confidence
Purpose & Capability
Name/description align with needing an image-sprout CLI binary and local project storage; requiring the image-sprout binary is proportionate. However, the SKILL.md expects an OpenRouter API key to be configured, which is not declared in the registry metadata (no required env or primary credential).
Instruction Scope
Instructions stay within the image-generation scope: creating projects, adding refs, deriving guides, generating runs, and reading returned image paths. The skill explicitly warns about concurrent state and the unauthenticated web UI. It does instruct storing a secret (OpenRouter key) on disk via the CLI config, and it exposes agent patterns for reading app data paths — both expected for a local CLI-based tool.
Install Mechanism
Instruction-only skill with no install spec; lowest install risk. The binary requirement is limited to 'image-sprout' on PATH, but the registry lacks a homepage/source field even though SKILL.md metadata references a GitHub repo; users should verify binary provenance before installing/running.
!
Credentials
SKILL.md requires an OpenRouter API key to be persisted via 'image-sprout config set apiKey', but the registry metadata declares no required environment variables or primary credential. This is an inconsistency: the skill needs a secret but the manifest doesn't enumerate it. Storing an API key on disk (and letting the CLI manage it) is plausible for this tool, but users should understand where it is stored and whether that storage is acceptable.
Persistence & Privilege
The skill does not request always:true, requires no config paths in the registry, and is user-invocable only. It does cause the CLI to persist configuration (api key, model selection, current project) to disk, which is expected for a local CLI tool.
What to consider before installing
This skill appears to be a wrapper around a local CLI ('image-sprout') and is mostly coherent, but note two issues: (1) the runtime docs require storing an OpenRouter API key on disk, yet the registry manifest does not declare any required credentials — confirm you are comfortable with how and where the CLI stores that key before using it; (2) the SKILL.md references a GitHub repo but the skill metadata has no homepage/source — verify the 'image-sprout' binary you install comes from the official project (check the GitHub repo and release checksums). Also avoid exposing the web UI to the public internet (the docs explicitly warn it has no authentication). If you want higher assurance, ask the publisher for a release URL, checksums, and an explanation of where config (the API key) is stored and protected.

Like a lobster shell, security has layers — review code before you run it.

Runtime requirements

Binsimage-sprout
latestvk97bjgrfk6h8zm9c48ha3qqg4n82e2cs
227downloads
0stars
1versions
Updated 1mo ago
v1.0.2
MIT-0

image-sprout

Generate and iterate on images with consistent style and subject identity. Image Sprout turns reusable project context — reference images, derived guides, and persistent instructions — into repeatable outputs.

1. OpenRouter Key Setup

Image Sprout stores its OpenRouter key on disk. Set it once per machine:

image-sprout config set apiKey <your-openrouter-key>
image-sprout config show    # confirm key is set (does not reveal the raw key)

How the calling environment stores or injects that key is outside this skill's scope.

2. The Project Model

Three context layers drive every generation:

  • Visual Style — consistent look and feel across outputs
  • Subject Guide — consistent subject identity across outputs
  • Instructions — persistent generation constraints (watermarks, framing, branding)

Two reference pools:

  • Shared refs — drive both guides (default, simplest)
  • Split refs — separate style and subject pools (advanced; use --role style or --role subject when adding)

Understanding this model prevents the most common agent mistake: generating without saved context and wondering why outputs are inconsistent.

3. Core CLI Workflow

# Create a project
image-sprout project create <name>

# Add references (3+ recommended; more refs = better derivation)
image-sprout ref add --project <name> ./ref1.png ./ref2.png ./ref3.png

# Optional: persistent instructions
image-sprout project update <name> --instructions "Watermark bottom-right: subtle."

# Derive guides from refs
image-sprout project derive <name> --target both   # or: style, subject

# Check readiness before generating
image-sprout project status <name> --json

# Generate (--count controls images per run: 1, 2, 4, 6; default is 4)
image-sprout project generate <name> --prompt "hero in neon rain"
image-sprout project generate <name> --prompt "hero in neon rain" --count 1

# Inspect results
image-sprout run latest --project <name> --json

# Delete a session and all its runs/images
image-sprout session delete --project <name> <session-id>

Top-level aliases for convenience:

image-sprout generate --project <name> --prompt "hero in neon rain"   # same as project generate
image-sprout analyze --project <name> --target both                    # same as project derive

4. JSON Output — the Agent Pattern

Always use --json for structured output:

image-sprout project show <name> --json
image-sprout project status <name> --json
image-sprout run latest --project <name> --json
image-sprout run list --project <name> --json --limit 5

Use --value PATH to pluck a single field:

image-sprout run latest --project <name> --json --value images[0].path

This is how agents hand image paths to downstream tools. Run images land in image-sprout's internal app data directory — use run latest --json --value images[0].path to get the path and leave what to do with it to the calling workflow.

5. Parallel-Safe Usage

image-sprout project use <name> sets a shared "current project" state on disk. When multiple agents or processes run concurrently, this state can collide. Always pass --project <name> explicitly — never rely on the current project shortcut in agent workflows.

6. Web UI — Agent Awareness

The web app runs over the same on-disk store as the CLI. Agents won't use it directly, but should know it exists so they can offer it to users when interactive review is appropriate.

image-sprout web              # launches local app
image-sprout web --open       # also opens in default browser
image-sprout web --port 8080  # custom port (default: 4310)

Useful for:

  • reviewing and comparing generated images visually
  • setting up a project interactively before handing off to CLI/agent use
  • iterating on outputs via the canvas interface

Security: do not expose the web UI to the public internet. The server has no authentication. Safe options are localhost only, or a private network like Tailscale. The risk is public internet exposure — LAN and tailnet access are fine.

7. Model Management

image-sprout model list
image-sprout model set-default google/gemini-3.1-flash-image-preview
image-sprout model add openai/gpt-5-image
image-sprout model restore-defaults

Default generation model is Nano Banana 2 (google/gemini-3.1-flash-image-preview). Custom models must accept image input and produce image output via OpenRouter.

Guide derivation uses a separate configurable analysis model (default: google/gemini-3.1-flash-image-preview):

# Set a persistent analysis model
image-sprout config set analysisModel google/gemini-2.5-flash

# Override per-derive
image-sprout project derive <name> --target both --analysis-model google/gemini-2.5-flash

Comments

Loading comments...