Memorable Image Generator
v1.0.1Science-backed image generation agent that scores and optimizes images for memorability using ResMem (Brain Bridge Lab, University of Chicago) before returni...
Like a lobster shell, security has layers — review code before you run it.
Runtime requirements
Memorable Image Generator
Science-backed image generation that optimizes for memorability — not just aesthetics.
Most image generators stop when the image looks good. This one keeps going until the image is remembered. Powered by Google Gemini for generation and ResMem (Brain Bridge Lab, University of Chicago) for memorability scoring, it iterates until your image clears a science-validated memorability threshold.
Prerequisites
API Key:
GEMINI_API_KEYenvironment variable, OR--api-keyCLI flag, OR~/.config/gemini/api_keyfile
Python 3.8+ with these packages:
pip install resmem torch torchvision pillow requests
Quick Start
python scripts/generate_memorable_image.py \
--prompt "a lone astronaut standing on a red desert planet at dusk" \
--output hero.png \
--threshold 0.7 \
--verbose
How It Works
- Generate — Calls the Gemini REST API (
gemini-2.0-flash-exp) with your prompt - Score — Runs the image through ResMem to get a memorability score (0–1)
- Threshold check — If score ≥ threshold (default: 0.7), saves and returns the image
- Regenerate — If below threshold and attempts remain, enhances the prompt with composition cues and tries again
- Max attempts — After 3 attempts (default), saves the best result regardless
Each failed attempt appends increasingly strong composition cues to the prompt:
- Attempt 2:
", striking composition" - Attempt 3:
", vivid colors, memorable focal point"
Script Usage
# Basic usage
python scripts/generate_memorable_image.py --prompt "your image description"
# Full options
python scripts/generate_memorable_image.py \
--prompt "your image description" \
--output path/to/output.png \
--threshold 0.75 \
--max-attempts 3 \
--api-key YOUR_KEY \
--verbose
# With verbose scoring output
python scripts/generate_memorable_image.py \
--prompt "a surreal clock melting over a desert canyon" \
--verbose
CLI Arguments:
| Argument | Default | Description |
|---|---|---|
--prompt | (required) | Image description |
--output | memorable-image.png | Output file path |
--threshold | 0.7 | Memorability threshold (0–1) |
--max-attempts | 3 | Max regeneration attempts |
--api-key | (env/file) | Gemini API key |
--verbose | (flag) | Show memorability scores per attempt |
Prompt Tips for High-Memorability Images
Research shows these compositional elements consistently score higher with ResMem:
- Faces and eyes — Human faces, especially with direct gaze, are inherently memorable
- Unusual juxtapositions — Unexpected scale, context, or combination of objects ("a whale floating through a city skyline")
- Strong focal point — One clear subject against a contrasting background
- High contrast — Bold color separations between subject and background
- Emotional resonance — Images that imply narrative or emotion
- Unusual lighting — Dramatic shadows, golden hour, bioluminescence, neon
- Unexpected scale — Macro details of normally-seen-large things, or vice versa
Prompts that tend to score low: generic landscapes, symmetrical compositions, neutral palettes, cluttered scenes with no clear focal point.
Memorability Science
ResMem — Brain Bridge Lab, University of Chicago © 2021 The University of Chicago. Non-commercial use license. https://github.com/Brain-Bridge-Lab/resmem
ResMem is a deep learning model trained to predict image memorability scores — how likely a person is to remember having seen an image after a brief exposure. Scores range from 0 (instantly forgotten) to 1 (highly memorable).
License: ResMem Non-commercial License — redistribution permitted for non-commercial purposes with attribution. For commercial licensing: wilma@uchicago.edu
Built for Claude Code. Requires a Gemini API key and a local Python environment with resmem installed.
Comments
Loading comments...
