Photo to 3D

One-click photo to 3D model pipeline. Upload any photo, AI generates a clean isometric view via Gemini (Nano Banana), then converts it to a production-ready...

MIT-0 · Free to use, modify, and redistribute. No attribution required.
1 · 127 · 0 current installs · 0 all-time installs
MIT-0
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
The name/description require Gemini and Tripo3D access; the script uses GEMINI_API_KEY and TRIPO_API_KEY and calls the expected endpoints. Required credentials and network calls are proportionate to converting images → isometric view → 3D model.
Instruction Scope
Instructions and the script operate within the stated pipeline: they read the input image, call Gemini and Tripo3D APIs, and write output files. Important privacy note: the user's image is uploaded to external services (Google generativelanguage and api.tripo3d.ai); this is expected but should be considered before use.
Install Mechanism
No install spec (instruction-only with an included script) — low install risk. One practical omission: the script uses the Python 'requests' library but the skill metadata does not declare dependencies or give an install step; users must ensure Python and requests are available.
Credentials
Only GEMINI_API_KEY and TRIPO_API_KEY are required, which directly match the two external services used. No unrelated secrets or config paths are requested.
Persistence & Privilege
always is false, the skill does not request persistent or elevated agent privileges, and it does not modify other skills or system-wide settings.
Assessment
This skill appears internally consistent and does what it describes, but review these points before installing: - The script uploads your images to two external services (Google generativelanguage and Tripo3D). Do not use sensitive or private images unless you accept those services' privacy/retention policies. - You must provide GEMINI_API_KEY and TRIPO_API_KEY; check billing and permissions for those keys. The script prints small token fragments in logs (not full keys) but network requests will include your keys — treat them as sensitive. - Ensure your environment has Python and the 'requests' library installed (the skill does not include dependency installation instructions). - Origin is unknown and there's no homepage; if you need higher assurance, request provenance or a vetted package/repository for the code, and consider running it in an isolated environment first. - If you plan to run at scale, verify Tripo3D's rate/usage limits and costs and confirm the Gemini usage method matches your Google/API account setup.

Like a lobster shell, security has layers — review code before you run it.

Current versionv1.0.0
Download zip
latestvk97djd4mqwmzbw66rq07eppkw18359z4

License

MIT-0
Free to use, modify, and redistribute. No attribution required.

SKILL.md

Photo to 3D Model

Two-step pipeline that converts any photo into a 3D model:

  1. Gemini preprocess — Transform photo into a clean white-background 45° isometric view
  2. Tripo3D generate — Convert the isometric image into a .glb 3D model

Requirements

Usage

# Full pipeline: photo → isometric view → 3D model
python3 scripts/photo_to_3d.py <image_path>

# Custom prompt for Gemini preprocessing
python3 scripts/photo_to_3d.py <image_path> --prompt "your custom prompt"

# Skip preprocessing (image is already a clean isometric view)
python3 scripts/photo_to_3d.py <image_path> --skip-preprocess

# Custom output directory
python3 scripts/photo_to_3d.py <image_path> --output-dir ./my_output

Output

  • output/{name}_isometric.png — Gemini-generated isometric view
  • output/{name}_model.glb — Final 3D model file

Default Gemini Prompt

The built-in prompt generates a 45° isometric "3D-printed model" style render with PBR materials on a pure white background. Override with --prompt for custom styles (e.g., game assets, architectural models, product renders).

Notes

  • Supported input: .jpg, .jpeg, .png, .webp
  • Tripo3D generation takes ~1-3 minutes depending on complexity
  • For higher quality, preprocess source images with upscayl before running
  • Output .glb files can be converted to .usdz for Apple AR using usdzconvert

Files

2 total
Select a file
Select a file to preview.

Comments

Loading comments…