Skip to content

Gemini Image Simple

Verified

Generate and edit images with Gemini API using pure Python stdlib. Zero dependencies - works on locked-down environments where pip/uv aren't available.

2,750

Install

Claude Code

Add to .claude/skills/

About This Skill

# Gemini Image Simple

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) - the highest quality image generation model.

Why This Skill

| Feature | This Skill | Others (nano-banana-pro, etc.) | |---------|------------|-------------------------------| | Dependencies | None (stdlib only) | google-genai, pillow, etc. | | Requires pip/uv | ❌ No | ✅ Yes | | Works on Fly.io free | ✅ Yes | ❌ Fails | | Works in containers | ✅ Yes | ❌ Often fails | | Image generation | ✅ Full | ✅ Full | | Image editing | ✅ Yes | ✅ Yes | | Setup complexity | Just set API key | Install packages first |

Bottom line: This skill works anywhere Python 3 exists. No package managers, no virtual environments, no permission issues.

Quick Start

```bash # Generate python3 /data/clawd/skills/gemini-image-simple/scripts/generate.py "A cat wearing a tiny hat" cat.png

# Edit existing image python3 /data/clawd/skills/gemini-image-simple/scripts/generate.py "Make it sunset lighting" edited.png --input original.png ```

Usage

Generate new image

```bash python3 {baseDir}/scripts/generate.py "your prompt" output.png ```

Edit existing image

```bash python3 {baseDir}/scripts/generate.py "edit instructions" output.png --input source.png ```

Supported input formats: PNG, JPG, JPEG, GIF, WEBP

Environment

Set `GEMINI_API_KEY` environment variable. Get one at https://aistudio.google.com/apikey

How It Works

  • Uses Nano Banana Pro (`nano-banana-pro-preview`) - Google's highest quality image generation model:
  • Pure `urllib.request` for HTTP (no requests library)
  • Pure `json` for parsing (stdlib)
  • Pure `base64` for encoding (stdlib)

That's it. No external packages. Works on any Python 3.10+ installation.

Model

Currently using: `nano-banana-pro-preview` (also known as Gemini 3 Pro Image)

  • Other available models (can be changed in generate.py if needed):
  • `gemini-3-pro-image-preview` - Same as Nano Banana Pro
  • `imagen-4.0-ultra-generate-001` - Imagen 4.0 Ultra
  • `imagen-4.0-generate-001` - Imagen 4.0
  • `gemini-2.5-flash-image` - Gemini 2.5 Flash with image gen

Examples

```bash # Landscape python3 {baseDir}/scripts/generate.py "Misty mountains at sunrise, photorealistic" mountains.png

# Product shot python3 {baseDir}/scripts/generate.py "Minimalist product photo of a coffee cup, white background" coffee.png

# Edit: change style python3 {baseDir}/scripts/generate.py "Convert to watercolor painting style" watercolor.png --input photo.jpg

# Edit: add element python3 {baseDir}/scripts/generate.py "Add a rainbow in the sky" rainbow.png --input landscape.png ```

Use Cases

  • Generate simple images quickly using a streamlined Gemini API wrapper
  • Create basic illustrations and graphics without complex configuration
  • Produce placeholder images and quick visual prototypes
  • Generate icons and simple visual elements for development use
  • Test image generation prompts with minimal setup overhead

Pros & Cons

Pros

  • + Simplified interface removes configuration complexity from image generation
  • + Quick setup enables immediate image generation without learning the full API
  • + Suitable for rapid prototyping and development-time image needs

Cons

  • - Simplified interface may not expose advanced Gemini image generation options
  • - Only available on claude-code and openclaw platforms
  • - Requires Gemini API credentials despite the simplified interface

Frequently Asked Questions

What does Gemini Image Simple do?

Generate and edit images with Gemini API using pure Python stdlib. Zero dependencies - works on locked-down environments where pip/uv aren't available.

What platforms support Gemini Image Simple?

Gemini Image Simple is available on Claude Code, OpenClaw.

What are the use cases for Gemini Image Simple?

Generate simple images quickly using a streamlined Gemini API wrapper. Create basic illustrations and graphics without complex configuration. Produce placeholder images and quick visual prototypes.

Stay Updated on Agent Skills

Get weekly curated skills + safety alerts