Skip to content

Gemini Nano Images

Verified

Generate ultra-realistic images and Instagram content using Gemini 2.0 Flash Experimental. Use when creating photorealistic images, social media content, or...

92 downloads
$ Add to .claude/skills/

About This Skill

# Gemini Nano Images

Generate ultra-realistic, photorealistic images using Google's Gemini 2.0 Flash Experimental model with native image generation capabilities.

Quick Start

1. Set API Key

```bash export GEMINI_API_KEY="your-api-key-here" ```

Get your API key at: https://aistudio.google.com/app/apikey

2. Generate Single Image

```bash python3 scripts/generate_image.py "A cozy family breakfast scene with warm morning light" ```

3. Generate Full Instagram Post

```bash python3 scripts/generate_instagram_post.py "Morning routine with kids" --mood cozy ```

Scripts

generate_image.py

Creates ultra-realistic images from text prompts.

Usage: ```bash python3 scripts/generate_image.py "PROMPT" [options] ```

  • Options:
  • `-o, --output DIR` - Output directory (default: current)
  • `-k, --api-key KEY` - Gemini API key
  • `--style {realistic,artistic,minimal}` - Style preset

Example: ```bash python3 scripts/generate_image.py "Father reading bedtime story to child" -o ~/images ```

generate_instagram_post.py

Creates complete Instagram posts: image + caption.

Usage: ```bash python3 scripts/generate_instagram_post.py "TOPIC" [options] ```

  • Options:
  • `-m, --mood MOOD` - Post mood: inspiring, cozy, energetic, calm, family, productive
  • `-o, --output DIR` - Output directory (default: ~/.openclaw/workspace/assets)
  • `-k, --api-key KEY` - Gemini API key

Example: ```bash python3 scripts/generate_instagram_post.py "Organized family calendar" --mood productive ```

Mood Presets

| Mood | Description | |------|-------------| | `inspiring` | Uplifting, motivational, golden hour | | `cozy` | Warm, comfortable, soft lighting | | `energetic` | Vibrant, dynamic, bright colors | | `calm` | Peaceful, serene, soft pastels | | `family` | Loving family moments, candid | | `productive` | Organized, clean, modern aesthetic |

Output

  • Images: Saved as PNG with timestamp prefix
  • Captions: Saved as .txt file alongside image
  • Location: Default is `~/.openclaw/workspace/assets`

Integration with Instagram Workflow

Generated content can be directly used with the Social Media Suite:

  1. Generate content:
  2. ```bash
  3. python3 scripts/generate_instagram_post.py "Weekend family adventure"
  4. ```
  1. Results saved to assets folder, ready for posting
  1. Use with instagram-poster or instagrapi workflow

Stock-Only Mode (Community Building Phase)

Für 1-2 Wochen nur Stockfotos posten (keine KI-Generierung):

```bash # Stock-Only Mode für 14 Tage aktivieren cd ~/.openclaw/workspace/skills/ig-automation python3 smart_poster_v4.py --set-mode stock_only --stock-days 14

# Stockfotos in Ordner legen mkdir -p assets/stock cp ~/deine-stockfotos/*.jpg assets/stock/

# Jetzt werden nur Stockfotos gepostet (zufällige Auswahl, keine Doppelungen) python3 smart_poster_v4.py

# Nach 14 Tagen automatischer Wechsel zu "auto" (KI + Stock) ```

  • Modes:
  • `auto` - Bevorzugt KI, Fallback zu Stock
  • `stock_only` - Nur Stockfotos aus `assets/stock/`
  • `ai_only` - Nur KI-generierte Bilder

Wichtig: Bei Stockfotos kein Wasserzeichen (Logo) - sieht authentischer aus!

Requirements

```bash pip install google-genai ```

Model Details

  • Model: Gemini 2.0 Flash Experimental
  • Capability: Native image generation
  • Resolution: Up to 1024x1024
  • Style: Photorealistic by default

See references/gemini_api.md for API details.

Use Cases

  • Generate lightweight images using Gemini Nano's on-device capabilities
  • Create images with minimal latency using local model inference
  • Build image generation features that work offline without API calls
  • Produce quick visual assets where speed matters more than maximum quality
  • Prototype image generation features without incurring API costs

Pros & Cons

Pros

  • +On-device inference eliminates API latency and network dependency
  • +No API costs for image generation using the local model
  • +Offline capability enables image generation without internet access

Cons

  • -Nano model produces lower quality images compared to full Gemini models
  • -Only available on claude-code and openclaw platforms
  • -On-device execution requires compatible hardware with sufficient compute

FAQ

What does Gemini Nano Images do?
Generate ultra-realistic images and Instagram content using Gemini 2.0 Flash Experimental. Use when creating photorealistic images, social media content, or...
What platforms support Gemini Nano Images?
Gemini Nano Images is available on Claude Code, OpenClaw.
What are the use cases for Gemini Nano Images?
Generate lightweight images using Gemini Nano's on-device capabilities. Create images with minimal latency using local model inference. Build image generation features that work offline without API calls.

100+ free AI tools

Writing, PDF, image, and developer tools — all in your browser.

Next Step

Use the skill detail page to evaluate fit and install steps. For a direct browser workflow, move into a focused tool route instead of staying in broader support surfaces.