Gemini Image Remix

Verified

Generate or remix images using Gemini models with text prompts and multiple input images, supporting various styles, resolutions, and advanced model options.

About This Skill

# Gemini Image Remix

A tool for text-to-image generation and image-to-image remixing. By default, it uses Gemini 2.5 Flash Image (`gemini-2.5-flash-image`) for fast, high-quality results. It also supports Gemini 3 Pro Image (Nano Banana Pro, `gemini-3-pro-image-preview`) for more demanding, high-fidelity work.

Generate Image

Create an image from a text prompt alone, with no input images.

```bash
uv run {baseDir}/scripts/remix.py --prompt "a cybernetic owl in a neon forest" --filename "owl.png"
```

Remix/Modify Image

Pass one or more reference images with `-i` to guide the generation. This suits style transfers, background changes, and character modifications.

```bash
uv run {baseDir}/scripts/remix.py --prompt "change the art style to a pencil sketch" --filename "sketch.png" -i "original.png"
```

Multi-image Composition

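Combine elements from up to 14 input images into a single cohesive scene. Repeat `-i` once per image; the prompt can refer to inputs by position ("image 1", "image 2"), as in the example.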

```bash
uv run {baseDir}/scripts/remix.py --prompt "place the character from image 1 into the environment of image 2" --filename "result.png" -i "character.png" -i "env.png"
```
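When the reference images come from a directory or a glob, it can help to build the repeated `-i` flags programmatically and stop at the 14-image limit stated above. The following is a minimal sketch, not part of the skill itself: `build_input_flags`, `INPUT_FLAGS`, and `MAX_INPUTS` are hypothetical names introduced here for illustration, and the `refs/` directory in the usage comment is an assumption.

```bash
#!/usr/bin/env bash
# Sketch: turn a list of image paths into repeated "-i <path>" flags,
# keeping at most 14 inputs (the limit documented for this skill).
# build_input_flags is a hypothetical helper, not something remix.py provides.

MAX_INPUTS=14

build_input_flags() {
  # Fills the global array INPUT_FLAGS; extra paths are skipped with a warning.
  INPUT_FLAGS=()
  local count=0
  local path
  for path in "$@"; do
    if [ "$count" -ge "$MAX_INPUTS" ]; then
      echo "warning: ignoring extra input: $path" >&2
      continue
    fi
    INPUT_FLAGS+=(-i "$path")
    count=$((count + 1))
  done
}

# Example usage (assumes a refs/ directory of PNG files):
# build_input_flags refs/*.png
# uv run {baseDir}/scripts/remix.py --prompt "combine these references into one scene" \
#   --filename "result.png" "${INPUT_FLAGS[@]}"
```

Passing the flags as a quoted array expansion keeps paths with spaces intact, which a plain string of flags would not.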

Advanced Model Selection

Pass `--model` to switch from the default model to Nano Banana Pro (`gemini-3-pro-image-preview`) for high-fidelity work.

```bash
uv run {baseDir}/scripts/remix.py --model "gemini-3-pro-image-preview" --prompt "highly detailed oil painting of a dragon" --filename "dragon.png"
```

Options

  • `--prompt`, `-p`: Image description or specific edit instructions.
  • `--filename`, `-f`: The output path for the generated PNG.
  • `--input-image`, `-i`: Path to an input image (repeatable up to 14 times).
  • `--resolution`, `-r`: `1K` (default), `2K`, or `4K`.
  • `--aspect-ratio`, `-a`: Output aspect ratio (e.g., `1:1`, `16:9`, `9:16`, `4:3`, `3:4`).
  • `--model`, `-m`: Model to use (defaults to `gemini-2.5-flash-image`). Supported: `gemini-2.5-flash-image`, `gemini-3-pro-image-preview`.
  • `--api-key`, `-k`: Gemini API key (defaults to `GEMINI_API_KEY` env var).
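The options above can be combined in a single call. As a rough sketch, the helper below assembles an argument list from the documented flags and rejects a resolution other than `1K`, `2K`, or `4K` before anything is sent to the API. `build_remix_args` and `REMIX_ARGS` are hypothetical names introduced for illustration; the sketch also assumes that omitting `--aspect-ratio` lets the script pick its own default, which the listing does not state.

```bash
#!/usr/bin/env bash
# Sketch: assemble remix.py arguments from the documented options.
# build_remix_args is a hypothetical helper, not part of the skill.
# Usage: build_remix_args PROMPT OUTPUT [RESOLUTION] [ASPECT_RATIO]

build_remix_args() {
  local prompt="$1" output="$2" resolution="${3:-1K}" aspect="${4:-}"

  # Only the three documented resolution values are accepted.
  case "$resolution" in
    1K|2K|4K) ;;
    *) echo "error: resolution must be 1K, 2K, or 4K" >&2; return 1 ;;
  esac

  REMIX_ARGS=(--prompt "$prompt" --filename "$output" --resolution "$resolution")

  # Assumption: the flag is optional, so it is added only when provided.
  if [ -n "$aspect" ]; then
    REMIX_ARGS+=(--aspect-ratio "$aspect")
  fi
}

# Example usage (reads the API key from the GEMINI_API_KEY environment variable):
# build_remix_args "a lighthouse at dusk" "lighthouse.png" 2K 16:9
# uv run {baseDir}/scripts/remix.py "${REMIX_ARGS[@]}"
```

Validating the resolution locally is a design choice made for this sketch: it surfaces a typo immediately instead of after a failed request.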

Use Cases

  • Remix and modify existing images using Gemini's image understanding and generation
  • Apply style transfers and artistic modifications to source images
  • Generate variations of product images with different backgrounds or settings
  • Create image mashups combining elements from multiple source images
  • Edit specific regions of images based on text instructions

Pros & Cons

Pros

  • Image-to-image modification preserves key elements while applying changes
  • Style transfer capability enables creative visual exploration
  • Region-based editing enables precise modifications without affecting the whole image

Cons

  • Remix quality depends on the source image and the clarity of the modification instructions
  • Only available on the Claude Code and OpenClaw platforms
  • Requires Gemini API access with image capabilities

FAQ

What does Gemini Image Remix do?
Generate or remix images using Gemini models with text prompts and multiple input images, supporting various styles, resolutions, and advanced model options.
What platforms support Gemini Image Remix?
Gemini Image Remix is available on Claude Code and OpenClaw.
What are the use cases for Gemini Image Remix?
Remix and modify existing images using Gemini's image understanding and generation. Apply style transfers and artistic modifications to source images. Generate variations of product images with different backgrounds or settings.