Skip to content

Chatgpt Image Generation

Verified

Generate images from ChatGPT using Playwright browser automation. Opens ChatGPT, sends prompts, waits for generation, and saves the resulting images.

96 downloads
$ Add to .claude/skills/

About This Skill

# ChatGPT Image Generation Skill

Use Playwright to automate ChatGPT web UI for image generation.

Prerequisites

```bash npm install playwright npx playwright install chromium ```

Usage

```bash # Generate images from prompts file node generate.js --prompts prompts.json --output ./images

# Resume from a specific index node generate.js --prompts prompts.json --output ./images --start 5

# Run in headless mode node generate.js --prompts prompts.json --output ./images --headless ```

Prompt File Format

```json ["prompt 1", "prompt 2"] ```

or

```json { "prompts": ["prompt 1", "prompt 2"] } ```

How It Works

  1. Opens ChatGPT in a Chrome browser
  2. Sends each prompt from the prompts file
  3. Waits for the response to be generated
  4. Finds the generated image in the page
  5. Saves the image to the output directory
  6. Repeats for all prompts

Output

  • Numbered image files: `001.png`, `002.png`, etc.
  • `results.jsonl` — log of results per prompt

Login (One-Time)

  1. If not logged into ChatGPT:
  2. Run the script (browser will open visible)
  3. Sign into ChatGPT
  4. Session is saved for future runs

Use Cases

  • Batch-generate images from a JSON prompts file using ChatGPT's web UI via Playwright
  • Automate ChatGPT image generation for content pipelines without using the API directly
  • Resume interrupted batch jobs from a specific prompt index
  • Generate numbered image outputs with structured JSONL result logs for tracking
  • Run headless image generation for CI/CD or scheduled content workflows

Pros & Cons

Pros

  • +Uses ChatGPT's web UI directly — no separate DALL-E API key or billing required
  • +Batch processing with resume support handles large prompt lists reliably
  • +Session persistence means you only need to log in once

Cons

  • -Depends on ChatGPT web UI DOM structure — breaks if OpenAI changes the page layout
  • -Requires an active ChatGPT Plus subscription with image generation access
  • -Browser automation is slower than API calls — each image requires full page load and render wait

FAQ

What does Chatgpt Image Generation do?
Generate images from ChatGPT using Playwright browser automation. Opens ChatGPT, sends prompts, waits for generation, and saves the resulting images.
What platforms support Chatgpt Image Generation?
Chatgpt Image Generation is available on Claude Code, OpenClaw.
What are the use cases for Chatgpt Image Generation?
Batch-generate images from a JSON prompts file using ChatGPT's web UI via Playwright. Automate ChatGPT image generation for content pipelines without using the API directly. Resume interrupted batch jobs from a specific prompt index.

100+ free AI tools

Writing, PDF, image, and developer tools — all in your browser.

Next Step

Use the skill detail page to evaluate fit and install steps. For a direct browser workflow, move into a focused tool route instead of staying in broader support surfaces.