Gemini Video Analyzer

Verified

Native video analysis using Google Gemini API. Upload and analyze video files — describe scenes, extract text/UI, answer questions about content, transcribe...

242 downloads

$ Add to .claude/skills/

$ openclaw install

About This Skill

# Gemini Video Analyzer

Analyze videos natively using Google Gemini's multimodal API. No frame extraction needed — Gemini processes video at 1 FPS with full motion, audio, and visual understanding.

Quick Start

```bash # Analyze a video with default prompt (full description) GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4

# Ask a specific question GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4 "What text is visible on screen?"

# Manage uploaded files GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py list GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py cleanup ```

Supported Formats

MP4, AVI, MOV, MKV, WebM, FLV, MPEG, MPG, WMV, 3GP — up to 2GB per file.

How It Works

Video uploads to Google's Files API (temporary, auto-deletes after 48h)
Gemini processes at 1 frame/sec — understands motion, transitions, audio context
Model generates response based on your prompt
Way better than frame extraction for understanding temporal content

Use Cases

| Task | Example Prompt | |------|---------------| | General description | *(default — no prompt needed)* | | UI/text extraction | `"What text and UI elements are visible?"` | | Tutorial summary | `"Summarize the steps shown in this tutorial"` | | Bug report from video | `"Describe what went wrong in this screen recording"` | | Meeting notes | `"Summarize the key points discussed"` | | Content comparison | Upload 2 videos, ask for differences |

Configuration

Set `GOOGLE_AI_API_KEY` in your environment or `.env` file. Get a free key at aistudio.google.com.

Default model: `gemini-2.5-flash` (fast, cheap, excellent vision). Override with `--model gemini-2.5-pro` for complex analysis.

API Reference

See references/gemini-files-api.md for file upload limits, processing details, and advanced options.

Use Cases

Describe scenes and actions in a video file without manual frame extraction
Extract on-screen text and UI elements from screen recordings or tutorials
Generate meeting notes by summarizing key points from recorded video calls
Create bug reports from screen recordings by identifying what went wrong
Transcribe and summarize tutorial videos into step-by-step instructions

Pros & Cons

Pros

+Native video processing at 1 FPS — no frame extraction or preprocessing needed
+Supports 10+ video formats (MP4, AVI, MOV, MKV, WebM, etc.) up to 2GB
+Uses Gemini's multimodal API for combined motion, audio, and visual understanding
+Free API key available; auto-deletes uploaded files after 48 hours for privacy

Cons

-Requires a Google AI API key — depends on external service availability
-Limited to Google Gemini models; no option to use alternative AI providers
-Large video files may take significant time to upload and process

FAQ

What does Gemini Video Analyzer do?

Native video analysis using Google Gemini API. Upload and analyze video files — describe scenes, extract text/UI, answer questions about content, transcribe...

What platforms support Gemini Video Analyzer?

Gemini Video Analyzer is available on Claude Code, OpenClaw.

What are the use cases for Gemini Video Analyzer?

Describe scenes and actions in a video file without manual frame extraction. Extract on-screen text and UI elements from screen recordings or tutorials. Generate meeting notes by summarizing key points from recorded video calls.

100+ free AI tools

Writing, PDF, image, and developer tools — all in your browser.

AI Humanizer

Make AI text undetectable

AI Detector

Free, unlimited

PDF Tools

Merge, split, compress

Next Step

Use the skill detail page to evaluate fit and install steps. For a direct browser workflow, move into a focused tool route instead of staying in broader support surfaces.

Open Free Tools Try AI Detector