Skip to content

Gemini Assistant

Verified

General-purpose AI assistant using Gemini API with voice and text support. Use when you need a smart AI assistant that can answer questions, have conversatio...

176 downloads
$ Add to .claude/skills/

About This Skill

# Gemini Assistant

A general-purpose AI assistant powered by Google's Gemini API. Supports both text and voice interactions.

Usage

Text Mode

```bash cd ~/.openclaw/agents/kashif/skills/gemini-assistant && python3 handler.py "Your question or message" ```

Voice Mode

```bash cd ~/.openclaw/agents/kashif/skills/gemini-assistant && python3 handler.py --audio /path/to/audio.ogg "optional context" ```

Response Format

The handler returns a JSON response:

```json { "message": "[[audio_as_voice]]\nMEDIA:/tmp/gemini_voice_xxx.ogg", "text": "Text response from Gemini" } ```

Configuration

Set your Gemini API key:

```bash export GEMINI_API_KEY="your-api-key-here" ```

Or create a `.env` file in the skill directory:

``` GEMINI_API_KEY=your-api-key-here ```

Model Options

The default model is `gemini-2.5-flash-native-audio-preview-12-2025` for audio support.

To use a different model, edit `handler.py`:

```python MODEL = "gemini-2.0-flash-exp" # For text-only ```

Requirements

  • `google-genai>=1.0.0`
  • `numpy>=1.24.0`
  • `soundfile>=0.12.0`
  • `librosa>=0.10.0` (for audio input)
  • FFmpeg (for audio conversion)

Features

  • 🎙️ Voice input/output support
  • 💬 Text conversations
  • 🔧 Configurable system instructions
  • ⚡ Fast responses with Gemini Flash

Use Cases

  • Build conversational assistants powered by Google Gemini models
  • Create multi-turn chat interfaces with Gemini's conversation capabilities
  • Implement tool-use patterns where Gemini calls external functions
  • Design system prompts and persona configurations for Gemini assistants
  • Build Gemini-powered Q&A systems with retrieval-augmented generation

Pros & Cons

Pros

  • +Conversational design patterns specific to Gemini's strengths
  • +Tool-use integration enables Gemini to interact with external systems
  • +RAG implementation guidance combines Gemini with custom knowledge bases

Cons

  • -Gemini-specific patterns may not translate to other AI models
  • -Only available on claude-code and openclaw platforms
  • -Google API pricing and rate limits affect production deployment viability

FAQ

What does Gemini Assistant do?
General-purpose AI assistant using Gemini API with voice and text support. Use when you need a smart AI assistant that can answer questions, have conversatio...
What platforms support Gemini Assistant?
Gemini Assistant is available on Claude Code, OpenClaw.
What are the use cases for Gemini Assistant?
Build conversational assistants powered by Google Gemini models. Create multi-turn chat interfaces with Gemini's conversation capabilities. Implement tool-use patterns where Gemini calls external functions.

100+ free AI tools

Writing, PDF, image, and developer tools — all in your browser.

Next Step

Use the skill detail page to evaluate fit and install steps. For a direct browser workflow, move into a focused tool route instead of staying in broader support surfaces.