Announcing Gemini Embedding 2, our first fully multimodal embedding model.

Gemini API

The fastest path from prompt to production with Gemini, Veo, Nano Banana, and more.

Python

from google import genai

client = genai.Client()

response = client.models.generate_content(
    model="gemini-3-flash-preview",
    contents="Explain how AI works in a few words",
)

print(response.text)

JavaScript

import { GoogleGenAI } from "@google/genai";

const ai = new GoogleGenAI({});

async function main() {
  const response = await ai.models.generateContent({
    model: "gemini-3-flash-preview",
    contents: "Explain how AI works in a few words",
  });
  console.log(response.text);
}

await main();

Go

package main

import (
    "context"
    "fmt"
    "log"
    "google.golang.org/genai"
)

func main() {
    ctx := context.Background()
    client, err := genai.NewClient(ctx, nil)
    if err != nil {
        log.Fatal(err)
    }

    result, err := client.Models.GenerateContent(
        ctx,
        "gemini-3-flash-preview",
        genai.Text("Explain how AI works in a few words"),
        nil,
    )
    if err != nil {
        log.Fatal(err)
    }
    fmt.Println(result.Text())
}

Java

package com.example;

import com.google.genai.Client;
import com.google.genai.types.GenerateContentResponse;

public class GenerateTextFromTextInput {
  public static void main(String[] args) {
    Client client = new Client();

    GenerateContentResponse response =
        client.models.generateContent(
            "gemini-3-flash-preview",
            "Explain how AI works in a few words",
            null);

    System.out.println(response.text());
  }
}

C#

using System.Threading.Tasks;
using Google.GenAI;
using Google.GenAI.Types;

public class GenerateContentSimpleText {
  public static async Task main() {
    var client = new Client();
    var response = await client.Models.GenerateContentAsync(
      model: "gemini-3-flash-preview", contents: "Explain how AI works in a few words"
    );
    Console.WriteLine(response.Candidates[0].Content.Parts[0].Text);
  }
}

REST

curl "https://generativelanguage.googleapis.com/v1beta/models/gemini-3-flash-preview:generateContent" \
  -H "x-goog-api-key: $GEMINI_API_KEY" \
  -H 'Content-Type: application/json' \
  -X POST \
  -d '{
    "contents": [
      {
        "parts": [
          {
            "text": "Explain how AI works in a few words"
          }
        ]
      }
    ]
  }'

Start building

Follow our Quickstart guide to get an API key and make your first API call in minutes.

Meet the models

View all

Gemini 3.1 Pro New

Our most intelligent model, the best in the world for multimodal understanding, all built on state-of-the-art reasoning.

Gemini 3 Flash New

Frontier-class performance rivaling larger models at a fraction of the cost.

Gemini 3.1 Flash-Lite New

High-volume, cost-sensitive workhorse model with the performance and quality of the Gemini 3 series.

Nano Banana 2 and Nano Banana Pro

State-of-the-art image generation and editing models.

Veo 3.1

Our state-of-the-art video generation model, with native audio.

Gemini Robotics

A vision-language model (VLM) that brings Gemini's agentic capabilities to robotics and enables advanced reasoning in the physical world.

Explore Capabilities

Native Image Generation (Nano Banana)

Generate and edit highly contextual images natively with Gemini 2.5 Flash Image.

Long Context

Input millions of tokens to Gemini models and derive understanding from unstructured images, videos, and documents.

Structured Outputs

Constrain Gemini to respond with JSON, a structured data format suitable for automated processing.

Function Calling

Build agentic workflows by connecting Gemini to external APIs and tools.

Video Generation with Veo 3.1

Create high-quality video content from text or image prompts with our state-of-the-art model.

Voice Agents with Live API

Build real-time voice applications and agents with the Live API.

Tools

Connect Gemini to the world through built-in tools like Google Search, URL Context, Google Maps, Code Execution and Computer Use.

Document Understanding

Process up to 1000 pages of PDF files with full multimodal understanding or other text-based file types.

Thinking

Explore how thinking capabilities improve reasoning for complex tasks and agents.