Gemini Models

OUR MOST ADVANCED MODEL

Gemini 2.5 Pro

Our state-of-the-art thinking model, capable of reasoning over complex problems in code, math, and STEM, as well as analyzing large datasets, codebases, and documents using long context.

Model details

Gemini 2.5 Pro

Property Description
Model code: gemini-2.5-pro
Supported data types
  • Inputs: Audio, images, video, text, and PDF
  • Output: Text
Token limits[*]
  • Input token limit: 1,048,576
  • Output token limit: 65,536
Capabilities
  • Audio generation: Not supported
  • Batch API: Supported
  • Caching: Supported
  • Code execution: Supported
  • Function calling: Supported
  • Image generation: Not supported
  • Live API: Not supported
  • Search grounding: Supported
  • Structured outputs: Supported
  • Thinking: Supported
  • URL context: Supported
Versions
Read the model version patterns for more details.
  • Stable: gemini-2.5-pro
Latest update: June 2025
Knowledge cutoff: January 2025
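
The model code is the identifier a request uses to select this model. As a rough sketch of where it goes, the snippet below builds (but does not send) a REST request. The base URL, the `generateContent` method name, the `x-goog-api-key` header, and the request body shape are assumptions about the public Gemini API that this page does not state, and `build_generate_request` is a hypothetical helper name.

```python
import json
import urllib.request

# Assumption: public Gemini API REST base URL; not stated on this page.
BASE_URL = "https://generativelanguage.googleapis.com/v1beta"


def build_generate_request(model_code, prompt, api_key, max_output_tokens=1024):
    """Build (but do not send) a generateContent request for a model code."""
    url = f"{BASE_URL}/models/{model_code}:generateContent"
    body = {
        "contents": [{"parts": [{"text": prompt}]}],
        # Keep this at or below the model's output token limit.
        "generationConfig": {"maxOutputTokens": max_output_tokens},
    }
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-goog-api-key": api_key},
        method="POST",
    )


request = build_generate_request(
    "gemini-2.5-pro", "Summarize this design doc.", "YOUR_API_KEY"
)
print(request.full_url)
```

Swapping in another model code from this page changes only the URL path segment; the rest of the request stays the same under these assumptions.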

Gemini 2.5 Pro TTS

Property Description
Model code: gemini-2.5-pro-preview-tts
Supported data types
  • Inputs: Text
  • Output: Audio
Token limits[*]
  • Input token limit: 8,000
  • Output token limit: 16,000
Capabilities
  • Audio generation: Supported
  • Batch API: Supported
  • Caching: Not supported
  • Code execution: Not supported
  • Function calling: Not supported
  • Image generation: Not supported
  • Live API: Not supported
  • Search grounding: Not supported
  • Structured outputs: Not supported
  • Thinking: Not supported
  • URL context: Not supported
Versions
Read the model version patterns for more details.
  • gemini-2.5-pro-preview-tts
Latest update: May 2025

FAST AND INTELLIGENT

Gemini 2.5 Flash

Our best model in terms of price-performance, offering well-rounded capabilities. 2.5 Flash is best for large-scale processing, low-latency, high-volume tasks that require thinking, and agentic use cases.

Model details

Gemini 2.5 Flash

Property Description
Model code: gemini-2.5-flash
Supported data types
  • Inputs: Text, images, video, audio
  • Output: Text
Token limits[*]
  • Input token limit: 1,048,576
  • Output token limit: 65,536
Capabilities
  • Audio generation: Not supported
  • Batch API: Supported
  • Caching: Supported
  • Code execution: Supported
  • Function calling: Supported
  • Image generation: Not supported
  • Live API: Not supported
  • Search grounding: Supported
  • Structured outputs: Supported
  • Thinking: Supported
  • URL context: Supported
Versions
Read the model version patterns for more details.
  • Stable: gemini-2.5-flash
Latest update: June 2025
Knowledge cutoff: January 2025
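
The token limits listed on this page can be checked before a request is sent. The sketch below hard-codes the limits for a few model codes from this page and reports which limit a planned request would exceed. `check_token_budget` is a hypothetical helper, and it assumes the input and output limits are enforced independently, which this page does not confirm.

```python
# (input token limit, output token limit), copied from the tables on this page.
TOKEN_LIMITS = {
    "gemini-2.5-pro": (1_048_576, 65_536),
    "gemini-2.5-flash": (1_048_576, 65_536),
    "gemini-2.5-flash-preview-tts": (8_000, 16_000),
    "gemini-2.0-flash": (1_048_576, 8_192),
}


def check_token_budget(model_code, input_tokens, max_output_tokens):
    """Return a list of problems; an empty list means the request fits."""
    input_limit, output_limit = TOKEN_LIMITS[model_code]
    problems = []
    if input_tokens > input_limit:
        problems.append(
            f"input exceeds limit by {input_tokens - input_limit} tokens"
        )
    if max_output_tokens > output_limit:
        problems.append(
            f"output cap exceeds limit by {max_output_tokens - output_limit} tokens"
        )
    return problems


# A 1,000,000-token prompt fits gemini-2.5-flash with the full output budget.
print(check_token_budget("gemini-2.5-flash", 1_000_000, 65_536))
# The same output budget is too large for gemini-2.0-flash.
print(check_token_budget("gemini-2.0-flash", 1_000_000, 65_536))
```

How `input_tokens` is counted is left to the caller; the sketch only compares numbers against the published limits.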

Gemini 2.5 Flash Preview

Property Description
Model code: gemini-2.5-flash-preview-09-2025
Supported data types
  • Inputs: Text, images, video, audio
  • Output: Text
Token limits[*]
  • Input token limit: 1,048,576
  • Output token limit: 65,536
Capabilities
  • Audio generation: Not supported
  • Batch API: Supported
  • Caching: Supported
  • Code execution: Supported
  • Function calling: Supported
  • Image generation: Not supported
  • Live API: Not supported
  • Search grounding: Supported
  • Structured outputs: Supported
  • Thinking: Supported
  • URL context: Supported
Versions
Read the model version patterns for more details.
  • Preview: gemini-2.5-flash-preview-09-2025
Latest update: September 2025
Knowledge cutoff: January 2025

Gemini 2.5 Flash Image

Property Description
Model code: gemini-2.5-flash-image-preview
Description: Our latest, fastest, and most efficient natively multimodal model that lets you generate and edit images conversationally.
Supported data types
  • Inputs: Images and text
  • Output: Images and text
Token limits[*]
  • Input token limit: 32,768
  • Output token limit: 32,768
Capabilities
  • Audio generation: Not supported
  • Batch API: Supported
  • Caching: Supported
  • Code execution: Not supported
  • Function calling: Not supported
  • Image generation: Supported
  • Live API: Not supported
  • Search grounding: Not supported
  • Structured outputs: Supported
  • Thinking: Not supported
  • URL context: Not supported
Versions
Read the model version patterns for more details.
  • Preview: gemini-2.5-flash-image-preview
Latest update: August 2025
Knowledge cutoff: June 2025

Gemini 2.5 Flash Live

Property Description
Model code: gemini-2.5-flash-native-audio-preview-09-2025 and gemini-2.5-flash-exp-native-audio-thinking-dialog
Supported data types
  • Inputs: Audio, video, text
  • Output: Audio and text
Token limits[*]
  • Input token limit: 128,000
  • Output token limit: 8,000
Capabilities
  • Audio generation: Supported
  • Batch API: Not supported
  • Caching: Not supported
  • Code execution: Not supported
  • Function calling: Supported
  • Image generation: Not supported
  • Live API: Supported
  • Search grounding: Supported
  • Structured outputs: Not supported
  • Thinking: Supported
  • URL context: Not supported
Versions
Read the model version patterns for more details.
  • Preview: gemini-2.5-flash-native-audio-preview-09-2025
  • Preview: gemini-2.5-flash-preview-native-audio-dialog
  • Preview: gemini-2.5-flash-preview-05-20
  • Preview: gemini-live-2.5-flash-preview
  • Experimental: gemini-2.5-flash-exp-native-audio-thinking-dialog
Latest update: September 2025
Knowledge cutoff: January 2025

Gemini 2.5 Flash TTS

Property Description
Model code: gemini-2.5-flash-preview-tts
Supported data types
  • Inputs: Text
  • Output: Audio
Token limits[*]
  • Input token limit: 8,000
  • Output token limit: 16,000
Capabilities
  • Audio generation: Supported
  • Batch API: Supported
  • Caching: Not supported
  • Code execution: Not supported
  • Function calling: Not supported
  • Image generation: Not supported
  • Live API: Not supported
  • Search grounding: Not supported
  • Structured outputs: Not supported
  • Thinking: Not supported
  • URL context: Not supported
Versions
Read the model version patterns for more details.
  • gemini-2.5-flash-preview-tts
Latest update: May 2025

ULTRA FAST

Gemini 2.5 Flash-Lite

Our fastest Flash model, optimized for cost-efficiency and high throughput.

A Gemini 2.5 Flash model optimized for cost-efficiency and high throughput.

Model details

Gemini 2.5 Flash-Lite

Property Description
Model code: gemini-2.5-flash-lite
Supported data types
  • Inputs: Text, image, video, audio, PDF
  • Output: Text
Token limits[*]
  • Input token limit: 1,048,576
  • Output token limit: 65,536
Capabilities
  • Audio generation: Not supported
  • Batch API: Supported
  • Caching: Supported
  • Code execution: Supported
  • Function calling: Supported
  • Image generation: Not supported
  • Live API: Not supported
  • Search grounding: Supported
  • Structured outputs: Supported
  • Thinking: Supported
  • URL context: Supported
Versions
Read the model version patterns for more details.
  • Stable: gemini-2.5-flash-lite
Latest update: July 2025
Knowledge cutoff: January 2025

Gemini 2.5 Flash-Lite Preview

Property Description
Model code: gemini-2.5-flash-lite-preview-09-2025
Supported data types
  • Inputs: Text, image, video, audio, PDF
  • Output: Text
Token limits[*]
  • Input token limit: 1,048,576
  • Output token limit: 65,536
Capabilities
  • Audio generation: Not supported
  • Batch API: Supported
  • Caching: Supported
  • Code execution: Supported
  • Function calling: Supported
  • Image generation: Not supported
  • Live API: Not supported
  • Search grounding: Supported
  • Structured outputs: Supported
  • Thinking: Supported
  • URL context: Supported
Versions
Read the model version patterns for more details.
  • Preview: gemini-2.5-flash-lite-preview-09-2025
Latest update: September 2025
Knowledge cutoff: January 2025
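
The capability lists for the 2.5 family can be treated as a lookup table when choosing a model. The sketch below encodes the "Supported" entries for a subset of the model codes listed on this page and returns the models that cover a set of required features. The snake_case capability keys and the `models_supporting` helper are illustrative names, not part of any API.

```python
# Capabilities marked "Supported" on this page, for a subset of model codes.
FULL_TEXT_SET = {
    "batch_api", "caching", "code_execution", "function_calling",
    "search_grounding", "structured_outputs", "thinking", "url_context",
}
SUPPORTED = {
    "gemini-2.5-pro": set(FULL_TEXT_SET),
    "gemini-2.5-flash": set(FULL_TEXT_SET),
    "gemini-2.5-flash-lite": set(FULL_TEXT_SET),
    "gemini-2.5-flash-image-preview": {
        "batch_api", "caching", "image_generation", "structured_outputs",
    },
    "gemini-2.5-flash-preview-tts": {"audio_generation", "batch_api"},
}


def models_supporting(*required):
    """Return the model codes whose capability set covers every requirement."""
    needed = set(required)
    return sorted(code for code, caps in SUPPORTED.items() if needed <= caps)


print(models_supporting("thinking", "function_calling"))
print(models_supporting("image_generation"))
```

Because the table is copied by hand, it would need updating whenever the capability lists on this page change.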


Previous Gemini Models

OUR SECOND GENERATION WORKHORSE MODEL

Gemini 2.0 Flash

Our second generation workhorse model, with a 1 million token context window.

Gemini 2.0 Flash delivers next-gen features and improved capabilities, including superior speed, native tool use, and a 1M token context window.

Model details

Gemini 2.0 Flash

Property Description
Model code: gemini-2.0-flash
Supported data types
  • Inputs: Audio, images, video, and text
  • Output: Text
Token limits[*]
  • Input token limit: 1,048,576
  • Output token limit: 8,192
Capabilities
  • Audio generation: Not supported
  • Batch API: Supported
  • Caching: Supported
  • Code execution: Supported
  • Function calling: Supported
  • Image generation: Not supported
  • Live API: Supported
  • Search grounding: Supported
  • Structured outputs: Supported
  • Thinking: Experimental
  • URL context: Not supported
Versions
Read the model version patterns for more details.
  • Latest: gemini-2.0-flash
  • Stable: gemini-2.0-flash-001
  • Experimental: gemini-2.0-flash-exp
Latest update: February 2025
Knowledge cutoff: August 2024

Gemini 2.0 Flash Image

Property Description
Model code: gemini-2.0-flash-preview-image-generation
Supported data types
  • Inputs: Audio, images, video, and text
  • Output: Text and images
Token limits[*]
  • Input token limit: 32,000
  • Output token limit: 8,192
Capabilities
  • Audio generation: Not supported
  • Batch API: Supported
  • Caching: Supported
  • Code execution: Not supported
  • Function calling: Not supported
  • Image generation: Supported
  • Live API: Not supported
  • Search grounding: Not supported
  • Structured outputs: Supported
  • Thinking: Not supported
  • URL context: Not supported
Versions
Read the model version patterns for more details.
  • Preview: gemini-2.0-flash-preview-image-generation
Note: gemini-2.0-flash-preview-image-generation is not currently supported in a number of countries in Europe, the Middle East, and Africa.
Latest update: May 2025
Knowledge cutoff: August 2024

Gemini 2.0 Flash Live

Property Description
Model code: gemini-2.0-flash-live-001
Supported data types
  • Inputs: Audio, video, and text
  • Output: Text and audio
Token limits[*]
  • Input token limit: 1,048,576
  • Output token limit: 8,192
Capabilities
  • Audio generation: Supported
  • Batch API: Not supported
  • Caching: Not supported
  • Code execution: Supported
  • Function calling: Supported
  • Image generation: Not supported
  • Live API: Supported
  • Search grounding: Supported
  • Structured outputs: Supported
  • Thinking: Not supported
  • URL context: Supported
Versions
Read the model version patterns for more details.
  • Preview: gemini-2.0-flash-live-001
Latest update: April 2025
Knowledge cutoff: August 2024

OUR SECOND GENERATION FAST MODEL

Gemini 2.0 Flash-Lite

Our second generation small workhorse model, with a 1 million token context window.

A Gemini 2.0 Flash model optimized for cost efficiency and low latency.

Model details

Gemini 2.0 Flash-Lite

Property Description
Model code: gemini-2.0-flash-lite
Supported data types
  • Inputs: Audio, images, video, and text
  • Output: Text
Token limits[*]
  • Input token limit: 1,048,576
  • Output token limit: 8,192
Capabilities
  • Audio generation: Not supported
  • Batch API: Supported
  • Caching: Supported
  • Code execution: Not supported
  • Function calling: Supported
  • Image generation: Not supported
  • Live API: Not supported
  • Search grounding: Not supported
  • Structured outputs: Supported
  • Thinking: Not supported
  • URL context: Not supported
Versions
Read the model version patterns for more details.
  • Latest: gemini-2.0-flash-lite
  • Stable: gemini-2.0-flash-lite-001
Latest update: February 2025
Knowledge cutoff: August 2024


Model version name patterns

Stable

Points to a specific stable model. Stable models usually don't change. Most production apps should use a specific stable model.

For example: gemini-2.5-flash.

Preview

Points to a preview model, which may be used for production. Preview models typically have billing enabled, might come with more restrictive rate limits, and will be deprecated with at least 2 weeks' notice.

For example: gemini-2.5-flash-preview-09-2025.

Latest

Points to the latest release for a specific model variation. This can be a stable, preview or experimental release. This alias will get hot-swapped with every new release of a specific model variation.

For example: gemini-flash-latest.

Experimental

Points to an experimental model, which will typically not be suitable for production use and will come with more restrictive rate limits. We release experimental models to gather feedback and get our latest updates into the hands of developers quickly.

Experimental models are not stable and availability of model endpoints is subject to change.
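
The naming patterns above can be approximated with a small classifier, for instance to warn when a production config points at a non-stable model. This is a heuristic sketch based only on the example names on this page; `classify_version` is a hypothetical helper, and the name alone is not authoritative. For example, this page lists gemini-2.0-flash-live-001 as a preview model and gemini-2.0-flash as a latest alias, and the heuristic would label both "stable". The Versions list for each model remains the source of truth.

```python
def classify_version(model_code):
    """Guess the release channel from a model code's name (heuristic only)."""
    parts = model_code.split("-")
    if parts[-1] == "latest":
        return "latest"
    if "exp" in parts:
        return "experimental"
    if "preview" in parts:
        return "preview"
    # No marker in the name: assume a stable model code.
    return "stable"


for code in (
    "gemini-2.5-flash",
    "gemini-2.5-flash-preview-09-2025",
    "gemini-flash-latest",
    "gemini-2.0-flash-exp",
):
    print(code, "->", classify_version(code))
```

The experimental check runs before the preview check so that a name carrying both markers is treated as the less stable of the two.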