Gemini 2.5 Pro Text-to-Speech

Our premium engine for studio-quality speech synthesis, offering high-fidelity and sophisticated audio generation. Gemini 2.5 Pro TTS is best for long-form content, professional narrations, and complex creative workflows that require the highest level of vocal clarity and natural prosody.

Documentation

Visit the Text-to-Speech guide for full coverage of features and capabilities.

gemini-2.5-pro-preview-tts

Property Description
Model code gemini-2.5-pro-preview-tts
Supported data types

Inputs

Text

Output

Audio

Token limits[*]

Input token limit

8,192

Output token limit

16,384

Capabilities

Audio generation

Supported

Batch API

Supported

Caching

Not supported

Code execution

Not supported

File search

Not Supported

Function calling

Not supported

Grounding with Google Maps

Not supported

Image generation

Not supported

Live API

Not supported

Search grounding

Not supported

Structured outputs

Not supported

Thinking

Not supported

URL context

Not supported

Versions
Read the model version patterns for more details.
  • gemini-2.5-pro-preview-tts
Latest update December 2025