Our fastest engine for high-fidelity speech synthesis, offering low-latency and cost-efficient audio generation. Gemini 2.5 Flash TTS is best for real-time assistants, high-volume narration, and conversational use cases that require fine-grained control over voice style and pacing.
Documentation
Visit the Text-to-Speech guide for full coverage of features and capabilities.
gemini-2.5-flash-preview-tts
| Property | Description |
|---|---|
| Model code | gemini-2.5-flash-preview-tts |
| Supported data types |
Inputs Text Output Audio |
| Token limits[*] |
Input token limit 8,192 Output token limit 16,384 |
| Capabilities | Supported Not supported Not supported Not Supported Not supported Not supported Not supported Not supported Not supported Not supported Not supported Not supported |
| Consumption options |
Supported Not supported Not supported |
| Versions |
|
| Latest update | December 2025 |