The Interactions API is now generally available. We recommend using this API for access to all the latest features and models.

Gemini 3.1 Flash TTS (Text-to-Speech) Preview

The Gemini 3.1 Flash TTS Preview model provides powerful, low-latency speech generation with natural outputs, steerable prompts, and new expressive audio tags for precise narration control.

Try in Google AI Studio

Documentation

The Gemini 3.1 Flash TTS Preview model introduces expressive audio tags for controlling narration, as well as overall improvements to naturalness, controllability, and multilinguality.

Visit the Text-to-Speech guide for full coverage of features and capabilities.

gemini-3.1-flash-tts-preview

Property	Description
Model code	`gemini-3.1-flash-tts-preview`
Supported data types	Inputs Text Output Audio
Token limits^[*]	Input token limit 8,192 Output token limit 16,384
Capabilities	Audio generation Supported Caching Not supported Code execution Not supported File search Not Supported Function calling Not supported Grounding with Google Maps Not supported Image generation Not supported Live API Not supported Search grounding Not supported Structured outputs Not supported Thinking Not supported URL context Not supported
Consumption options	Batch API Supported Flex inference Not supported Priority inference Not supported
Versions	Read the model version patterns for more details. `gemini-3.1-flash-tts-preview`
Latest update	April 2026
Knowledge cutoff	January 2025