The Interactions API is now generally available. We recommend using this API for access to all the latest features and models.

Gemini Robotics-ER 1.6

Gemini Robotics-ER 1.6 is a vision-language model (VLM) that brings Gemini's agentic capabilities to robotics. It's designed for advanced reasoning in the physical world, allowing robots to interpret complex visual data, perform spatial reasoning, and plan actions from natural language commands.

Try in Google AI Studio

Documentation

Visit the Robotics page for full coverage of features and capabilities.

gemini-robotics-er-1.6-preview

Property	Description
Model code	`gemini-robotics-er-1.6-preview`
Supported data types	Inputs Text, images, video, audio Output Text
Token limits^[*]	Input token limit 131,072 Output token limit 65,536
Capabilities	Audio generation Not supported Caching Supported Code execution Supported Computer use Supported File search Supported Function calling Supported Grounding with Google Maps Supported Image generation Not supported Live API Not supported Search grounding Supported Structured outputs Supported Thinking Supported URL context Supported
Consumption options	Batch API Supported Flex inference Supported Priority inference Supported
Versions	Read the model version patterns for more details. Preview: `gemini-robotics-er-1.6-preview`
Latest update	December 2025
Knowledge cutoff	January 2025