Gemini Robotics-ER 1.6 is a vision-language model (VLM) that brings Gemini's agentic capabilities to robotics. It's designed for advanced reasoning in the physical world, allowing robots to interpret complex visual data, perform spatial reasoning, and plan actions from natural language commands.
Documentation
Visit the Robotics page for full coverage of features and capabilities.
gemini-robotics-er-1.6-preview
| Property | Description |
|---|---|
| Model code | gemini-robotics-er-1.6-preview |
| Supported data types |
Inputs Text, images, video, audio Output Text |
| Token limits[*] |
Input token limit 131,072 Output token limit 65,536 |
| Capabilities | Not supported Supported Supported Supported Supported Supported Supported Not supported Not supported Supported Supported Supported Supported |
| Consumption options |
Supported Supported Supported |
| Versions |
|
| Latest update | December 2025 |
| Knowledge cutoff | January 2025 |