This page documents updates to the Gemini API.
December 19, 2024
Model updates:
Released Gemini 2.0 Flash Thinking Mode for public preview. Thinking Mode is a test-time compute model that lets you see the model's thought process while it generates a response, and produces responses with stronger reasoning capabilities.
Read more about Gemini 2.0 Flash Thinking Mode in our overview page.
December 11, 2024
Model updates:
- Released Gemini 2.0 Flash Experimental
for public preview. Gemini 2.0 Flash Experimental's partial list of features includes:
- Twice as fast as Gemini 1.5 Pro
- Bidirectional streaming with our Multimodal Live API
- Multimodal response generation in the form of text, images, and speech
- Native tool use with multi-turn reasoning to use features like code execution, Search, function calling, and more
Read more about Gemini 2.0 Flash in our overview page.
November 21, 2024
Model updates:
- Released
gemini-exp-1121
, an even more powerful experimental Gemini API model.
Model updates:
- Updated the
gemini-1.5-flash-latest
andgemini-1.5-flash
model aliases to usegemini-1.5-flash-002
.- Change to
top_k
parameter: Thegemini-1.5-flash-002
model supportstop_k
values between 1 and 41 (exclusive). Values greater than 40 will be changed to 40.
- Change to
November 14, 2024
Model updates:
- Released
gemini-exp-1114
, a powerful experimental Gemini API model.
November 8, 2024
API updates
- Added support for Gemini in the OpenAI libraries / REST API.
October 31, 2024
API updates
October 3, 2024
Model updates:
- Released
gemini-1.5-flash-8b-001
, a stable version of our smallest Gemini API model.
September 24, 2024
Model updates:
- Released
gemini-1.5-pro-002
andgemini-1.5-flash-002
, two new stable versions of Gemini 1.5 Pro and 1.5 Flash, for general availability. - Updated the
gemini-1.5-pro-latest
model code to usegemini-1.5-pro-002
and thegemini-1.5-flash-latest
model code to usegemini-1.5-flash-002
. - Released
gemini-1.5-flash-8b-exp-0924
to replacegemini-1.5-flash-8b-exp-0827
. - Released the civic integrity safety filter for the Gemini API and AI Studio.
- Released support for two new parameters for Gemini 1.5 Pro and 1.5 Flash in
Python and NodeJS:
frequencyPenalty
andpresencePenalty
.
September 19, 2024
AI Studio updates:
- Added thumb-up and thumb-down buttons to model responses, to enable users to provide feedback on the quality of a response.
API updates:
- Added support for Google Cloud credits, which can now be used towards Gemini API usage.
September 17, 2024
AI Studio updates:
- Added an Open in Colab button that exports a prompt – and the code to run it – to a Colab notebook. The feature doesn't yet support prompting with tools (JSON mode, function calling, or code execution).
September 13, 2024
AI Studio updates:
- Added support for compare mode, which lets you compare responses across models and prompts to find the best fit for your use case.
August 30, 2024
Model updates
- Gemini 1.5 Flash supports supplying JSON schema through model configuration.
August 27, 2024
Model updates:
- Released the following
experimental models:
gemini-1.5-pro-exp-0827
gemini-1.5-flash-exp-0827
gemini-1.5-flash-8b-exp-0827
August 9, 2024
API updates
- Added support for PDF processing.
August 5, 2024
Model updates
- Fine-tuning support released for Gemini 1.5 Flash.
August 1, 2024
Model updates
- Released
gemini-1.5-pro-exp-0801
, a new experimental version of Gemini 1.5 Pro.
July 12, 2024
Model updates
- Support for Gemini 1.0 Pro Vision removed from Google AI services and tools.
June 27, 2024
Model updates
- General availability release for Gemini 1.5 Pro's 2M context window.
API updates
- Added support for code execution.
June 18, 2024
API updates
- Added support for context caching.
June 12, 2024
Model updates
- Gemini 1.0 Pro Vision deprecated.
May 23, 2024
Model updates
- Gemini 1.5 Pro
(
gemini-1.5-pro-001
) is generally available (GA). - Gemini 1.5 Flash
(
gemini-1.5-flash-001
) is generally available (GA).
May 14, 2024
API updates
- Introduced a 2M context window for Gemini 1.5 Pro (waitlist).
- Introduced pay-as-you-go billing for Gemini 1.0 Pro, with Gemini 1.5 Pro and Gemini 1.5 Flash billing coming soon.
- Introduced increased rate limits for the upcoming paid tier of Gemini 1.5 Pro.
- Added built-in video support to the File API.
- Added plain text support to the File API.
- Added support for parallel function calling, which returns more than one call at a time.
May 10, 2024
Model updates
- Released Gemini 1.5 Flash
(
gemini-1.5-flash-latest
) in preview.
April 9, 2024
Model updates
- Released Gemini 1.5 Pro
(
gemini-1.5-pro-latest
) in preview. - Released a new text embedding model,
text-embeddings-004
, which supports elastic embedding sizes under 768.
API updates
- Released the File API for temporarily storing media files for use in prompting.
- Added support for prompting with text, image, and audio data, also known as multimodal prompting. To learn more, see Prompting with media.
- Released System instructions in beta.
- Added Function calling mode, which defines the execution behavior for function calling.
- Added support for the
response_mime_type
configuration option, which lets you request responses in JSON format.
March 19, 2024
- Added support for tuning Gemini 1.0 Pro in Google AI Studio or with the Gemini API.
December 13 2023
4 new models:
- gemini-pro: New text model for a wide variety of tasks. Balances capability and efficiency.
- gemini-pro-vision: New multimodal model for a wide variety of tasks. Balances capability and efficiency.
- embedding-001: New embeddings model.
- aqa: A new specially tuned model that is trained to answer questions using text passages for grounding generated answers.
See Gemini models for more details.
API version updates:
- v1: The stable API channel.
- v1beta: Beta channel. This channel has features that may be under development.
See the API versions topic for more details.
API updates
GenerateContent
is a single unified endpoint for chat and text.- Streaming available through the
StreamGenerateContent
method. - Multimodal capability: Image is a new supported modality
- New beta features:
- Function Calling
- Semantic Retriever
- Attributed Question Answering (AQA)
- Updated candidate count: Gemini models only return 1 candidate.
- Different Safety Settings and SafetyRating categories. See safety settings for more details.
- Tuning models is not yet supported for Gemini models (Work in progress).