Release notes

This page documents updates to the Gemini API.

May 14, 2024

API updates

  • Introduced a 2M context window for Gemini 1.5 Pro (waitlist).
  • Introduced pay-as-you-go billing for Gemini 1.0 Pro, with Gemini 1.5 Pro and Gemini 1.5 Flash billing coming soon.
  • Introduced increased rate limits for the upcoming paid tier of Gemini 1.5 Pro.
  • Added built-in video support to the File API.
  • Added plain text support to the File API.
  • Added support for parallel function calling, which returns more than one call at a time.

May 10, 2024

Model updates

April 9, 2024

Model updates

  • Released Gemini 1.5 Pro (gemini-1.5-pro-latest) in preview.
  • Released a new text embedding model, text-embeddings-004, which supports elastic embedding sizes under 768.

API updates

  • Released the File API for temporarily storing media files for use in prompting.
  • Added support for prompting with text, image, and audio data, also known as multimodal prompting. To learn more, see Prompting with media.
  • Released System instructions in beta.
  • Added Function calling mode, which defines the execution behavior for function calling.
  • Added support for the response_mime_type configuration option, which lets you request responses in JSON format.

March 19, 2024

December 13 2023

4 new models:

  • gemini-pro: New text model for a wide variety of tasks. Balances capability and efficiency.
  • gemini-pro-vision: New multimodal model for a wide variety of tasks. Balances capability and efficiency.
  • embedding-001: New embeddings model.
  • aqa: A new specially tuned model that is trained to answer questions using text passages for grounding generated answers.

See Gemini models for more details.

API version updates:

  • v1: The stable API channel.
  • v1beta: Beta channel. This channel has features that may be under development.

See the API versions topic for more details.

API updates

  • GenerateContent is a single unified endpoint for chat and text.
  • Streaming available through the StreamGenerateContent method.
  • Multimodal capability: Image is a new supported modality
  • New beta features:
  • Updated candidate count: Gemini models only return 1 candidate.
  • Different Safety Settings and SafetyRating categories. See safety settings for more details.
  • Tuning models is not yet supported for Gemini models (Work in progress).