Gemma models overview

Gemma is a family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini models. Developed by Google DeepMind and other teams across Google, Gemma is named after the Latin gemma, meaning precious stone. The Gemma model weights are supported by developer tools that promote innovation, collaboration, and the responsible use of artificial intelligence (AI). You can get multiple variations of Gemma for general and specific use cases:

  • Gemma 4: Solve a wide variety of generative AI tasks with text, audio and image input, support for over 140 languages, and long 128K and up to 256K context window.
  • EmbeddingGemma: Produce numerical representations of text for downstream tasks like information retrieval, semantic similarity search, classification, and clustering.
  • ShieldGemma 2: Evaluate the safety of generative AI models' input and output against defined policies.

Many more Gemma variants are available from Google and our AI developer community. Check them out on Kaggle Models and Hugging Face. Get inspired by what our community members have built with Gemma in the Gemmaverse.

The Gemma models are available to run in your applications and on your hardware, mobile devices, or hosted services. You can also customize these models using tuning techniques so that they excel at performing specific tasks that matter to you and your users. Gemma models draw inspiration and technological lineage from the Gemini family of models, and are made for the AI development community to extend and take further.

Ready to begin? Get started with Gemma models!