Gemma Open Models

A family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini models

Get started

Responsible by design

Incorporating comprehensive safety measures, these models help ensure responsible and trustworthy AI solutions through curated datasets and rigorous tuning.

Unmatched performance at size

Gemma models achieve exceptional benchmark results at their 2B, 7B, 9B, and 27B sizes, even outperforming some larger open models.

Framework flexible

With Keras 3.0, enjoy seamless compatibility with JAX, TensorFlow, and PyTorch, empowering you to effortlessly choose and switch frameworks depending on your task.
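
As a minimal sketch of what switching frameworks can look like: Keras 3 reads the `KERAS_BACKEND` environment variable when it is first imported, so the backend has to be chosen before that import. The helper name `select_keras_backend` is hypothetical, and the commented-out import assumes Keras 3 is installed.

```python
import os

# Backend identifiers accepted by Keras 3 (PyTorch is spelled "torch").
SUPPORTED_BACKENDS = ("jax", "tensorflow", "torch")


def select_keras_backend(name: str) -> str:
    """Set KERAS_BACKEND for this process (hypothetical helper).

    Keras reads the variable once, at import time, so call this
    before the first `import keras`.
    """
    if name not in SUPPORTED_BACKENDS:
        raise ValueError(
            f"unknown backend {name!r}; expected one of {SUPPORTED_BACKENDS}"
        )
    os.environ["KERAS_BACKEND"] = name
    return name


select_keras_backend("jax")
# import keras  # would now load with the JAX backend (requires Keras 3)
```

Changing the backend after Keras has been imported has no effect in the same process, so the choice is usually made at the very top of a script or notebook.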

Introducing Gemma 2

Redesigned for outsized performance and unmatched efficiency, Gemma 2 optimizes for blazing-fast inference on diverse hardware.

Try Gemma 2 in Google AI Studio

5-shot

MMLU

The MMLU benchmark is a test that measures the breadth of knowledge and problem-solving ability acquired by large language models during pretraining.

25-shot

ARC-C

The ARC-c benchmark is the Challenge subset of the ARC dataset, containing only questions answered incorrectly by common (retrieval-based and word co-occurrence) algorithms.

5-shot

GSM8K

The GSM8K benchmark tests a language model's ability to solve grade-school-level math problems that frequently require multiple steps of reasoning.

3-5-shot

AGIEval

The AGIEval benchmark tests a language model's general intelligence by using questions derived from real-world exams designed to assess human intellectual abilities.

3-shot, CoT

BBH

The BBH (BIG-Bench Hard) benchmark focuses on tasks deemed beyond the abilities of current language models, testing their limits across various reasoning and understanding domains.

3-shot, F1

DROP

DROP is a reading comprehension benchmark that requires discrete reasoning over paragraphs.

5-shot

Winogrande

The Winogrande benchmark tests a language model's ability to resolve ambiguous fill-in-the-blank tasks with binary options, requiring generalized commonsense reasoning.

10-shot

HellaSwag

The HellaSwag benchmark challenges a language model's ability to understand and apply common sense reasoning by selecting the most logical ending to a story.

4-shot

MATH

MATH evaluates a language model's ability to solve complex mathematical word problems, requiring reasoning, multi-step problem-solving, and the understanding of mathematical concepts.

0-shot

ARC-e

The ARC-e benchmark tests a language model's advanced question-answering skills with genuine grade-school-level, multiple-choice science questions.

0-shot

PIQA

The PIQA benchmark tests a language model's ability to understand and apply physical commonsense knowledge by answering questions about everyday physical interactions.

0-shot

SIQA

The SIQA benchmark evaluates a language model's understanding of social interactions and social common sense by asking questions about people’s actions and their social implications.

0-shot

BoolQ

The BoolQ benchmark tests a language model's ability to answer naturally occurring yes/no questions, probing the model's ability to perform real-world natural language inference tasks.

5-shot

TriviaQA

The TriviaQA benchmark tests reading comprehension skills with question-answer-evidence triples.

5-shot

NQ

The NQ (Natural Questions) benchmark tests a language model's ability to find and comprehend answers within entire Wikipedia articles, simulating real-world question-answering scenarios.

pass@1

HumanEval

The HumanEval benchmark tests a language model's code generation abilities by evaluating whether its solutions pass functional unit tests for programming problems.

3-shot

MBPP

The MBPP benchmark tests a language model's ability to solve basic Python programming problems, focusing on fundamental programming concepts and standard library usage.
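
The `pass@1` label on HumanEval is the k = 1 case of the pass@k metric. The sketch below assumes the commonly used unbiased estimator: generate n samples per problem, count the c samples that pass the unit tests, and estimate the chance that at least one of k drawn samples passes. This page does not say which estimator produced its reported numbers, and the function names are illustrative.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem.

    n: samples generated, c: samples that pass the unit tests,
    k: attempt budget. Equals 1 - C(n - c, k) / C(n, k).
    """
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("need 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        return 1.0  # every size-k subset contains a passing sample
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average pass@k over problems, given one (n, c) pair per problem."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)


# With k = 1 the estimate reduces to the fraction of passing samples, c / n.
print(pass_at_k(n=10, c=3, k=1))
print(mean_pass_at_k([(10, 5), (10, 0)], k=1))
```

Averaging the per-problem estimates, as `mean_pass_at_k` does, gives a single benchmark-level score.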

Benchmark scores (%)

| Benchmark | Setting | Gemma 1 2.5B | Gemma 2 2.6B | Mistral | LLAMA 3 | Gemma 1 | Gemma 2 | Gemma 2 27B |
|---|---|---|---|---|---|---|---|---|
| MMLU | 5-shot | 42.3 | 51.3 | 62.5 | 66.6 | 64.4 | 71.3 | 75.2 |
| ARC-C | 25-shot | 48.5 | 55.4 | 60.5 | 59.2 | 61.1 | 68.4 | 71.4 |
| GSM8K | 5-shot | 15.1 | 23.9 | 39.6 | 45.7 | 51.8 | 68.6 | 74.0 |
| AGIEval | 3-5-shot | 24.2 | 30.6 | 44.0 | 45.9 | 44.9 | 52.8 | 55.1 |
| BBH | 3-shot, CoT | 35.2 | 41.9 | 56.0 | 61.1 | 59.0 | 68.2 | 74.9 |
| DROP | 3-shot, F1 | 48.5 | 52.0 | 63.8 | 58.4 | 56.3 | 69.4 | 74.2 |
| Winogrande | 5-shot | 66.8 | 70.9 | 78.5 | 76.1 | 79.0 | 80.6 | 83.7 |
| HellaSwag | 10-shot | 71.7 | 73.0 | 83.0 | 82.0 | 82.3 | 81.9 | 86.4 |
| MATH | 4-shot | 11.8 | 15.0 | 12.7 | - | 24.3 | 36.6 | 42.3 |
| ARC-e | 0-shot | 73.2 | 80.1 | 80.5 | - | 81.5 | 88.0 | 88.6 |
| PIQA | 0-shot | 77.3 | 77.8 | 82.2 | - | 81.2 | 81.7 | 83.2 |
| SIQA | 0-shot | 49.7 | 51.9 | 47.0 | - | 51.8 | 53.4 | 53.7 |
| BoolQ | 0-shot | 69.4 | 72.5 | 83.2 | - | 83.2 | 84.2 | 84.8 |
| TriviaQA | 5-shot | 53.2 | 59.4 | 62.5 | - | 63.4 | 76.6 | 83.7 |
| NQ | 5-shot | 12.5 | 16.7 | 23.2 | - | 23.0 | 29.2 | 34.5 |
| HumanEval | pass@1 | 22.0 | 17.7 | 26.2 | - | 32.3 | 40.2 | 51.8 |
| MBPP | 3-shot | 29.2 | 29.6 | 40.2 | - | 44.4 | 52.4 | 62.6 |

*These are the benchmarks for the pre-trained models; see the technical report for details on performance with other methodologies.

Read the technical report

Gemma model family

New release

Gemma 2

Gemma 2 offers three new, powerful, and efficient models available in 2, 9, and 27 billion parameter sizes, all with built-in safety advancements.

Get started on Hugging Face

Get started on Kaggle

New release

DataGemma

DataGemma are the first open models designed to connect LLMs with extensive real-world data drawn from Google's Data Commons.

Get started on Kaggle

Gemma 1

Gemma models are lightweight, text-to-text, decoder-only large language models, trained on a massive dataset of text, code, and mathematical content for a variety of natural language processing tasks.

Get started on Kaggle

RecurrentGemma

RecurrentGemma is a technically distinct model that leverages recurrent neural networks and local attention to improve memory efficiency.

Get started on Kaggle

PaliGemma

PaliGemma is an open vision-language model inspired by PaLI-3, leveraging SigLIP and Gemma, designed as a versatile model for transfer to a wide range of vision-language tasks.

Get started on Kaggle

CodeGemma

Harnessing the foundation of our original pre-trained Gemma models, CodeGemma brings powerful code completion and generation capabilities in sizes fit for your local computer.

Get started on Kaggle

Explore our tools

ShieldGemma

ShieldGemma is a suite of safety content classifier models built upon Gemma 2 to filter the inputs and outputs of AI models and keep the user safe.

Gemma Scope

Gemma Scope offers researchers unprecedented transparency into the decision-making processes of our Gemma 2 models.

Quick-start guides for developers

Discover quickstarts on Kaggle

Visit the Kaggle Models page to find quickstarts, code examples, and discussions for Gemma.

Open in Kaggle

Train and deploy on Google Cloud

Gemma 2 works best on Google Cloud, with end-to-end TPU optimization for market-leading performance and total cost of ownership on Vertex.

Open in Vertex AI

Try low-rank adaptation with JAX via Keras 3

Adapt Gemma models to your unique domain and data with the backend framework of your choice via Keras 3.
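
To make the idea behind low-rank adaptation concrete, here is a framework-free sketch. It is not the Keras implementation. The pretrained weight matrix W stays frozen and only two small matrices, B and A, of rank r are trained, so the effective weight is W + (alpha / r) * B·A. The dimensions, the `alpha` scaling, and the function names are illustrative assumptions.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_effective_weight(w, b, a, alpha, rank):
    """Return W + (alpha / rank) * (B @ A); W itself is never modified."""
    scale = alpha / rank
    delta = matmul(b, a)  # shape (d_out, d_in), but rank at most `rank`
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


def trainable_params(d_out, d_in, rank):
    """Parameters trained by LoRA versus full fine-tuning of one matrix."""
    return {"lora": rank * (d_out + d_in), "full": d_out * d_in}


# Toy example: a 2x3 frozen weight adapted with rank-1 factors.
w = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
b = [[1.0], [2.0]]       # d_out x rank
a = [[0.5, 0.0, -0.5]]   # rank x d_in
adapted = lora_effective_weight(w, b, a, alpha=2.0, rank=1)
print(adapted)
print(trainable_params(d_out=4096, d_in=4096, rank=4))
```

The parameter count shows why the approach is cheap: for a hypothetical 4096 x 4096 layer at rank 4, the adapter trains 32,768 values instead of roughly 16.8 million.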

Open in Colab

View all quickstarts in our documentation

Partner quick-start guides

Hugging Face

Utilize Hugging Face Transformers and TRL for fine-tuning and inference tasks with Gemma models.

NVIDIA

Fine-tune Gemma models with NVIDIA NeMo Framework and export to TensorRT-LLM for production.

LangChain

This tutorial shows you how to get started with Gemma and LangChain, running in Google Cloud or in your Colab environment.

Anyscale

These docs show how to use Gemma through Anyscale Endpoints as a fully managed API endpoint.

MongoDB

This article shows how to use Gemma as the foundation model in a retrieval-augmented generation (RAG) pipeline.

Weights and Biases

Dive deep into W&B's Model Registry and Launch tools through a step-by-step example using Google's Gemma models.

Gemma Cookbook

Explore a collection of practical recipes and examples showcasing the power and versatility of Gemma for tasks like image captioning with PaliGemma, code generation with CodeGemma, and building chatbots with fine-tuned Gemma models.

Get cooking

Access Gemma models today

Kaggle Models

Access Gemma 2 models on Kaggle

Vertex AI Model Garden

Customize Gemma 2 with your own data

Hugging Face Models

Access, fine-tune and deploy Gemma

Responsible AI Development

Responsibility by Design

Gemma models are pre-trained on carefully curated data and tuned for safety on top, helping to empower safe and responsible AI development.

Robust and Transparent Evaluation

Comprehensive evaluations and transparent reporting reveal model limitations so that you can adopt a responsible approach for each use case.

Powering Responsible Development

The Responsible Generative AI Toolkit helps developers design and implement Responsible AI best practices.

Explore Responsible Gen AI Toolkit

Optimized for Google Cloud

With Gemma models on Google Cloud, you can deeply customize the model to your specific needs using Vertex AI's fully managed tools or GKE’s self-managed option, then deploy it to flexible, cost-efficient, AI-optimized infrastructure.

Learn more in Google Cloud blog

Accelerating academic research with Google Cloud credits

The Academic Research Program recently concluded its application period, awarding Google Cloud credits to support researchers pushing the boundaries of scientific discovery using Gemma models. We are excited to see the groundbreaking research that emerges from this initiative.

Stay tuned for future opportunities to advance your research with Google Cloud.

Join the community

Connect, explore, and share your knowledge with others in the ML model community.

Kaggle

Discord

Blog
