Gemma Scope

Gemma Scope is a research tool for analyzing and understanding the inner workings of the Gemma 2 generative AI models. It lets you examine the behavior of individual layers of a Gemma 2 model while the model is processing requests. Researchers can apply this technique to investigate and help address critical concerns such as hallucinations, biases, and manipulation, with the goal of building safer and more trustworthy AI systems.

This tool provides researchers with a suite of sparse autoencoders for examining the features and representations learned by Gemma 2 base models. You use the tool by instrumenting a Gemma 2 model with the provided autoencoders, which decompose the activations of individual layers into sparse features as the model processes requests. For more information on how to analyze Gemma 2 models with this tool, see the Gemma Scope guide.

  • Evaluate the precise behavior of Gemma 2 models with layer-level analysis.
  • Use Gemma Scope to modify model layers and evaluate the impact on overall model behavior.
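
To make the two capabilities above more concrete, here is a minimal sketch of the general technique, not the Gemma Scope code itself. It assumes a JumpReLU-style sparse autoencoder with small, randomly initialized weights and random stand-in activations; the class name `JumpReLUSAE`, the helper `ablate_feature`, and all dimensions are hypothetical and chosen only for illustration. A real analysis would load the released autoencoder weights and capture activations from an actual Gemma 2 layer, as described in the Gemma Scope guide.

```python
import numpy as np


class JumpReLUSAE:
    """Toy sparse autoencoder with random, untrained weights (illustration only)."""

    def __init__(self, d_model, d_sae, seed=0):
        rng = np.random.default_rng(seed)
        self.W_enc = rng.normal(scale=0.1, size=(d_model, d_sae))
        self.b_enc = np.zeros(d_sae)
        self.W_dec = rng.normal(scale=0.1, size=(d_sae, d_model))
        self.b_dec = np.zeros(d_model)
        # Per-feature activation threshold (assumed constant here).
        self.threshold = np.full(d_sae, 0.05)

    def encode(self, x):
        """Map layer activations to sparse feature activations."""
        pre = x @ self.W_enc + self.b_enc
        # JumpReLU: keep a pre-activation only if it exceeds its threshold.
        return pre * (pre > self.threshold)

    def decode(self, f):
        """Reconstruct layer activations from feature activations."""
        return f @ self.W_dec + self.b_dec


def ablate_feature(sae, x, idx):
    """Return activations with feature `idx` zeroed out.

    The reconstruction error is added back so that only the chosen
    feature's contribution is removed from the original activations.
    """
    f = sae.encode(x)
    error = x - sae.decode(f)
    f_edit = f.copy()
    f_edit[..., idx] = 0.0
    return sae.decode(f_edit) + error


# Stand-in for activations captured at one layer: (tokens, d_model).
acts = np.random.default_rng(1).normal(size=(4, 16))
sae = JumpReLUSAE(d_model=16, d_sae=64)

features = sae.encode(acts)                      # layer-level analysis
top = int(np.argmax(features.sum(axis=0)))       # most active feature overall
patched = ablate_feature(sae, acts, top)         # layer-level intervention
```

In an actual experiment, `patched` would be written back into the model's forward pass at the instrumented layer so that the effect of removing the feature on the model's output can be compared against an unmodified run.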

Learn more

Read the blog post

Learn more about Gemma Scope's purpose and capabilities.
View code, notebooks, information, and discussions about Gemma Scope.
Run a working example Gemma Scope testing rig for evaluating Gemma 2 models.