Welcome to the Gemini API Cookbook

This cookbook provides a structured learning path for using the Gemini API, focusing on hands-on tutorials and practical examples.

For comprehensive API documentation, visit ai.google.dev.

Navigating the Cookbook

This cookbook is organized into two main categories:

Quick Starts: Step-by-step guides covering both introductory topics ("Get Started") and specific API features.
Examples: Practical use cases demonstrating how to combine multiple features.

We also showcase Demos in separate repositories, illustrating end-to-end applications of the Gemini API.

What's New?

Here are the recent additions and updates to the Gemini API and the Cookbook:

Gemini 2.5 models: Explore the capabilities of the latest Gemini 2.5 models (Flash and Pro)! See the Get Started guide and the thinking guide, since all 2.5 models are thinking models.
Imagen and Veo: Get started with our media generation models using the new Veo guide and Imagen guide!
Live API: Get started with the multimodal Live API and unlock new interactivity with Gemini.
Recently Added Guides:
- Browser as a tool: Use a web browser for live and internal (intranet) web interactions
- Code execution: Generate and run Python code to solve complex tasks and even output graphs
- Function calling: The function calling guide has been reworked to better explain how to use this convenient capability

1. Quick Starts

The quickstarts section contains step-by-step tutorials to get you started with Gemini and learn about its specific features.

To begin, you'll need:

A Google account.
An API key (create one in Google AI Studio).

We recommend starting with the following:

Authentication: Set up your API key for access.
Get started: Get started with Gemini models and the Gemini API, covering basic prompting and multimodal input.
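
As a minimal sketch of the authentication step, the snippet below reads the key from an environment variable instead of hard-coding it. The helper `load_api_key` and the variable name `GOOGLE_API_KEY` are assumptions made for illustration; follow the Authentication quickstart for the setup that matches your environment.

```python
import os


def load_api_key(env_var: str = "GOOGLE_API_KEY") -> str:
    """Return the API key stored in `env_var`, or raise if it is missing.

    The helper and the default variable name are illustrative assumptions,
    not part of the Gemini API itself.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"Set the {env_var} environment variable to a key created in Google AI Studio."
        )
    return key
```

Keeping the key out of source files means notebooks and scripts can be shared or committed without leaking it.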

Then, explore the other quickstart tutorials to learn about individual features:

Get started with Live API: A comprehensive overview of the Live API's capabilities
Get started with Veo: An introduction to our video generation capabilities
Get started with Imagen and Image-out: An introduction to our image generation capabilities
Grounding: Use Google Search for grounded responses
Code execution: Generate and run Python code to solve complex tasks and even output graphs
And many more
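
To give a feel for how features such as grounding and code execution are switched on, here is a rough sketch that builds a `generateContent` request body with one tool attached. The tool field names `google_search` and `code_execution` reflect the REST tool configuration as we understand it at the time of writing, and the helper `build_request_body` is hypothetical; check the corresponding quickstart before relying on either.

```python
import json
from typing import Optional


def build_request_body(prompt: str, tool: Optional[str] = None) -> dict:
    """Build a JSON-serializable request body for a single text prompt.

    `tool` is an assumed tool field name such as "google_search" or
    "code_execution"; when given, it is attached with an empty config.
    """
    body = {"contents": [{"parts": [{"text": prompt}]}]}
    if tool is not None:
        body["tools"] = [{tool: {}}]
    return body


# Example: ask the model to solve a task by writing and running code.
body = build_request_body(
    "What is the sum of the first 50 prime numbers?", tool="code_execution"
)
print(json.dumps(body, indent=2))
```

Swapping `"code_execution"` for `"google_search"` would request grounded responses instead, under the same assumptions.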

2. Examples (Practical Use Cases)

These examples demonstrate how to combine multiple Gemini API features or third-party tools to build more complex applications.

Illustrate a book: Use Gemini and Imagen to create illustrations for an open-source book
Animated Story Generation: Create animated videos by combining Gemini's story generation, Imagen, and audio synthesis
Plotting and mapping Live: Combine the Live API and code execution to solve complex tasks live
3D Spatial understanding: Use Gemini's 3D spatial abilities to understand 3D scenes
Gradio and Live API: Use Gradio to deploy your own instance of the Live API
And many more

3. Demos (End-to-End Applications)

These fully functional, end-to-end applications showcase the power of Gemini in real-world scenarios.

Gemini API quickstart: A Python Flask app running with the Google AI Gemini API, designed to get you started building with Gemini's multimodal capabilities
Multimodal Live API Web Console: A React-based starter app for using the Multimodal Live API over a WebSocket
Google AI Studio Starter Applets: A collection of small apps that demonstrate how Gemini can be used to create interactive experiences

Official SDKs

The Gemini API is a REST API. You can call it directly using tools like curl (see REST examples or the great Postman workspace), or use one of our official SDKs.
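
As a rough sketch of what calling the REST API directly involves, the snippet below builds a `generateContent` request using only the Python standard library. The `v1beta` base URL, the `x-goog-api-key` header, and the model name `gemini-2.5-flash` are assumptions that may change between releases, and the helper names are hypothetical; the REST examples remain the reference.

```python
import json
import urllib.request

# Assumed base URL for illustration; check the REST examples for the current one.
BASE_URL = "https://generativelanguage.googleapis.com/v1beta"


def build_generate_request(
    api_key: str, prompt: str, model: str = "gemini-2.5-flash"
) -> urllib.request.Request:
    """Build (but do not send) a POST request for a single text prompt."""
    url = f"{BASE_URL}/models/{model}:generateContent"
    payload = {"contents": [{"parts": [{"text": prompt}]}]}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-goog-api-key": api_key},
        method="POST",
    )


def generate_text(api_key: str, prompt: str) -> str:
    """Send the request and return the first text part of the first candidate."""
    request = build_generate_request(api_key, prompt)
    with urllib.request.urlopen(request) as response:
        reply = json.load(response)
    return reply["candidates"][0]["content"]["parts"][0]["text"]
```

Separating request construction from sending keeps the first function easy to inspect without network access; only `generate_text` needs a valid key.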

Important: Migration

With Gemini 2, we are offering a new SDK (google-genai, v1.0). The updated SDK is fully compatible with all Gemini API models and features, including recent additions like the Live API (audio and video streaming), improved tool usage (code execution, function calling, and integrated Google Search grounding), and media generation (Imagen and Veo). This SDK allows you to connect to the Gemini API through either Google AI Studio or Vertex AI.

The google-generativeai package will continue to support the original Gemini models. It can also be used with Gemini 2 models, but with a limited feature set. All new features will be developed in the new Google GenAI SDK.

See the migration guide for details.
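
For orientation, a minimal sketch of a single text request with the new SDK might look like the following. It assumes the package is installed with `pip install google-genai`; the helper `ask_gemini`, its optional `client` parameter, and the model name `gemini-2.5-flash` are illustrative choices rather than anything prescribed by the SDK.

```python
def ask_gemini(
    prompt: str, api_key: str, model: str = "gemini-2.5-flash", client=None
) -> str:
    """Send one prompt and return the text of the reply.

    `client` is an optional, already-constructed client; passing one lets
    callers reuse a connection or substitute a stub in tests.
    """
    if client is None:
        # Imported lazily so this module still loads where the SDK is absent.
        from google import genai

        client = genai.Client(api_key=api_key)
    response = client.models.generate_content(model=model, contents=prompt)
    return response.text
```

With a real key, `ask_gemini("Explain how AI works in a few words", api_key=...)` would return the model's reply as a string.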

Get Help

Ask a question on the Google AI Developer Forum.

The Gemini API on Google Cloud Vertex AI

For enterprise developers, the Gemini API is also available on Google Cloud Vertex AI. See this repo for examples.

Contributing

Contributions are welcome! See CONTRIBUTING.md for details.

Thank you for developing with the Gemini API! We're excited to see what you create.

