Pricing models
Priced to help you bring your app to the world
Gemini 1.5 Flash Available now
Our fastest multimodal model with great performance for diverse, repetitive tasks and a 1 million context window. Now generally available for production use.
Free of charge
The Gemini API “free tier” is offered through the API service with lower rate limits for testing purposes. Google AI Studio usage is completely free in all available countries.
Rate Limits
15 RPM (requests per minute)
1 million TPM (tokens per minute)
1,500 RPD (requests per day)
Input Pricing
Free of charge
Output Pricing
Free of charge
Context caching
Free of charge, up to 1 million tokens of storage per hour
Tuning price
Input/output prices are the same for tuned models. Tuning service is free of charge.
Grounding with Google Search
Not available
Used to improve our products
Pay-as-you-go (prices in USD)
Scale your AI service with confidence using the Gemini API pay-as-you-go billing service. Set up billing easily in Google AI Studio by clicking on “Get API key”.
Rate limits
2,000 RPM (requests per minute)
4 million TPM (tokens per minute)
Prompts up to 128k tokens
Input Pricing
$0.075 / 1 million tokens
output Pricing
$0.30 / 1 million tokens
Context Caching
$0.01875 / 1 million tokens
Prompts longer than 128k
Input Pricing
$0.15 / 1 million tokens
output Pricing
$0.60 / 1 million tokens
Context Caching
$0.0375 / 1 million tokens
Context caching (storage)
$1.00 / 1 million tokens per hour
Tuning price
Input/output prices are the same for tuned models. Tuning service is free of charge.
Grounding with Google Search
$35 / 1K grounding requests (for up to 5K requests per day).
Used to improve our products
Gemini 1.5 Flash-8B Available now
Our smallest model for lower intelligence use cases with a 1 million token context window. Now generally available for production use.
Free of charge
The Gemini API “free tier” is offered through the API service with lower rate limits for testing purposes. Google AI Studio usage is completely free in all available countries.
Rate Limits
15 RPM (requests per minute)
1 million TPM (tokens per minute)
1,500 RPD (requests per day)
Input Pricing
Free of charge
Output Pricing
Free of charge
Context caching
Free of charge, up to 1 million tokens of storage per hour
Tuning price
Input/output prices are the same for tuned models. Tuning service is free of charge.
Grounding with Google Search
Not available
Used to improve our products
Pay-as-you-go (prices in USD)
Scale your AI service with confidence using the Gemini API pay-as-you-go billing service. Set up billing easily in Google AI Studio by clicking on “Get API key”.
Rate limits
4,000 RPM (requests per minute)
4 million TPM (tokens per minute)
Prompts up to 128k tokens
Input Pricing
$0.0375 / 1 million tokens
output Pricing
$0.15 / 1 million tokens
Context Caching
$0.01 / 1 million tokens
Prompts longer than 128k
Input Pricing
$0.075 / 1 million tokens
output Pricing
$0.30 / 1 million tokens
Context Caching
$0.02 / 1 million tokens
Context caching (storage)
$0.25 / 1 million tokens per hour
Tuning price
Input/output prices are the same for tuned models. Tuning service is free of charge.
Grounding with Google Search
$35 / 1K grounding requests (for up to 5K requests per day).
Used to improve our products
Gemini 1.5 Pro Available now
Our next-generation model with a breakthrough 2 million context window. Now generally available for production use.
Free of charge
The Gemini API “free tier” is offered through the API service with lower rate limits for testing purposes. Google AI Studio usage is completely free in all available countries.
Rate Limits
2 RPM (requests per minute)
32,000 TPM (tokens per minute)
50 RPD (requests per day)
Input Pricing
Free of charge
Output Pricing
Free of charge
Context caching
Not applicable
Tuning price
Not available
Grounding with Google Search
Not available
Used to improve our products
Pay-as-you-go (prices in USD)
Scale your AI service with confidence using the Gemini API pay-as-you-go billing service. Set up billing easily in Google AI Studio by clicking on “Get API key”.
Rate Limits
1,000 RPM (requests per minute)
4 million TPM (tokens per minute)
Prompts up to 128k tokens
Input Pricing
$1.25 / 1 million tokens
output Pricing
$5.00 / 1 million tokens
Context Caching
$0.3125 / 1 million tokens
Prompts longer than 128k
Input Pricing
$2.50 / 1 million tokens
output Pricing
$10.00 / 1 million tokens
Context Caching
$0.625 / 1 million tokens
Context caching (storage)
$4.50 / 1 million tokens per hour
Tuning price
Not available
Grounding with Google Search
$35 / 1K grounding requests (for up to 5K requests per day).
Used to improve our products
Gemini 1.0 Pro Available now
Our first-generation model offering only text and image reasoning. Generally available for production use.
Free of charge
The Gemini API “free tier” is offered through the API service with lower rate limits for testing purposes. Google AI Studio usage is completely free in all available countries.
Rate Limits
15 RPM (requests per minute)
32,000 TPM (tokens per minute)
1,500 RPD (requests per day)
Input Pricing
Free of charge
Output Pricing
Free of charge
Context caching
Not applicable
Tuning price
Not available
Grounding with Google Search
Not available
Used to improve our products
Pay-as-you-go (prices in USD)
Scale your AI service with confidence using the Gemini API pay-as-you-go billing service. Set up billing easily in Google AI Studio by clicking on “Get API key”.
Rate Limits
360 RPM (requests per minute)
120,000 TPM (tokens per minute)
30,000 RPD (requests per day)
Input Pricing
$0.50 / 1 million tokens
Output Pricing
$1.50 / 1 million tokens
Context caching
Not available
Tuning price
Not available
Grounding with Google Search
Not available
Used to improve our products
Text Embedding 004 Available now
Our state-of-the-art text embedding model.
Free of charge
The Gemini API “free tier” is offered through the API service with lower rate limits for testing purposes. Google AI Studio usage is completely free in all available countries.
Rate Limits
1,500 RPM (requests per minute)
Input Pricing
Free of charge
Output Pricing
Free of charge
Context caching
Not applicable
Tuning price
Not applicable
Used to improve our products