Priced to help you bring your app to the world
Preview
Available now
Our next-generation model with a breakthrough 1 million context window. Currently available in Preview.
Free of charge
Rate Limits*#
2 RPM (requests per minute)
32,000 TPM (tokens per minute)
50 RPD (requests per day)
Price (input)
Free of charge
Price (output)
Free of charge
Prompts/responses used to improve our products
Yes Learn more
Pay-as-you-go
Rate Limits*#
10 RPM (requests per minute)
10 million TPM (tokens per minute)
2,000 RPD (requests per day)
Price (input)
$7 / 1 million tokens (preview pricing)
Price (output)
$21 / 1 million tokens (preview pricing)
Prompts/responses used to improve our products
No Learn more
Our best performing model with features for a wide variety of text and image reasoning tasks. Available in Google AI Studio.
Free of charge
Rate Limits*#
15 RPM (requests per minute)
32,000 TPM (tokens per minute)
1500 RPD (requests per day)
Price (input)
Free of charge
Price (output)
Free of charge
Prompts/responses used to improve our products
Yes Learn more
Pay-as-you-go
Rate Limits*#
360 RPM (requests per minute)
120,000 TPM (tokens per minute)
30,000 RPD (requests per day)
Price (input)
$0.50 / 1 million tokens **
Price (output)
$1.50 / 1 million tokens **
Prompts/responses used to improve our products
No Learn more
* Specified rate limits are not guaranteed and actual capacity may vary
# Apply for an increased maximum rate limit
** Tuned model inference costs are billed at the same price as the base models.
Build with Vertex AI on Google Cloud