Start building free of charge with generous limits, then scale up with pay-as-you-go pricing for your production ready applications.
Free
For developers and small projects getting started with the Gemini API.
- check_circleLimited access to certain models
- check_circleFree input & output tokens
- check_circleGoogle AI Studio access
- check_circleData used to improve our products*
Paid
For production applications that require higher volumes and advanced features.
- check_circleHigher rate limits for production deployments
- check_circleAccess to Context Caching
- check_circleBatch API (50% cost reduction)
- check_circleAccess to Google's most advanced models
- check_circleData *not* used to improve our products*
Enterprise
For large-scale deployments with custom needs for security, support, and compliance, powered by Vertex AI.
- check_circleAll features in Paid, plus optional access to:
- check_circleDedicated support channels
- check_circleAdvanced security & compliance
- check_circleProvisioned throughput
- check_circleVolume-based discounts (based on usage)
- check_circleML Ops, Model garden and more
Gemini 2.5 Pro
gemini-2.5-pro
Our state-of-the-art multipurpose model, which excels at coding and complex reasoning tasks.
Standard
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | $1.25, prompts <= 200k tokens $2.50, prompts > 200k tokens |
Output price (including thinking tokens) | Free of charge | $10.00, prompts <= 200k tokens $15.00, prompts > 200k |
Context caching price | Not available | $0.31, prompts <= 200k tokens $0.625, prompts > 200k $4.50 / 1,000,000 tokens per hour (storage price) |
Grounding with Google Search | Not available | 1,500 RPD (free), then $35 / 1,000 requests |
Used to improve our products | Yes | No |
Batch
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Not available | $0.625, prompts <= 200k tokens 1.25, prompts > 200k tokens |
Output price (including thinking tokens) | Not available | $5.00, prompts <= 200k tokens $7.50, prompts > 200k |
Context caching price | Not available | $0.31, prompts <= 200k tokens $0.625, prompts > 200k $4.50 / 1,000,000 tokens per hour (storage price) |
Grounding with Google Search | Not available | 1,500 RPD (free), then $35 / 1,000 requests |
Used to improve our products | Yes | No |
Gemini 2.5 Flash
gemini-2.5-flash
Our first hybrid reasoning model which supports a 1M token context window and has thinking budgets.
Standard
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | $0.30 (text / image / video) $1.00 (audio) |
Output price (including thinking tokens) | Free of charge | $2.50 |
Context caching price | Not available | $0.075 (text / image / video) $0.25 (audio) $1.00 / 1,000,000 tokens per hour (storage price) |
Grounding with Google Search | Free of charge, up to 500 RPD (limit shared with Flash-Lite RPD) | 1,500 RPD (free, limit shared with Flash-Lite RPD), then $35 / 1,000 requests |
Live API | Free of charge | Input: $0.50 (text), $3.00 (audio / image [video]) Output: $2.00 (text), $12.00 (audio) |
Used to improve our products | Yes | No |
Batch
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Not available | $0.15 (text / image / video) $0.50 (audio) |
Output price (including thinking tokens) | Not available | $1.25 |
Context caching price | Not available | $0.075 (text / image / video) $0.25 (audio) $1.00 / 1,000,000 tokens per hour (storage price) |
Grounding with Google Search | Not available | 1,500 RPD (free, limit shared with Flash-Lite RPD), then $35 / 1,000 requests |
Live API | Not available | Not available |
Used to improve our products | Yes | No |
Gemini 2.5 Flash Preview
gemini-2.5-flash-preview-09-2025
The latest model based on the 2.5 Flash model. 2.5 Flash Preview is best for large scale processing, low-latency, high volume tasks that require thinking, and agentic use cases.
Standard
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | $0.30 (text / image / video) $1.00 (audio) |
Output price (including thinking tokens) | Free of charge | $2.50 |
Context caching price | Not available | $0.075 (text / image / video) $0.25 (audio) $1.00 / 1,000,000 tokens per hour (storage price) |
Grounding with Google Search | Free of charge, up to 500 RPD (limit shared with Flash-Lite RPD) | 1,500 RPD (free, limit shared with Flash-Lite RPD), then $35 / 1,000 requests |
Live API | Free of charge | Input: $0.50 (text), $3.00 (audio / image [video]) Output: $2.00 (text), $12.00 (audio) |
Used to improve our products | Yes | No |
Batch
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Not available | $0.15 (text / image / video) $0.50 (audio) |
Output price (including thinking tokens) | Not available | $1.25 |
Context caching price | Not available | $0.075 (text / image / video) $0.25 (audio) $1.00 / 1,000,000 tokens per hour (storage price) |
Grounding with Google Search | Not available | 1,500 RPD (free, limit shared with Flash-Lite RPD), then $35 / 1,000 requests |
Live API | Not available | Not available |
Used to improve our products | Yes | No |
Gemini 2.5 Flash-Lite
gemini-2.5-flash-lite
Our smallest and most cost effective model, built for at scale usage.
Standard
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price (text, image, video) | Free of charge | $0.10 (text / image / video) $0.30 (audio) |
Output price (including thinking tokens) | Free of charge | $0.40 |
Context caching price | Not available | $0.025 (text / image / video) $0.125 (audio) $1.00 / 1,000,000 tokens per hour (storage price) |
Grounding with Google Search | Free of charge, up to 500 RPD (limit shared with Flash RPD) | 1,500 RPD (free, limit shared with Flash RPD), then $35 / 1,000 requests |
Used to improve our products | Yes | No |
Batch
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price (text, image, video) | Not available | $0.05 (text / image / video) $0.15 (audio) |
Output price (including thinking tokens) | Not available | $0.20 |
Context caching price | Not available | $0.025 (text / image / video) $0.125 (audio) $1.00 / 1,000,000 tokens per hour (storage price) |
Grounding with Google Search | Not available | 1,500 RPD (free, limit shared with Flash RPD), then $35 / 1,000 requests |
Used to improve our products | Yes | No |
Gemini 2.5 Flash-Lite Preview
gemini-2.5-flash-lite-preview-09-2025
The latest model based on Gemini 2.5 Flash lite optimized for cost-efficiency, high throughput and high quality.
Standard
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price (text, image, video) | Free of charge | $0.10 (text / image / video) $0.30 (audio) |
Output price (including thinking tokens) | Free of charge | $0.40 |
Context caching price | Not available | $0.025 (text / image / video) $0.125 (audio) $1.00 / 1,000,000 tokens per hour (storage price) |
Grounding with Google Search | Free of charge, up to 500 RPD (limit shared with Flash RPD) | 1,500 RPD (free, limit shared with Flash RPD), then $35 / 1,000 requests |
Used to improve our products | Yes | No |
Batch
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price (text, image, video) | Not available | $0.05 (text / image / video) $0.15 (audio) |
Output price (including thinking tokens) | Not available | $0.20 |
Context caching price | Not available | $0.025 (text / image / video) $0.125 (audio) $1.00 / 1,000,000 tokens per hour (storage price) |
Grounding with Google Search | Not available | 1,500 RPD (free, limit shared with Flash RPD), then $35 / 1,000 requests |
Used to improve our products | Yes | No |
Gemini 2.5 Flash Native Audio
gemini-2.5-flash-preview-native-audio-dialog
Our native audio models optimized for higher quality audio outputs with better pacing, voice naturalness, verbosity, and mood.
Preview models may change before becoming stable and have more restrictive rate limits.
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Not available | $0.50 (text) $3.00 (audio / video) |
Output price (including thinking tokens) | Not available | $2.00 (text) $12.00 (audio) |
Used to improve our products | Yes | No |
Gemini 2.5 Flash Image
gemini-2.5-flash-image
Our native image generation model, optimized for speed, flexibility, and contextual understanding. Text input and output is priced the same as 2.5 Flash.
Preview models may change before becoming stable and have more restrictive rate limits.
Standard
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Not available | $0.30 (text / image) |
Output price | Not available | $0.039 per image* |
Used to improve our products | Yes | No |
Batch
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Not available | $0.15 (text / image) |
Output price | Not available | $0.0195 per image* |
Used to improve our products | Yes | No |
[*] Image output is priced at $30 per 1,000,000 tokens. Output images up to 1024x1024px consume 1290 tokens and are equivalent to $0.039 per image.
Gemini 2.5 Flash Preview TTS
gemini-2.5-flash-preview-tts
Our 2.5 Flash text-to-speech audio model optimized for price-performant, low-latency, controllable speech generation.
Preview models may change before becoming stable and have more restrictive rate limits.
Standard
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | $0.50 (text) |
Output price | Free of charge | $10.00 (audio) |
Used to improve our products | Yes | No |
Batch
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Not available | $0.25 (text) |
Output price | Not available | $5.00 (audio) |
Used to improve our products | Yes | No |
Gemini 2.5 Pro Preview TTS
gemini-2.5-pro-preview-tts
Our 2.5 Pro text-to-speech audio model optimized for powerful, low-latency speech generation for more natural outputs and easier to steer prompts.
Preview models may change before becoming stable and have more restrictive rate limits.
Standard
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Not available | $1.00 (text) |
Output price | Not available | $20.00 (audio) |
Used to improve our products | Yes | No |
Batch
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Not available | $0.50 (text) |
Output price | Not available | $10.00 (audio) |
Used to improve our products | Yes | No |
Gemini 2.0 Flash
gemini-2.0-flash
Our most balanced multimodal model with great performance across all tasks, with a 1 million token context window, and built for the era of Agents.
Standard
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | $0.10 (text / image / video) $0.70 (audio) |
Output price | Free of charge | $0.40 |
Context caching price | Free of charge | $0.025 / 1,000,000 tokens (text/image/video) $0.175 / 1,000,000 tokens (audio) |
Context caching (storage) | Not available | $1.00 / 1,000,000 tokens per hour |
Image generation pricing | Free of charge | $0.039 per image* |
Tuning price | Not available | Not available |
Grounding with Google Search | Free of charge, up to 500 RPD | 1,500 RPD (free), then $35 / 1,000 requests |
Live API | Free of charge | Input: $0.35 (text), $2.10 (audio / image [video]) Output: $1.50 (text), $8.50 (audio) |
Used to improve our products | Yes | No |
Batch
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Not available | $0.05 (text / image / video) $0.35 (audio) |
Output price | Not available | $0.20 |
Context caching price | Not available | $0.025 / 1,000,000 tokens (text/image/video) $0.175 / 1,000,000 tokens (audio) |
Context caching (storage) | Not available | $1.00 / 1,000,000 tokens per hour |
Image generation pricing | Not available | $0.0195 per image* |
Tuning price | Not available | Not available |
Grounding with Google Search | Not available | 1,500 RPD (free), then $35 / 1,000 requests |
Live API | Not available | Not available |
Used to improve our products | Yes | No |
[*] Image output is priced at $30 per 1,000,000 tokens. Output images up to 1024x1024px consume 1290 tokens and are equivalent to $0.039 per image.
Gemini 2.0 Flash-Lite
gemini-2.0-flash-lite
Our smallest and most cost effective model, built for at scale usage.
Standard
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | $0.075 |
Output price | Free of charge | $0.30 |
Context caching price | Not available | Not available |
Context caching (storage) | Not available | Not available |
Tuning price | Not available | Not available |
Grounding with Google Search | Not available | Not available |
Used to improve our products | Yes | No |
Batch
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Not available | $0.0375 |
Output price | Not available | $0.15 |
Context caching price | Not available | Not available |
Context caching (storage) | Not available | Not available |
Tuning price | Not available | Not available |
Grounding with Google Search | Not available | Not available |
Used to improve our products | Yes | No |
Imagen 4
imagen-4.0-generate-001
, imagen-4.0-ultra-generate-001
, imagen-4.0-fast-generate-001
Our latest image generation model, with significantly better text rendering and better overall image quality.
Preview models may change before becoming stable and have more restrictive rate limits.
Free Tier | Paid Tier, per Image in USD | |
---|---|---|
Imagen 4 Fast image price | Not available | $0.02 |
Imagen 4 Standard image price | Not available | $0.04 |
Imagen 4 Ultra image price | Not available | $0.06 |
Used to improve our products | Yes | No |
Imagen 3
imagen-3.0-generate-002
Our state-of-the-art image generation model, available to developers on the paid tier of the Gemini API.
Free Tier | Paid Tier, per Image in USD | |
---|---|---|
Image price | Not available | $0.03 |
Used to improve our products | Yes | No |
Veo 3
veo-3.0-generate-001
, veo-3.0-fast-generate-001
Our latest video generation model, available to developers on the paid tier of the Gemini API.
Free Tier | Paid Tier, per second in USD | |
---|---|---|
Veo 3 Standard video with audio price (default) | Not available | $0.40 |
Veo 3 Fast video with audio price (default) | Not available | $0.15 |
Used to improve our products | Yes | No |
Veo 2
veo-2.0-generate-001
Our state-of-the-art video generation model, available to developers on the paid tier of the Gemini API.
Free Tier | Paid Tier, per second in USD | |
---|---|---|
Video price | Not available | $0.35 |
Used to improve our products | Yes | No |
Gemini Embedding
gemini-embedding-001
Our newest embeddings model, more stable and with higher rate limits than previous versions, available to developers on the free and paid tiers of the Gemini API.
Standard
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | $0.15 |
Used to improve our products | Yes | No |
Batch
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Not available | $0.075 |
Used to improve our products | Yes | No |
Gemini Robotics-ER 1.5 Preview
gemini-robotics-er-1.5-preview
Gemini Robotics-ER, short for Gemini Robotics-Embodied Reasoning, is a thinking model that enhances robots' abilities to understand and interact with the physical world.
Standard
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | $0.30 (text / image / video) $1.00 (audio) |
Output price (including thinking tokens) | Free of charge | $2.50 |
Grounding with Google Search | Free of charge, up to 500 RPD (limit shared with Flash-Lite RPD) | 1,500 RPD (free, limit shared with Flash-Lite RPD), then $35 / 1,000 requests |
Used to improve our products | Yes | No |
Batch
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Not available | Not available |
Output price (including thinking tokens) | Not available | Not available |
Grounding with Google Search | Not available | Not available |
Used to improve our products | Yes | No |
Gemma 3
Our lightweight, state-of the art, open model built from the same technology that powers our Gemini models.
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | Not available |
Output price | Free of charge | Not available |
Context caching price | Free of charge | Not available |
Context caching (storage) | Free of charge | Not available |
Tuning price | Not available | Not available |
Grounding with Google Search | Not available | Not available |
Used to improve our products | Yes | No |
Gemma 3n
Our open model built for efficient performance on everyday devices like mobile phones, laptops, and tablets.
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | Not available |
Output price | Free of charge | Not available |
Context caching price | Free of charge | Not available |
Context caching (storage) | Free of charge | Not available |
Tuning price | Not available | Not available |
Grounding with Google Search | Not available | Not available |
Used to improve our products | Yes | No |
[*] Google AI Studio usage is free of charge in all available regions. See Billing FAQs for details.
[**] Prices may differ from the prices listed here and the prices offered on Vertex AI. For Vertex prices, see the Vertex AI pricing page.
[***] If you are using dynamic retrieval to optimize costs, only requests that contain at least one grounding support URL from the web in their response are charged for Grounding with Google Search. Costs for Gemini always apply. Rate limits are subject to change.