Verified 2025-12-18 ยท sourced from Google
Gemini 3 Flash Token Calculator, Pricing & 100K/1M Cost
Check Google Gemini 3 Flash pricing, estimate 100K and 1M token cost, and size a real API budget before you send a single request. Standard pricing is $0.50 per million input tokens and $3.00 per million output tokens with a 1M token context window.
Quick answer: Gemini 3 Flash pricing per 1M tokens is $0.50 input and $3.00 output. Context window: 1,000,000 tokens ยท Cached input: $0.050 / 1M.
Best for searches like Gemini 3 Flash token calculator, Gemini 3 Flash pricing, Gemini 3 Flash 100K tokens cost, Gemini 3 Flash 1M token cost.
Pick the route that matches what you searched for
Some visitors want a fast Gemini 3 Flash API cost estimate, others want a direct 100K or 1M token budget, and some are already comparing alternatives. These shortcuts remove the extra click.
Estimate a single request or prompt budget right now.
Jump straight to the most common budgeting checkpoint.
Use this when you are sizing production traffic or a monthly plan.
Open the closest head-to-head comparison instead of researching from scratch.
Context window
1,000,000 tokens
Input price
$0.50 / 1M
Output price
$3.00 / 1M
Cached input
$0.050 / 1M
Usage scenarios
Compare standard and cached pricing (where available) across common workloads.
| Scenario | Tokens in | Tokens out | Total tokens | Standard cost | Cached cost |
|---|---|---|---|---|---|
Quick chat reply Single user question with a short assistant answer | 650 | 220 | 870 | $0.0010 | $0.0007 |
Coding assistant session Multi-turn pair programming exchange (โ6 turns) | 2,600 | 1,400 | 4,000 | $0.0055 | $0.0043 |
Knowledge base response Retrieval-augmented answer referencing multiple passages | 12,000 | 3,000 | 15,000 | $0.0150 | $0.0096 |
Near-max context run Large document processing approaching the 1M token limit | 880,000 | 120,000 | 1,000,000 | $0.800 | $0.404 |
Daily & monthly budgeting
Translate usage into predictable operating expenses across popular deployment sizes.
| Profile | Requests/day | Tokens/day | Daily cost | Monthly cost | Cached daily | Cached monthly |
|---|---|---|---|---|---|---|
| Team pilot | 25 | 75,000 | $0.100 | $3.00 | $0.0775 | $2.33 |
| Product launch | 100 | 500,000 | $0.625 | $18.75 | $0.467 | $14.02 |
| Enterprise scale | 500 | 3,000,000 | $4.00 | $120.00 | $3.10 | $93.00 |
Pricing notes
- ๐ Latest Flash model (Dec 17, 2025) - Google's fastest frontier model
- Outperforms Gemini 2.5 Pro while being 3x faster
- Input: $0.50/MTok, Output: $3.00/MTok
- Audio input: $1.00/MTok
- Context caching: 90% cost reduction for repeated tokens
- Batch API: 50% discount available
- 30% fewer thinking tokens than 2.5 Pro on average
- Default model in Gemini app
Frequently asked questions
How much does Gemini 3 Flash cost per 1,000 tokens?
At the published rates of $0.50 per million input tokens and $3.00 per million output tokens, a typical 1,000 token request (โ70% input, 30% output) costs about $0.0013.
Does Gemini 3 Flash offer cached input discounts?
Gemini 3 Flash drops input costs to $0.050 per million cached tokens. Using cached contexts, that same 1,000 token call totals $0.0009, a significant saving for chatbots and RAG systems.
What is the context window for Gemini 3 Flash?
Gemini 3 Flash supports up to 1,000,000 tokens (1M), allowing large prompts and retrieval-augmented payloads in a single call.
How fresh is the Gemini 3 Flash pricing data?
Pricing is sourced from https://ai.google.dev/gemini-api/docs/pricing and was last verified on 2025-12-18. The calculator updates automatically when models.json is refreshed.