Verified 2025-09-22 · sourced from Google
Gemini 2.0 Flash-Lite Token Calculator & Cost Guide
Estimate what Google Gemini 2.0 Flash-Lite API usage will cost in dollars before you send a single request. Standard pricing is $0.075 per million input tokens and $0.30 per million output tokens, with a 1M-token context window.
Context window: 1,000,000 tokens
Input price: $0.075 / 1M tokens
Output price: $0.30 / 1M tokens
Cached input: Not published
Usage scenarios
Standard cost across common workloads. Cached input pricing is not published for this model, so no cached comparison is shown.
Scenario | Description | Tokens in | Tokens out | Total tokens | Standard cost (USD) |
---|---|---|---|---|---|
Quick chat reply | Single user question with a short assistant answer | 650 | 220 | 870 | $0.0001 |
Coding assistant session | Multi-turn pair programming exchange (≈6 turns) | 2,600 | 1,400 | 4,000 | $0.0006 |
Knowledge base response | Retrieval-augmented answer referencing multiple passages | 12,000 | 3,000 | 15,000 | $0.0018 |
Near-max context run | Large document processing approaching the 1M token limit | 880,000 | 120,000 | 1,000,000 | $0.102 |
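The scenario costs above follow from a simple per-token formula, sketched here in Python. The rates are hard-coded assumptions ($0.075 input and $0.30 output per million tokens, the values that reproduce the table's figures), and `estimate_cost` is an illustrative helper, not part of any Google SDK.

```python
from decimal import Decimal

# Assumed rates in USD per 1M tokens; re-check against the pricing page.
INPUT_PRICE_PER_M = Decimal("0.075")
OUTPUT_PRICE_PER_M = Decimal("0.30")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated standard (uncached) cost in USD."""
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M / TOKENS_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M / TOKENS_PER_M
    return input_cost + output_cost


# Token counts taken from the scenario table.
scenarios = {
    "Quick chat reply": (650, 220),
    "Coding assistant session": (2_600, 1_400),
    "Knowledge base response": (12_000, 3_000),
    "Near-max context run": (880_000, 120_000),
}

for name, (tokens_in, tokens_out) in scenarios.items():
    print(f"{name}: ${estimate_cost(tokens_in, tokens_out):.6f}")
```

`Decimal` is used instead of `float` so that very small dollar amounts do not pick up binary rounding noise before they are displayed.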
Daily & monthly budgeting
Translate usage into predictable operating expenses across popular deployment sizes. Monthly cost is the daily cost multiplied by 30.
Profile | Requests/day | Tokens/day | Daily cost | Monthly cost |
---|---|---|---|---|
Team pilot | 25 | 75,000 | $0.0112 | $0.337 |
Product launch | 100 | 500,000 | $0.0712 | $2.14 |
Enterprise scale | 500 | 3,000,000 | $0.450 | $13.50 |
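A minimal sketch of the daily-to-monthly projection, assuming a 30-day month (which matches the ratio between the table's daily and monthly columns). The table lists only total tokens per day, so the input/output split below is a hypothetical one chosen for illustration; the function names and the assumed rates ($0.075 input, $0.30 output per million tokens) are likewise not from the source.

```python
from decimal import Decimal

INPUT_PRICE_PER_M = Decimal("0.075")  # assumed USD per 1M input tokens
OUTPUT_PRICE_PER_M = Decimal("0.30")  # assumed USD per 1M output tokens
DAYS_PER_MONTH = 30                   # assumption: monthly = daily x 30


def daily_cost(input_tokens_per_day: int, output_tokens_per_day: int) -> Decimal:
    """Estimated USD cost for one day of traffic."""
    total = (Decimal(input_tokens_per_day) * INPUT_PRICE_PER_M
             + Decimal(output_tokens_per_day) * OUTPUT_PRICE_PER_M)
    return total / Decimal(1_000_000)


def monthly_cost(input_tokens_per_day: int, output_tokens_per_day: int,
                 days: int = DAYS_PER_MONTH) -> Decimal:
    """Project the daily cost over a billing month of `days` days."""
    return daily_cost(input_tokens_per_day, output_tokens_per_day) * days


# Hypothetical split of 3,000,000 tokens/day: 2M input, 1M output.
day = daily_cost(2_000_000, 1_000_000)
month = monthly_cost(2_000_000, 1_000_000)
print(f"daily ${day:.3f}, monthly ${month:.2f}")
```

Because output tokens cost four times as much as input tokens at these rates, the same daily token total can produce noticeably different bills depending on the split, so it is worth measuring the real ratio before budgeting.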
Frequently asked questions
How much does Gemini 2.0 Flash-Lite cost per 1,000 tokens?
At the published rates of $0.075 per million input tokens and $0.30 per million output tokens, a typical 1,000-token request (≈70% input, 30% output) costs about $0.00014.
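The per-1,000-token figure depends on the input/output mix. The sketch below computes a blended cost for any input share; `cost_per_1k_tokens` is a hypothetical helper, and the rates ($0.075 input, $0.30 output per million tokens) are assumptions to verify against the pricing page.

```python
from decimal import Decimal

INPUT_PRICE_PER_M = Decimal("0.075")  # assumed USD per 1M input tokens
OUTPUT_PRICE_PER_M = Decimal("0.30")  # assumed USD per 1M output tokens


def cost_per_1k_tokens(input_share: Decimal) -> Decimal:
    """Blended USD cost of 1,000 tokens, given the fraction that is input."""
    if not (Decimal(0) <= input_share <= Decimal(1)):
        raise ValueError("input_share must be between 0 and 1")
    output_share = 1 - input_share
    blended_per_m = (input_share * INPUT_PRICE_PER_M
                     + output_share * OUTPUT_PRICE_PER_M)
    # Convert a per-1M-token rate to a per-1K-token rate.
    return blended_per_m / Decimal(1000)


for share in (Decimal("1.0"), Decimal("0.7"), Decimal("0.0")):
    print(f"{share:.0%} input: ${cost_per_1k_tokens(share):.7f} per 1K tokens")
```

At a 70% input share this gives $0.0001425 per 1,000 tokens; an all-output request costs $0.0003, four times the all-input figure of $0.000075.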
What is the context window for Gemini 2.0 Flash-Lite?
Gemini 2.0 Flash-Lite supports up to 1,000,000 tokens (1M), allowing large prompts and retrieval-augmented payloads in a single call.
How fresh is the Gemini 2.0 Flash-Lite pricing data?
Pricing is sourced from https://ai.google.dev/gemini-api/docs/pricing and was last verified on 2025-09-22. The calculator updates automatically when models.json is refreshed.