Last verified 2025-09-22 (left) · 2025-09-22 (right)
Gemini 2.5 Flash vs o3 — Pricing & Capability Comparison
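Gemini 2.5 Flash charges $0.30 per million input tokens and $2.50 per million output tokens. o3 charges $2.00 per million input tokens and $8.00 per million output tokens. Gemini 2.5 Flash has a 1,000,000-token (1M) context window; o3 has a 200,000-token (200K) context window.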
Metric | Gemini 2.5 Flash | o3 | Leader |
---|---|---|---|
Input price (per 1M tokens) | $0.30 | $2.00 | Gemini 2.5 Flash |
Output price (per 1M tokens) | $2.50 | $8.00 | Gemini 2.5 Flash |
Context window | 1,000,000 tokens | 200,000 tokens | Gemini 2.5 Flash |
Cached input | Not published | Not published | No published data |
Cost comparison for 10K-token workloads
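Side-by-side pricing for identical workloads (10,000 total tokens per request) across different input/output splits. Because neither model has a published cached-input price, the cached system prompt figures are consistent with billing cached tokens at the standard input rate.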
Scenario | Token split | Gemini 2.5 Flash | o3 |
---|---|---|---|
Balanced conversation | 50% input · 50% output | $0.0140 | $0.0500 |
Input-heavy workflow | 80% input · 20% output | $0.0074 | $0.0320 |
Generation-heavy | 30% input · 70% output | $0.0184 | $0.0620 |
Cached system prompt | 90% cached input · 10% fresh output | $0.0052 | $0.0260 |
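
The per-request figures in the table can be reproduced from the per-million prices with simple arithmetic. The following is a minimal Python sketch, not an official calculator: the names `PRICES`, `SCENARIOS`, and `request_cost` are hypothetical, and it assumes cached input is billed at the standard input rate, since no cached-input price is published for either model.

```python
# Prices in USD per 1M tokens, taken from the comparison above.
PRICES = {
    "Gemini 2.5 Flash": {"input": 0.30, "output": 2.50},
    "o3": {"input": 2.00, "output": 8.00},
}

# Share of each request billed as input; the remainder is output.
SCENARIOS = {
    "Balanced conversation": 0.50,
    "Input-heavy workflow": 0.80,
    "Generation-heavy": 0.30,
    # Assumption: cached input is billed at the standard input rate.
    "Cached system prompt": 0.90,
}


def request_cost(model: str, total_tokens: int, input_share: float) -> float:
    """Return the USD cost of one request for the given model and token split."""
    price = PRICES[model]
    input_tokens = total_tokens * input_share
    output_tokens = total_tokens - input_tokens
    return (input_tokens * price["input"] + output_tokens * price["output"]) / 1_000_000


if __name__ == "__main__":
    for scenario, share in SCENARIOS.items():
        costs = " | ".join(
            f"{model}: ${request_cost(model, 10_000, share):.4f}" for model in PRICES
        )
        print(f"{scenario} | {costs}")
```

To estimate a different workload, change the `total_tokens` argument or the input share; if a cached-input rate is published later, it would need its own price entry rather than reusing the input rate.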
Frequently asked questions
Which model is cheaper per million input tokens?
Gemini 2.5 Flash costs $0.30 per million input tokens versus $2.00 for o3.
How do output prices compare?
Gemini 2.5 Flash charges $2.50 per million output tokens, while o3 costs $8.00 per million.
Which model supports a larger context window?
Gemini 2.5 Flash offers 1,000,000 tokens (1M) versus 200,000 tokens (200K) for o3.