Last verified 2025-09-22 (Gemini 2.0 Flash) · 2025-09-22 (o3)

Gemini 2.0 Flash vs o3 — Pricing & Capability Comparison

Gemini 2.0 Flash charges $0.10 per million input tokens and $0.40 per million output tokens. o3 charges $2.00 and $8.00 for the same units, 20 times as much on both. The context window is 1M tokens for Gemini 2.0 Flash and 200K tokens for o3.

Metric                  Gemini 2.0 Flash    o3                Leader
Input price (per 1M)    $0.10               $2.00             Gemini 2.0 Flash
Output price (per 1M)   $0.40               $8.00             Gemini 2.0 Flash
Context window          1,000,000 tokens    200,000 tokens    Gemini 2.0 Flash
Cached input            Not published       Not published     No published data

Cost comparison for 10K-token workloads

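Cost per request for identical workloads (10,000 total tokens per request) under different input/output splits. The cached-prompt figures are consistent with billing cached input at the standard input rate, since no cached price is published for either model.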

Scenario                 Token split                            Gemini 2.0 Flash    o3
Balanced conversation    50% input · 50% output                 $0.0025             $0.0500
Input-heavy workflow     80% input · 20% output                 $0.0016             $0.0320
Generation heavy         30% input · 70% output                 $0.0031             $0.0620
Cached system prompt     90% cached input · 10% fresh output    $0.0013             $0.0260
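
The per-request figures above follow from the per-million prices. The sketch below is one way to recompute them. The function name request_cost and the dictionary layout are illustrative choices, not part of either vendor's API, and the cached scenario is assumed to be billed at the standard input rate because no cached price is published.

```python
# Prices in USD per 1M tokens, copied from the comparison above.
PRICES = {
    "Gemini 2.0 Flash": {"input": 0.10, "output": 0.40},
    "o3": {"input": 2.00, "output": 8.00},
}

# Fraction of each request's tokens that are input; the rest is output.
SCENARIOS = {
    "Balanced conversation": 0.5,
    "Input-heavy workflow": 0.8,
    "Generation heavy": 0.3,
    # Assumption: cached input is charged at the standard input rate,
    # since neither model has a published cached-input price.
    "Cached system prompt": 0.9,
}


def request_cost(model, total_tokens, input_share):
    """Return the USD cost of one request (hypothetical helper)."""
    price = PRICES[model]
    input_tokens = total_tokens * input_share
    output_tokens = total_tokens - input_tokens
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


for name, share in SCENARIOS.items():
    costs = [f"${request_cost(model, 10_000, share):.4f}" for model in PRICES]
    print(name, *costs)
```

Changing the 10_000 argument or the input shares gives estimates for other workloads under the same pricing assumptions.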

Frequently asked questions

Which model is cheaper per million input tokens?

Gemini 2.0 Flash costs $0.10 per million input tokens versus $2.00 for o3.

How do output prices compare?

Gemini 2.0 Flash charges $0.40 per million output tokens, while o3 costs $8.00 per million.

Which model supports a larger context window?

Gemini 2.0 Flash offers 1,000,000 tokens (1M) versus 200K for o3.
