Question 1

How much does GPT-4o cost per 1,000 tokens?

Accepted Answer

At the published rates of $2.50 per million input tokens and $10.00 per million output tokens, a typical 1,000 token request (≈70% input, 30% output) costs about $0.0047.

Question 2

Does GPT-4o offer cached input discounts?

Accepted Answer

GPT-4o drops input costs to $1.250 per million cached tokens. Using cached contexts, that same 1,000 token call totals $0.0039, a significant saving for chatbots and RAG systems.

Question 3

What is the context window for GPT-4o?

Accepted Answer

GPT-4o supports up to 128,000 tokens (128K), allowing large prompts and retrieval-augmented payloads in a single call.

Question 4

How fresh is the GPT-4o pricing data?

Accepted Answer

Pricing is sourced from https://platform.openai.com/docs/pricing and was last verified on 2025-09-22. The calculator updates automatically when models.json is refreshed.

Question 5

What is GPT-4o pricing per 1M tokens?

Accepted Answer

GPT-4o is currently priced at $2.50 per million input tokens and $10.00 per million output tokens. If you use prompt caching, cached input is billed at $1.25 per million tokens.

Question 6

How much does 100K GPT-4o tokens cost?

Accepted Answer

Using the official GPT-4o API rate, 100K input tokens cost about $0.250, 100K cached input tokens cost about $0.125, and 100K output tokens cost about $1.00. That quick budget check is useful when you are sizing multimodal chat, agent, or vision-heavy production flows before launch.

Question 7

Does GPT-4o have a 128K context window?

Accepted Answer

Yes. GPT-4o supports a 128,000-token (128K) context window, so you can send long prompts, larger retrieved context, or multi-turn chat state without immediately moving to a separate long-context model.

Question 8

Is GPT-4o cheaper than GPT-4.1 for multimodal apps?

Accepted Answer

GPT-4o is often chosen for production chat, vision, and multimodal workloads when teams want a strong balance of quality, latency, and cost. This page helps you compare GPT-4o directly with GPT-4.1, GPT-4o mini, and GPT-5 mini before you commit budget.

Scenario	Tokens in	Tokens out	Total tokens	Standard cost	Cached cost
Quick chat reply Single user question with a short assistant answer	650	220	870	$0.0038	$0.0030
Coding assistant session Multi-turn pair programming exchange (≈6 turns)	2,600	1,400	4,000	$0.0205	$0.0173
Knowledge base response Retrieval-augmented answer referencing multiple passages	12,000	3,000	15,000	$0.0600	$0.0450
Near-max context run Large document processing approaching the 128K token limit	112,000	16,000	128,000	$0.440	$0.300

Profile	Requests/day	Tokens/day	Daily cost	Monthly cost	Cached daily	Cached monthly
Team pilot	25	75,000	$0.375	$11.25	$0.313	$9.38
Product launch	100	500,000	$2.38	$71.25	$1.94	$58.13
Enterprise scale	500	3,000,000	$15.00	$450.00	$12.50	$375.00

GPT-4o Token Calculator, Pricing & 100K/1M Cost

Pick the route that matches what you searched for

Usage scenarios

Daily & monthly budgeting

Pricing notes

Frequently asked questions

Related resources

Other Token Calculators

Explore More Tools

All Token Calculators

100K Token Costs

Price Comparisons