Verified 2026-03-06 · sourced from OpenAI
GPT-5.4 Token Calculator & Cost Guide
Estimate OpenAI GPT-5.4 API usage in dollars before you send a single request. Standard pricing is $2.50 per million input tokens and $15.00 per million output tokens with a 1.1M token context window.
Context window
1,050,000 tokens
Input price
$2.50 / 1M
Output price
$15.00 / 1M
Cached input
$0.250 / 1M
Usage scenarios
Compare standard and cached pricing (where available) across common workloads.
| Scenario | Tokens in | Tokens out | Total tokens | Standard cost | Cached cost |
|---|---|---|---|---|---|
Quick chat reply Single user question with a short assistant answer | 650 | 220 | 870 | $0.0049 | $0.0035 |
Coding assistant session Multi-turn pair programming exchange (≈6 turns) | 2,600 | 1,400 | 4,000 | $0.0275 | $0.0217 |
Knowledge base response Retrieval-augmented answer referencing multiple passages | 12,000 | 3,000 | 15,000 | $0.0750 | $0.0480 |
Near-max context run Large document processing approaching the 1.1M token limit | 924,000 | 126,000 | 1,050,000 | $4.20 | $2.12 |
Daily & monthly budgeting
Translate usage into predictable operating expenses across popular deployment sizes.
| Profile | Requests/day | Tokens/day | Daily cost | Monthly cost | Cached daily | Cached monthly |
|---|---|---|---|---|---|---|
| Team pilot | 25 | 75,000 | $0.500 | $15.00 | $0.388 | $11.63 |
| Product launch | 100 | 500,000 | $3.13 | $93.75 | $2.34 | $70.13 |
| Enterprise scale | 500 | 3,000,000 | $20.00 | $600.00 | $15.50 | $465.00 |
Pricing notes
- Latest GPT-5 flagship released on March 5, 2026.
- 1,050,000 token context window and 128,000 max output tokens.
- Cached input pricing is $0.25 per 1M tokens, a 90% discount from standard input.
Frequently asked questions
How much does GPT-5.4 cost per 1,000 tokens?
At the published rates of $2.50 per million input tokens and $15.00 per million output tokens, a typical 1,000 token request (≈70% input, 30% output) costs about $0.0062.
Does GPT-5.4 offer cached input discounts?
GPT-5.4 drops input costs to $0.250 per million cached tokens. Using cached contexts, that same 1,000 token call totals $0.0047, a significant saving for chatbots and RAG systems.
What is the context window for GPT-5.4?
GPT-5.4 supports up to 1,050,000 tokens (1.1M), allowing large prompts and retrieval-augmented payloads in a single call.
How fresh is the GPT-5.4 pricing data?
Pricing is sourced from https://openai.com/api/pricing/ and was last verified on 2026-03-06. The calculator updates automatically when models.json is refreshed.