Verified 2025-09-22 · sourced from OpenAI
GPT-4o (fine-tuned) Token Calculator & Cost Guide
Estimate OpenAI GPT-4o (fine-tuned) API costs in dollars before you send a single request. Standard pricing is $3.75 per million input tokens and $15.00 per million output tokens, and the model has a 128K-token context window.
Context window: 128,000 tokens
Input price: $3.75 / 1M tokens
Output price: $15.00 / 1M tokens
Cached input price: $1.875 / 1M tokens
Usage scenarios
Compare standard and cached pricing (where available) across common workloads. The cached column bills every input token at the cached rate, so it is a best-case figure.
Scenario | Description | Tokens in | Tokens out | Total tokens | Standard cost | Cached cost |
---|---|---|---|---|---|---|
Quick chat reply | Single user question with a short assistant answer | 650 | 220 | 870 | $0.0057 | $0.0045 |
Coding assistant session | Multi-turn pair programming exchange (≈6 turns) | 2,600 | 1,400 | 4,000 | $0.0307 | $0.0259 |
Knowledge base response | Retrieval-augmented answer referencing multiple passages | 12,000 | 3,000 | 15,000 | $0.0900 | $0.0675 |
Near-max context run | Large document processing approaching the 128K token limit | 112,000 | 16,000 | 128,000 | $0.6600 | $0.4500 |
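The per-request figures in the table can be reproduced with simple arithmetic. The following is a minimal Python sketch rather than an official calculator: the function name `request_cost` and the constant names are illustrative, and the cached path assumes every input token is billed at the cached rate, which is what the table's numbers imply.

```python
# Rates in USD per 1M tokens, as listed on this page.
INPUT_PER_M = 3.75
OUTPUT_PER_M = 15.00
CACHED_INPUT_PER_M = 1.875


def request_cost(tokens_in: int, tokens_out: int, cached: bool = False) -> float:
    """Estimate the USD cost of a single request.

    Assumption: with cached=True, every input token is billed at the
    cached rate, which is how the "Cached cost" column works out.
    """
    input_rate = CACHED_INPUT_PER_M if cached else INPUT_PER_M
    return (tokens_in * input_rate + tokens_out * OUTPUT_PER_M) / 1_000_000


if __name__ == "__main__":
    # (tokens in, tokens out) for the four scenarios in the table.
    scenarios = {
        "Quick chat reply": (650, 220),
        "Coding assistant session": (2_600, 1_400),
        "Knowledge base response": (12_000, 3_000),
        "Near-max context run": (112_000, 16_000),
    }
    for name, (t_in, t_out) in scenarios.items():
        standard = request_cost(t_in, t_out)
        cached = request_cost(t_in, t_out, cached=True)
        print(f"{name}: standard ${standard:.4f}, cached ${cached:.4f}")
```

Output tokens dominate the bill in every scenario because they cost four times as much as standard input tokens, so caching helps most when prompts are long relative to responses.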
Daily & monthly budgeting
Translate usage into predictable operating expenses across popular deployment sizes. Monthly figures assume a 30-day month.
Profile | Requests/day | Tokens/day | Daily cost | Monthly cost | Cached daily | Cached monthly |
---|---|---|---|---|---|---|
Team pilot | 25 | 75,000 | $0.563 | $16.88 | $0.469 | $14.06 |
Product launch | 100 | 500,000 | $3.56 | $106.88 | $2.91 | $87.19 |
Enterprise scale | 500 | 3,000,000 | $22.50 | $675.00 | $18.75 | $562.50 |
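The budgeting table does not state how each profile's daily tokens divide between input and output. Back-solving from the published daily costs suggests roughly two-thirds input for Team pilot and Enterprise scale and 70% input for Product launch; those shares are inferred, not documented. Under that assumption, and with hypothetical helper names `daily_cost` and `monthly_cost`, a sketch of the calculation looks like this:

```python
# Rates in USD per 1M tokens, as listed on this page.
INPUT_PER_M = 3.75
OUTPUT_PER_M = 15.00
CACHED_INPUT_PER_M = 1.875


def daily_cost(tokens_per_day: int, input_share: float, cached: bool = False) -> float:
    """Estimate daily USD spend from total tokens and an assumed input share."""
    tokens_in = tokens_per_day * input_share
    tokens_out = tokens_per_day - tokens_in
    input_rate = CACHED_INPUT_PER_M if cached else INPUT_PER_M
    return (tokens_in * input_rate + tokens_out * OUTPUT_PER_M) / 1_000_000


def monthly_cost(daily: float, days: int = 30) -> float:
    """Scale a daily figure to a month; 30 days matches the table."""
    return daily * days


if __name__ == "__main__":
    # (tokens per day, inferred input share); the shares are assumptions.
    profiles = {
        "Team pilot": (75_000, 2 / 3),
        "Product launch": (500_000, 0.70),
        "Enterprise scale": (3_000_000, 2 / 3),
    }
    for name, (tokens, share) in profiles.items():
        day = daily_cost(tokens, share)
        day_cached = daily_cost(tokens, share, cached=True)
        print(
            f"{name}: ${day:.2f}/day, ${monthly_cost(day):.2f}/month "
            f"(cached: ${day_cached:.2f}/day, ${monthly_cost(day_cached):.2f}/month)"
        )
```

If your own traffic has a different input share, pass it in directly; the result moves noticeably because output tokens are four times the price of input tokens.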
Pricing notes
- Fine-tuned GPT-4o pricing per 1M tokens: input $3.75, output $15.00, cached input $1.875.
Frequently asked questions
How much does GPT-4o (fine-tuned) cost per 1,000 tokens?
At the published rates of $3.75 per million input tokens and $15.00 per million output tokens, a typical 1,000-token request (≈70% input, 30% output) costs about $0.0071: 700 input tokens cost $0.0026 and 300 output tokens cost $0.0045.
Does GPT-4o (fine-tuned) offer cached input discounts?
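Yes. Cached input tokens are billed at $1.875 per million, half the standard input rate. If all of its input tokens are served from cache, that same 1,000-token call totals about $0.0058 instead of $0.0071, roughly 18% less, which adds up for chatbots and RAG systems that resend long shared prompts.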
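In practice only part of a prompt may be served from cache. The sketch below generalizes the 1,000-token example with a `cached_fraction` parameter; that parameter, the function name `cost_per_1k`, and the idea of a partial cache hit are illustrative assumptions, not something this page's calculator exposes.

```python
# Rates in USD per 1M tokens, as listed on this page.
INPUT_PER_M = 3.75
OUTPUT_PER_M = 15.00
CACHED_INPUT_PER_M = 1.875


def cost_per_1k(input_share: float = 0.7, cached_fraction: float = 0.0) -> float:
    """Blended USD cost of 1,000 tokens.

    input_share: fraction of the 1,000 tokens that are input (0.7 in the FAQ).
    cached_fraction: assumed fraction of input tokens billed at the cached rate.
    """
    tokens_in = 1_000 * input_share
    tokens_out = 1_000 - tokens_in
    cached_in = tokens_in * cached_fraction
    fresh_in = tokens_in - cached_in
    total = (
        fresh_in * INPUT_PER_M
        + cached_in * CACHED_INPUT_PER_M
        + tokens_out * OUTPUT_PER_M
    )
    return total / 1_000_000


if __name__ == "__main__":
    standard = cost_per_1k()
    for fraction in (0.0, 0.5, 1.0):
        cost = cost_per_1k(cached_fraction=fraction)
        saving = 1 - cost / standard
        print(f"cached_fraction={fraction:.1f}: ${cost:.4f} ({saving:.1%} saving)")
```

With `cached_fraction=0.0` and `1.0` the function reproduces the two FAQ figures; values in between give a more conservative estimate for workloads where only a shared prefix is cached.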
What is the context window for GPT-4o (fine-tuned)?
GPT-4o (fine-tuned) supports up to 128,000 tokens (128K), allowing large prompts and retrieval-augmented payloads in a single call.
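A request can be sanity-checked against the window before it is sent. This sketch assumes the 128,000-token window covers prompt and completion together, as the near-max scenario in the usage table (112,000 in plus 16,000 out) suggests; any separate per-response output cap is not modelled. The function names are hypothetical.

```python
CONTEXT_WINDOW = 128_000  # tokens, as listed on this page


def fits_context(tokens_in: int, max_tokens_out: int, window: int = CONTEXT_WINDOW) -> bool:
    """Return True if prompt plus requested completion fits in the window.

    Assumption: input and output tokens share one window.
    """
    return tokens_in + max_tokens_out <= window


def remaining_output_budget(tokens_in: int, window: int = CONTEXT_WINDOW) -> int:
    """Tokens left for the completion after the prompt, never negative."""
    return max(window - tokens_in, 0)


if __name__ == "__main__":
    print(fits_context(112_000, 16_000))
    print(fits_context(120_000, 16_000))
    print(remaining_output_budget(120_000))
```

Token counts here are plain integers; obtaining them from real text requires a tokenizer, which is outside the scope of this sketch.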
How fresh is the GPT-4o (fine-tuned) pricing data?
Pricing is sourced from https://platform.openai.com/docs/pricing and was last verified on 2025-09-22. The calculator updates automatically when models.json is refreshed.