Verified 2025-09-22 · sourced from OpenAI

GPT-4o (fine-tuned) Token Calculator & Cost Guide

Estimate what OpenAI GPT-4o (fine-tuned) API usage will cost in dollars before you send a single request. Standard pricing is $3.75 per million input tokens and $15.00 per million output tokens, and the model has a 128K-token context window.

Context window: 128,000 tokens
Input price: $3.75 / 1M tokens
Output price: $15.00 / 1M tokens
Cached input: $1.875 / 1M tokens

Usage scenarios

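Compare standard and cached pricing (where available) across common workloads. The cached cost column bills every input token at the cached rate, so it is the best case for each scenario.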

| Scenario | Description | Tokens in | Tokens out | Total tokens | Standard cost | Cached cost |
|---|---|---|---|---|---|---|
| Quick chat reply | Single user question with a short assistant answer | 650 | 220 | 870 | $0.0057 | $0.0045 |
| Coding assistant session | Multi-turn pair programming exchange (≈6 turns) | 2,600 | 1,400 | 4,000 | $0.0307 | $0.0259 |
| Knowledge base response | Retrieval-augmented answer referencing multiple passages | 12,000 | 3,000 | 15,000 | $0.0900 | $0.0675 |
| Near-max context run | Large document processing approaching the 128K token limit | 112,000 | 16,000 | 128,000 | $0.660 | $0.450 |
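
The scenario costs above follow from the listed per-million rates. A minimal sketch of that arithmetic, assuming the cached figure applies the cached rate to every input token; `request_cost` and `SCENARIOS` are illustrative names, not part of any OpenAI SDK:

```python
# Rates in USD per 1M tokens, copied from the pricing listed on this page.
INPUT_PER_M = 3.75
OUTPUT_PER_M = 15.00
CACHED_INPUT_PER_M = 1.875


def request_cost(tokens_in: int, tokens_out: int, cached: bool = False) -> float:
    """Return the USD cost of one request.

    Assumption: with cached=True, every input token is billed at the
    cached rate (the best case); output tokens are never discounted.
    """
    input_rate = CACHED_INPUT_PER_M if cached else INPUT_PER_M
    return (tokens_in * input_rate + tokens_out * OUTPUT_PER_M) / 1_000_000


# (scenario, tokens in, tokens out) taken from the usage scenarios.
SCENARIOS = [
    ("Quick chat reply", 650, 220),
    ("Coding assistant session", 2_600, 1_400),
    ("Knowledge base response", 12_000, 3_000),
    ("Near-max context run", 112_000, 16_000),
]

for name, t_in, t_out in SCENARIOS:
    standard = request_cost(t_in, t_out)
    best_case = request_cost(t_in, t_out, cached=True)
    print(f"{name}: standard ${standard:.4f}, cached ${best_case:.4f}")
```

In practice only part of a prompt is usually served from the cache, so real costs tend to fall between the two figures.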

Daily & monthly budgeting

Translate usage into predictable operating expenses across popular deployment sizes.

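| Profile | Requests/day | Tokens/day | Daily cost | Monthly cost | Cached daily | Cached monthly |
|---|---|---|---|---|---|---|
| Team pilot | 25 | 75,000 | $0.563 | $16.88 | $0.469 | $14.06 |
| Product launch | 100 | 500,000 | $3.56 | $106.88 | $2.91 | $87.19 |
| Enterprise scale | 500 | 3,000,000 | $22.50 | $675.00 | $18.75 | $562.50 |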
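
The budgeting figures can be approximated with a short script. This is a sketch under stated assumptions: the page does not give the input/output split per profile, so the splits below are back-solved from the listed daily costs, and a month is taken as 30 days because the monthly figures equal 30 times the daily ones. `PROFILES`, `daily_cost`, and `monthly_cost` are hypothetical names.

```python
# Rates in USD per 1M tokens, copied from the pricing listed on this page.
INPUT_PER_M = 3.75
OUTPUT_PER_M = 15.00
CACHED_INPUT_PER_M = 1.875
DAYS_PER_MONTH = 30  # assumption: monthly cost = 30 x daily cost

# Assumed (input tokens/day, output tokens/day) per profile. These splits
# are inferred from the listed costs, not stated by the source.
PROFILES = {
    "Team pilot": (50_000, 25_000),
    "Product launch": (350_000, 150_000),
    "Enterprise scale": (2_000_000, 1_000_000),
}


def daily_cost(tokens_in: int, tokens_out: int, cached: bool = False) -> float:
    """USD per day; cached=True bills all input at the cached rate."""
    input_rate = CACHED_INPUT_PER_M if cached else INPUT_PER_M
    return (tokens_in * input_rate + tokens_out * OUTPUT_PER_M) / 1_000_000


def monthly_cost(tokens_in: int, tokens_out: int, cached: bool = False) -> float:
    """USD per month, using the fixed DAYS_PER_MONTH assumption."""
    return daily_cost(tokens_in, tokens_out, cached) * DAYS_PER_MONTH


for name, (t_in, t_out) in PROFILES.items():
    print(
        f"{name}: ${daily_cost(t_in, t_out):.2f}/day, "
        f"${monthly_cost(t_in, t_out):.2f}/month, "
        f"cached ${monthly_cost(t_in, t_out, cached=True):.2f}/month"
    )
```

Because output tokens cost four times as much as input tokens, shifting the split toward output raises the bill noticeably even when the total token count stays the same.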

Pricing notes

  • Fine-tuned GPT-4o pricing per 1M tokens: input $3.75, output $15.00, cached input $1.875.

Frequently asked questions

How much does GPT-4o (fine-tuned) cost per 1,000 tokens?

At the published rates of $3.75 per million input tokens and $15.00 per million output tokens, a typical 1,000 token request (≈70% input, 30% output) costs about $0.0071.

Does GPT-4o (fine-tuned) offer cached input discounts?

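Yes. Cached input tokens are billed at $1.875 per million, half the standard input rate. If all 700 input tokens of that same 1,000 token call are served from the cache, the call totals $0.0058, about 18% less than the $0.0071 standard cost. Output tokens are not discounted, so the saving grows with the share of input tokens in a request, which favors chatbots and RAG systems that resend long, repeated contexts.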
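
The per-1,000-token figures in these answers can be reproduced, and extended to partial cache hits, with a small helper. This is an illustrative sketch: `cost_per_request`, `input_share`, and `cache_hit` are hypothetical names, and `cache_hit` is assumed to be the fraction of input tokens billed at the cached rate.

```python
# Rates in USD per 1M tokens, copied from the pricing listed on this page.
INPUT_PER_M = 3.75
OUTPUT_PER_M = 15.00
CACHED_INPUT_PER_M = 1.875


def cost_per_request(
    total_tokens: int = 1_000,
    input_share: float = 0.70,
    cache_hit: float = 0.0,
) -> float:
    """USD cost of a request with the given input share and cache hit rate.

    cache_hit is the assumed fraction (0.0 to 1.0) of input tokens that
    are billed at the cached rate; output tokens are never discounted.
    """
    if not 0.0 <= input_share <= 1.0 or not 0.0 <= cache_hit <= 1.0:
        raise ValueError("input_share and cache_hit must be between 0 and 1")
    tokens_in = total_tokens * input_share
    tokens_out = total_tokens - tokens_in
    cached_in = tokens_in * cache_hit
    fresh_in = tokens_in - cached_in
    total = (
        fresh_in * INPUT_PER_M
        + cached_in * CACHED_INPUT_PER_M
        + tokens_out * OUTPUT_PER_M
    )
    return total / 1_000_000


standard = cost_per_request()                 # no caching
fully_cached = cost_per_request(cache_hit=1.0)  # every input token cached
half_cached = cost_per_request(cache_hit=0.5)   # half of the input cached
saving = 1 - fully_cached / standard
print(f"standard ${standard:.4f}, fully cached ${fully_cached:.4f}, saving {saving:.1%}")
print(f"half cached ${half_cached:.4f}")
```

Treating the hit rate as a parameter makes it easy to see how quickly the saving shrinks when only part of the prompt is reused.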

What is the context window for GPT-4o (fine-tuned)?

GPT-4o (fine-tuned) supports up to 128,000 tokens (128K), allowing large prompts and retrieval-augmented payloads in a single call.
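
A simple pre-flight check can guard against exceeding the window. The sketch below assumes that prompt and output tokens share the 128,000-token window, as the near-max scenario on this page implies (112,000 in + 16,000 out), and that token counts have already been measured with a tokenizer. `fits_context` and `max_prompt_tokens` are hypothetical helpers, not API calls.

```python
CONTEXT_WINDOW = 128_000  # tokens, as listed on this page


def fits_context(
    prompt_tokens: int,
    max_output_tokens: int,
    window: int = CONTEXT_WINDOW,
) -> bool:
    """Return True if the prompt plus reserved output fits in the window.

    Assumption: input and output tokens count against one shared window.
    """
    return prompt_tokens + max_output_tokens <= window


def max_prompt_tokens(reserved_output_tokens: int, window: int = CONTEXT_WINDOW) -> int:
    """Largest prompt that still leaves room for the reserved output."""
    return max(window - reserved_output_tokens, 0)


print(fits_context(112_000, 16_000))  # near-max scenario from this page
print(fits_context(120_000, 16_000))  # prompt too large for the reserve
print(max_prompt_tokens(16_000))
```

Reserving output tokens up front avoids building a retrieval payload that leaves no room for the answer.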

How fresh is the GPT-4o (fine-tuned) pricing data?

Pricing is sourced from https://platform.openai.com/docs/pricing and was last verified on 2025-09-22. The calculator updates automatically when models.json is refreshed.
