Verified 2026-05-15 · sourced from DeepSeek
DeepSeek V4 Flash Token Calculator, Pricing & 100K/1M Cost
Check DeepSeek DeepSeek V4 Flash pricing, estimate 100K and 1M token cost, and size a real API budget before you send a single request. Standard pricing is $0.14 per million input tokens and $0.28 per million output tokens with a 1M token context window.
Quick answer: DeepSeek V4 Flash pricing per 1M tokens is $0.14 input and $0.28 output. Context window: 1,000,000 tokens · Cached input: $0.003 / 1M.
Best for searches like DeepSeek V4 Flash token calculator, DeepSeek V4 Flash pricing, DeepSeek V4 Flash 100K tokens cost, DeepSeek V4 Flash 1M token cost.
Pick the route that matches what you searched for
Some visitors want a fast DeepSeek V4 Flash API cost estimate, others want a direct 100K or 1M token budget, and some are already comparing alternatives. These shortcuts remove the extra click.
Estimate a single request or prompt budget right now.
Jump straight to the most common budgeting checkpoint.
Use this when you are sizing production traffic or a monthly plan.
Open the closest head-to-head comparison instead of researching from scratch.
Context window
1,000,000 tokens
Input price
$0.14 / 1M
Output price
$0.28 / 1M
Cached input
$0.003 / 1M
Usage scenarios
Compare standard and cached pricing (where available) across common workloads.
| Scenario | Tokens in | Tokens out | Total tokens | Standard cost | Cached cost |
|---|---|---|---|---|---|
Quick chat reply Single user question with a short assistant answer | 650 | 220 | 870 | $0.0002 | $0.0001 |
Coding assistant session Multi-turn pair programming exchange (≈6 turns) | 2,600 | 1,400 | 4,000 | $0.0008 | $0.0004 |
Knowledge base response Retrieval-augmented answer referencing multiple passages | 12,000 | 3,000 | 15,000 | $0.0025 | $0.0009 |
Near-max context run Large document processing approaching the 1M token limit | 880,000 | 120,000 | 1,000,000 | $0.157 | $0.0361 |
Daily & monthly budgeting
Translate usage into predictable operating expenses across popular deployment sizes.
| Profile | Requests/day | Tokens/day | Daily cost | Monthly cost | Cached daily | Cached monthly |
|---|---|---|---|---|---|---|
| Team pilot | 25 | 75,000 | $0.0140 | $0.420 | $0.0071 | $0.214 |
| Product launch | 100 | 500,000 | $0.0910 | $2.73 | $0.0430 | $1.29 |
| Enterprise scale | 500 | 3,000,000 | $0.560 | $16.80 | $0.286 | $8.57 |
Pricing notes
- DeepSeek says deepseek-chat and deepseek-reasoner map to DeepSeek V4 Flash compatibility modes.
- Cache-hit input pricing was reduced to $0.0028 per 1M tokens effective 2026-04-26 12:15 UTC.
Frequently asked questions
How much does DeepSeek V4 Flash cost per 1,000 tokens?
At the published rates of $0.14 per million input tokens and $0.28 per million output tokens, a typical 1,000 token request (≈70% input, 30% output) costs about $0.0002.
Does DeepSeek V4 Flash offer cached input discounts?
DeepSeek V4 Flash drops input costs to $0.003 per million cached tokens. Using cached contexts, that same 1,000 token call totals $0.0001, a significant saving for chatbots and RAG systems.
What is the context window for DeepSeek V4 Flash?
DeepSeek V4 Flash supports up to 1,000,000 tokens (1M), allowing large prompts and retrieval-augmented payloads in a single call.
How fresh is the DeepSeek V4 Flash pricing data?
Pricing is sourced from https://api-docs.deepseek.com/quick_start/pricing/ and was last verified on 2026-05-15. The calculator updates automatically when models.json is refreshed.