DeepSeek

DeepSeek V4 Flash Pricing & Token Costs (2026)

Per 1M tokens: input $0.14 · output $0.28 · cached input $0.003. Context window: 1,000,000 tokens. Includes source and verification details.


TL;DR — Pricing Quick Summary

  • Input pricing: $0.14 per 1M tokens ($0.00014 per 1K)
  • Output pricing: $0.28 per 1M tokens ($0.00028 per 1K)
  • Prompt caching: $0.003 per 1M cached input tokens, about 98% less than uncached input on repeated context
  • Context window: 1,000,000 tokens
  • Typical monthly cost: $1.96 for 10M input + 2M output tokens
  • Daily cost example: $0.0210 for 100K tokens (50K in, 50K out)

Key metrics

  • Context window: 1,000,000 tokens
  • Input price: $0.14 / 1M tokens
  • Output price: $0.28 / 1M tokens
  • Cached input: $0.003 / 1M tokens

Trusted sources

Official link: https://api-docs.deepseek.com/quick_start/pricing/

Last verified: 2026-05-15

Pricing notes
  • DeepSeek says deepseek-chat and deepseek-reasoner map to DeepSeek V4 Flash compatibility modes.
  • Cache-hit input pricing was reduced to $0.0028 per 1M tokens effective 2026-04-26 12:15 UTC (shown rounded to $0.003 elsewhere on this page).

Multi-currency (per 1M tokens)

Currency | Input  | Cached | Output
USD      | $0.14  | $0.00  | $0.28
CNY      | ¥1.00  | ¥0.02  | ¥2.00
EUR      | 0,13 € | 0,00 € | 0,26 €
JPY      | ¥20    | ¥0     | ¥41


Frequently Asked Questions

What is the cost per 1M tokens for DeepSeek V4 Flash?

DeepSeek V4 Flash costs $0.14 per 1M input tokens and $0.28 per 1M output tokens, with cached input at $0.003 per 1M tokens.

How much does it cost per 1K tokens?

Per 1K tokens: $0.00014 for input and $0.00028 for output (about $0.0001 and $0.0003 when rounded to four decimal places). These figures are useful for estimating smaller workloads or individual API calls.
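The per-1K figures are the per-1M prices divided by 1,000. A minimal Python sketch of that conversion, using only the prices listed on this page (the helper name `per_1k` is illustrative, not part of any DeepSeek SDK):

```python
# Prices per 1M tokens, copied from this page.
PRICE_PER_1M = {"input": 0.14, "output": 0.28}


def per_1k(price_per_1m: float) -> float:
    """Convert a per-1M-token price to a per-1K-token price."""
    return price_per_1m / 1000


for kind, price in PRICE_PER_1M.items():
    # Five decimal places keep the figure from being rounded away.
    print(f"{kind}: ${per_1k(price):.5f} per 1K tokens")
```

Printing five decimal places matters here: at four, $0.00014 and $0.00028 round to $0.0001 and $0.0003.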

What is the estimated monthly cost for typical usage?

For a typical workload of 10M input + 2M output tokens per month, DeepSeek V4 Flash costs about $1.96 (10 × $0.14 + 2 × $0.28). Daily usage of 100K tokens (50K in, 50K out) costs about $0.0210.
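The monthly and daily estimates above can be reproduced with a few lines of Python. This is a sketch under the assumption that billing is linear in token count at the listed prices; the function name `estimate_cost` is hypothetical.

```python
# Prices per 1M tokens, copied from this page.
INPUT_PER_1M = 0.14
OUTPUT_PER_1M = 0.28


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost for uncached input plus output tokens."""
    total = input_tokens * INPUT_PER_1M + output_tokens * OUTPUT_PER_1M
    return total / 1_000_000


monthly = estimate_cost(10_000_000, 2_000_000)  # 10M in + 2M out
daily = estimate_cost(50_000, 50_000)           # 50K in + 50K out
print(f"monthly: ${monthly:.2f}")
print(f"daily:   ${daily:.4f}")
```

With the workloads quoted on this page, the script prints $1.96 for the month and $0.0210 for the day.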

Does DeepSeek V4 Flash offer a free tier?

Free tier availability is not tracked on this page. Some providers offer free credits for new users or limited free usage; check DeepSeek's official documentation at https://api-docs.deepseek.com/quick_start/pricing/ for current details.

How does prompt caching work to reduce costs?

When repeated context is served from the prompt cache (a cache hit), those input tokens are billed at $0.003 per 1M tokens instead of $0.14, roughly a 98% discount. Output remains $0.28 per 1M tokens. Caching pays off most for repeated prompts or system messages.
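To see how cache hits change a bill, the sketch below splits input tokens into cached and uncached portions. The prices come from this page; the 80% cache-hit share in the example is purely an assumption for illustration, and the function name is hypothetical. How the API reports cache hits is not covered here.

```python
# Prices per 1M tokens, copied from this page.
INPUT_PER_1M = 0.14
CACHED_INPUT_PER_1M = 0.003
OUTPUT_PER_1M = 0.28


def estimate_cost_with_cache(
    input_tokens: int, cached_tokens: int, output_tokens: int
) -> float:
    """Estimate USD cost when `cached_tokens` of the input are cache hits."""
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * INPUT_PER_1M
        + cached_tokens * CACHED_INPUT_PER_1M
        + output_tokens * OUTPUT_PER_1M
    )
    return total / 1_000_000


# Discount on cached input relative to uncached input.
discount = 1 - CACHED_INPUT_PER_1M / INPUT_PER_1M
print(f"cached-input discount: {discount:.1%}")

# Hypothetical month: 10M input tokens, 8M of them cache hits, 2M output.
with_cache = estimate_cost_with_cache(10_000_000, 8_000_000, 2_000_000)
without_cache = estimate_cost_with_cache(10_000_000, 0, 2_000_000)
print(f"with cache: ${with_cache:.2f}, without: ${without_cache:.2f}")
```

Under that assumed hit rate, the monthly estimate falls from $1.96 to about $0.86. Output tokens are unaffected, so output-heavy workloads save less.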

What is the context window size for DeepSeek V4 Flash?

DeepSeek V4 Flash supports a 1,000,000-token context window. This is the maximum combined length of your input prompt and output response in a single request.
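Because the prompt and the response share one window, a request can be checked against the limit before it is sent. The following is a rough sketch that assumes token counts are already known; the helper names are hypothetical, and any separate cap on output length is not covered on this page.

```python
# Context window size in tokens, copied from this page.
CONTEXT_WINDOW = 1_000_000


def remaining_output_budget(prompt_tokens: int) -> int:
    """Return how many tokens are left for the response after the prompt."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(0, CONTEXT_WINDOW - prompt_tokens)


def fits_context(prompt_tokens: int, output_tokens: int) -> bool:
    """Check whether prompt plus expected output stays within the window."""
    return prompt_tokens + output_tokens <= CONTEXT_WINDOW


print(remaining_output_budget(200_000))  # tokens left after a 200K prompt
print(fits_context(900_000, 200_000))    # 1.1M total exceeds the window
```

A 200K-token prompt leaves 800,000 tokens for the response, while a 900K prompt with a 200K response does not fit.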

How frequently is this pricing information updated?

All prices reference official DeepSeek documentation (https://api-docs.deepseek.com/quick_start/pricing/), last verified on 2026-05-15. We update pricing as soon as providers announce changes.

How can I calculate exact costs for my use case?

Use our free token calculator to estimate costs based on your specific usage pattern. The calculator supports all major models and shows costs in multiple currencies. You can also compare costs across different models to find the most economical option.