Infrastructure Hub

Hub

AI system architecture and optimization

Explore Infrastructure Hub Hub
AI Pricing

AI API Pricing May 2026: GPT-5.5, Claude Opus 4.7, Gemini 3.1, Grok 4.3, DeepSeek V4

A May 2026 AI API pricing update covering GPT-5.5, Claude Opus 4.7, Gemini 3.1, Grok 4.3, DeepSeek V4, Qwen3.6 Plus, and Kimi K2.6.
Qing Ke Ai
3 min read
#AI API pricing#GPT-5.5 pricing#Claude Opus 4.7#Gemini 3.1 Pro#Grok 4.3#DeepSeek V4

AI API Pricing May 2026

This May 2026 refresh updates LangCopilot's pricing catalog around the models teams are most likely to search for now: OpenAI GPT-5.5, Anthropic Claude Opus 4.7, Google Gemini 3.1, xAI Grok 4.3 and Grok 4.20, DeepSeek V4, Qwen3.6 Plus, and Kimi K2.6.

The main change is not just "more models." The pricing surface is now more complex: long-context thresholds, batch/flex/priority modes, cache-hit rates, and temporary discounts can change the winner for the same workload.

Quick price table

ModelStatusContextInput / 1MCached input / 1MOutput / 1M
GPT-5.5active1M$5.00$0.50$30.00
Claude Opus 4.7active1M$5.00$0.50$25.00
Gemini 3.1 Pro Previewpreview1M2.00<=200K,2.00 <=200K, 4.00 >200K0.20<=200K,0.20 <=200K, 0.40 >200K12.00<=200K,12.00 <=200K, 18.00 >200K
Grok 4.3active1M$1.25$0.20$2.50
DeepSeek V4 Proactive1M$0.435$0.003625$0.87
Qwen3.6 Plusactive1M0.50<=256K,0.50 <=256K, 2.00 >256Knot listed3.00<=256K,3.00 <=256K, 6.00 >256K
Kimi K2.6active256K$0.95$0.16$4.00

Official sources: OpenAI pricing, Anthropic pricing, Gemini API pricing, xAI pricing, DeepSeek pricing, Qwen Cloud pricing, and Kimi API Platform.

What changed

  • OpenAI GPT-5.5 and GPT-5.4 pricing now needs short-context, long-context, batch, flex, and priority treatment.
  • Anthropic Claude Opus 4.7 adds a new premium Anthropic route with 1M context and optional fast mode.
  • Gemini 3.1 Pro Preview has a clear 200K prompt threshold, so calculator math must switch rates above that point.
  • xAI's current search intent should move toward Grok 4.3 and Grok 4.20 instead of old Grok 4 pages.
  • DeepSeek V4 Pro is temporarily discounted through 2026-05-31 15:59 UTC, making it a strong cost comparison candidate.
  • Qwen3.6 Plus requires tiered prompt pricing rather than one flat input/output rate.

Recommended calculator routes

Start from the pricing hub, then open the model-specific calculators:

SEO update note

The safest SEO move is to stop indexing every possible pairwise comparison. Keep the high-intent compare pages, noindex retired model routes, and connect the new pricing data to editorial pages like this one. That gives crawlers a clearer reason to index the pages that matter.

Further Reading

Explore More in Infrastructure Hub

This article is part of our Infrastructure series. Discover more insights and practical guides.

Visit Infrastructure Hub

About This Article

Topic: AI Pricing
Difficulty: Intermediate
Reading Time: 3 minutes
Last Updated: May 15, 2026

This article is part of our comprehensive guide to Large Language Models and AI technologies. Stay updated with the latest developments in the AI field.

All Articles
Share this article to spread LLM knowledge