The 2026 Chinese LLM Price War: Top 5 Frontier API Costs Compared
Chinese labs cut LLM API prices repeatedly in the first half of 2026, and some of those cuts were declared permanent. DeepSeek V4-Pro is now the cheapest option at $0.87 per million output tokens, while Xiaomi's MiMo V2.5 has flattened its long-context tier to $3 output. The labs compete on different pricing dimensions (output price, long-context tiers, cache-hit rates), so the cheapest choice depends on the workload.
- Chinese labs cut LLM API prices six times in the first half of 2026, with three cuts made permanent.
- DeepSeek V4-Pro is priced at $0.87 per million output tokens, making it the cheapest option available.
- Xiaomi's MiMo V2.5 offers a flat rate of $3 for long-context tasks, while Alibaba's Qwen3 Max is priced at $3.90.
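To make the gap between these output prices concrete, here is a minimal Python sketch that estimates output-only spend for a given token volume. It assumes all three figures are USD per million output tokens (the summary states that unit explicitly only for DeepSeek V4-Pro), and it ignores input and cache pricing, which the summary does not list for these models. The names `OUTPUT_PRICE_PER_MTOK`, `output_cost_usd`, and `rank_by_output_cost` are illustrative, not part of any vendor SDK.

```python
# Output prices as quoted in the summary above.
# Assumption: every figure is USD per 1,000,000 output tokens.
OUTPUT_PRICE_PER_MTOK = {
    "DeepSeek V4-Pro": 0.87,
    "Xiaomi MiMo V2.5 (long-context)": 3.00,
    "Alibaba Qwen3 Max": 3.90,
}


def output_cost_usd(model: str, output_tokens: int) -> float:
    """Estimated output-only cost in USD for one model."""
    return OUTPUT_PRICE_PER_MTOK[model] * output_tokens / 1_000_000


def rank_by_output_cost(output_tokens: int) -> list[tuple[str, float]]:
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = {
        model: output_cost_usd(model, output_tokens)
        for model in OUTPUT_PRICE_PER_MTOK
    }
    return sorted(costs.items(), key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical workload: 50 million output tokens per month.
    for model, cost in rank_by_output_cost(50_000_000):
        print(f"{model:<34} ${cost:,.2f}")
```

For the hypothetical 50-million-token workload, the estimate comes to roughly $43.50 on DeepSeek V4-Pro versus $195.00 on Qwen3 Max. A real bill would also include input tokens and any cache discounts.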
Opening excerpt (first ~120 words):
Hassann • Posted on May 27 • Originally published at apidog.com

Chinese labs cut LLM API prices six times in the first half of 2026, and three of those cuts were declared permanent. DeepSeek V4-Pro now costs $0.87 per million output tokens. Xiaomi MiMo V2.5 flattened its long-context tier to $3 output. Alibaba’s Qwen3 Max ships at $3.90. Moonshot’s Kimi K2.6 holds the cache-hit floor at $0.07.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at DEV.to.