#1Four Chinese Labs Release Frontier Coding Models in 12-Day Blitz
Z.ai's GLM-5.1, MiniMax M2.7, Moonshot's Kimi K2.6, and DeepSeek V4 all landed at roughly the same capability ceiling on agentic engineering benchmarks—at one-third or less the inference cost of Western frontier models. DeepSeek V4 Pro leads Chinese models on NIST's aggregate benchmark but still trails the leading US frontier by approximately eight months.