Chinese Coding Models GLM 4.7 and M2.1 Step Up

Łukasz Grochal

Chinese companies are quietly turning into serious players in AI coding tools, with Zhipu’s GLM 4.7 and MiniMax’s M2.1 as two of the most interesting new models. GLM 4.7 is positioned as Zhipu’s flagship model, with strong upgrades in programming and multi-step reasoning. On public coding benchmarks it lands close to leading systems like Claude Sonnet 4.5, and it sets open-source records on tests such as SWE-bench Verified and LiveCodeBench.

MiniMax M2.1 goes in a slightly different direction. It focuses not just on code correctness but also on complex “agent-style” workflows, tool use and multi-language programming, while trying to keep responses shorter, faster and cheaper in tokens.

In everyday use, China’s AI landscape looks different from that of the US. ByteDance’s Doubao dominates as a mass-market assistant with well over 100 million monthly users, ahead of Tencent’s Yuanbao and DeepSeek, while Ant Group pushes its A Fu health app and Alibaba pushes Qwen deeper into consumer services.

Overall, Chinese coding models are now competitive with strong Western mid- to high-tier models on many benchmarks, but the very top of the charts still mostly belongs to US systems.
