Kimi K2: China’s Trillion-Param AI Model

Łukasz Grochal

Chinese startup Moonshot AI, backed by Alibaba, has launched Kimi K2, the world’s first open-source trillion-parameter LLM (with 32 billion active at inference). Based on a Mixture-of-Experts architecture and trained on 15.5 trillion tokens using the novel MuonClip optimizer, Kimi K2 achieves impressive stability and scaling. It surpasses GPT‑4.1 on programming benchmarks, scoring 65.8% on SWE-bench Verified and 53.7% on LiveCodeBench, close to Anthropic’s Claude Sonnet 4.

Designed for tool use and autonomous agents, it also posts a strong 76.5% on AceBench. Available under MIT license (with attribution for commercial use), it can run locally or via API compatible with OpenAI’s. Moonshot’s aggressive pricing — around $0.15 per million input tokens and $2.50 per million output — puts competitive pressure on closed models. This milestone could reshape the global AI race, positioning Chinese firms as serious challengers to OpenAI and Anthropic.

References
3 sources
01
moonshotai.github.ioMoonshot AI
02
reuters.comReuters
03
venturebeat.comVentureBeat
DeepSeek V4‑Pro 1.6T‑Parameter AI Model Architecture

DeepSeek V4: 1M‑Token Context and Budget Frontier AI Models

Palantir Manifesto Graphic: AI Defense and Culture Clash

Palantir Manifesto Hits at Regressive Cultures and AI Shift

OpenAI ChatGPT Images 2.0 feature overview

OpenAI Updates ChatGPT Images With Better Text

Publishers Are Shutting Out Internet Archive

News Giants Block Wayback Machine Over AI Fears

Claude Design Launch: Brand-Aware AI Prototyping Image

Anthropic Launches Claude Design to Rival Figma Tools

Qwen3.6 Coding Agent Benchmarks Chart Visual

Exploring Qwen3.6: Coding Benchmarks and Speed

Palantier Dilemma Human Rights vs Sercurity

Europe's Palantir Boom Amid Sovereignty and Rights Fears

Project Glasswing: Anthropic Mythos Zero-Day Exploit Finder Art

Claude Mythos Leak Ignites Fears of Unstoppable AI Exploits

OpenRouter LLM Leaderboard April

Chinese AI Models Dominate OpenRouter Top Six in Token Usage

Claude Code’s Big npm Leak

Inside the Claude Code Leak and Anthropic’s Agent Design

Kimi K2: China’s Trillion-Param AI Model | LucasGraphic