DeepSeek-V3 Unveiled: 236B Parameters and 128K Context for Free

Łukasz Grochal

The DeepSeek-V3 represents a significant evolutionary leap from its predecessor, the DeepSeek-V2. The most notable upgrade is the substantial expansion in scale, with the model's parameters increasing dramatically to a massive 236 billion. This enhanced architecture directly contributes to its superior performance across a wide array of benchmarks, including coding, mathematics, and general reasoning tasks. A key improvement is the extension of its context window, which now supports up to 128,000 tokens.

This allows the model to process and comprehend extremely long documents, complex technical papers, or maintain context over extended conversations far more effectively than previous versions. Furthermore, while retaining its strong text-based capabilities, the V3 iteration is architected for advanced multimodal understanding, laying the groundwork for future functionalities.

Perhaps the most impactful change is its accessibility strategy; unlike many competitors, this more powerful model is being offered to the public for free, marking a bold move to democratize access to cutting-edge AI.

References
3 sources
01
deepseek.comDeepSeek
02
build.nvidia.comNvidia
03
huggingface.coHugging Face
DeepSeek V4‑Pro 1.6T‑Parameter AI Model Architecture

DeepSeek V4: 1M‑Token Context and Budget Frontier AI Models

Palantir Manifesto Graphic: AI Defense and Culture Clash

Palantir Manifesto Hits at Regressive Cultures and AI Shift

OpenAI ChatGPT Images 2.0 feature overview

OpenAI Updates ChatGPT Images With Better Text

Publishers Are Shutting Out Internet Archive

News Giants Block Wayback Machine Over AI Fears

Claude Design Launch: Brand-Aware AI Prototyping Image

Anthropic Launches Claude Design to Rival Figma Tools

Qwen3.6 Coding Agent Benchmarks Chart Visual

Exploring Qwen3.6: Coding Benchmarks and Speed

Palantier Dilemma Human Rights vs Sercurity

Europe's Palantir Boom Amid Sovereignty and Rights Fears

Project Glasswing: Anthropic Mythos Zero-Day Exploit Finder Art

Claude Mythos Leak Ignites Fears of Unstoppable AI Exploits

OpenRouter LLM Leaderboard April

Chinese AI Models Dominate OpenRouter Top Six in Token Usage

Claude Code’s Big npm Leak

Inside the Claude Code Leak and Anthropic’s Agent Design