NextStep-1 New Standard in Autoregressive Image Generation

Łukasz Grochal

NextStep-1 is a large-scale text-to-image system built around a 14 billion-parameter autoregressive Transformer combined with a lightweight 157 million-parameter flow-matching head. Instead of relying on diffusion pipelines or lossy quantization, it directly models continuous visual tokens patch by patch, preserving fine detail and improving compositional consistency.

The design unifies language and vision tokens under the same prediction objective, creating a streamlined generation process that avoids the complexity of multi-stage models. Extensive evaluation across benchmarks such as Wise, GenAI-Bench, DPG-Bench, and OneIG Bench shows strong performance in world knowledge understanding and high-fidelity synthesis.

An additional version, NextStep-1-Edit, is fine-tuned for editing tasks and achieves competitive scores on GEdit-Bench and ImgEdit-Bench. The project is open-sourced, with code and models freely available for research and development.

References
3 sources
01
stepfun.aiStepFun
02
huggingface.coHugging Face
03
github.comGitHub
Publishers Are Shutting Out Internet Archive

News Giants Block Wayback Machine Over AI Fears

Claude Design Launch: Brand-Aware AI Prototyping Image

Anthropic Launches Claude Design to Rival Figma Tools

Qwen3.6 Coding Agent Benchmarks Chart Visual

Exploring Qwen3.6: Coding Benchmarks and Speed

Palantier Dilemma Human Rights vs Sercurity

Europe's Palantir Boom Amid Sovereignty and Rights Fears

Project Glasswing: Anthropic Mythos Zero-Day Exploit Finder Art

Claude Mythos Leak Ignites Fears of Unstoppable AI Exploits

OpenRouter LLM Leaderboard April

Chinese AI Models Dominate OpenRouter Top Six in Token Usage

Claude Code’s Big npm Leak

Inside the Claude Code Leak and Anthropic’s Agent Design

China AI accelerator card shipments vs NVIDIA 2025 chart

NVIDIA’s AI Chip Share in China Drops from 95% to 55%

TurboQuant KV Cache Compression Visualization

Google’s TurboQuant makes AI caches smaller and faster

Black Forest Labs FLUX.2 klein

FLUX.2 klein 9B-KV Explained: Speed, Quality, GPUs

NextStep-1 New Standard in Autoregressive Image Generation | LucasGraphic