TAG

— BROWSING TAG —

#ALIBABA

POSTS

25/04/2625/04/26

DeepSeek V4‑Pro 1.6T‑Parameter AI Model Architecture

DeepSeek V4: 1M‑Token Context and Budget Frontier AI Models

DeepSeek V4 launches with 1M‑token context, MoE‑based Pro and Flash models, and prices far below U.S. rivals, focusing on code and agent workloads.

DeepSeek Alibaba Open Source

18/04/26

Qwen3.6 Coding Agent Benchmarks Chart Visual

Exploring Qwen3.6: Coding Benchmarks and Speed

Qwen3.6-35B-A3B: Open MoE LLM with 35B params (3B active), tops SWE-bench at 73.4%. Fast local runs at 170 t/s, strong agents vs Gemma4.

Qwen Alibaba Open Source

08/04/26

Chinese AI Models Dominate OpenRouter Top Six in Token Usage

Chinese LLMs took top 6 on OpenRouter with 13T tokens vs US 3T; Qwen3.6 Plus led at 4.6T. Five weeks straight ahead.

Claude Alibaba Qwen+3

06/03/2606/03/26

OpenSandbox: A Unified Sandbox Layer For AI Agents

Alibaba’s OpenSandbox offers a unified, secure API so AI agents can run code, browse and train in isolated Docker or Kubernetes sandboxes.

Alibaba Open Source

06/03/2606/03/26

Qwen Beats gpt-oss-120B with Laptop Power

Alibaba's Tiny Qwen Beats Big OpenAI Model

Alibaba's Qwen3.5-9B tops OpenAI's gpt-oss-120B on GPQA, Video-MME benchmarks. Runs on laptops, multimodal edge AI star.

Alibaba Qwen Open Source

28/02/26

Qwen3.5 Medium Matches Sonnet 4.5 on Local Hardware

Qwen3.5-27B/35B open-source, runs local on 18-24GB VRAM. Benchmarks rival Claude Sonnet 4.5, 1M ctx, tool-native.

Qwen Alibaba Open Source

27/02/2627/02/26

DeepSeek V4 vs Claude/GPT: Strengths, Controversies Ahead

DeepSeek V4 eyes early March drop, topping Claude/GPT in code benchmarks with 1M context, but Blackwell chip scandal brews US ire.

Alibaba DeepSeek

16/02/2616/02/26

Cinematic AI video frame generated by Seedance 2.0

Seedance 2.0: China’s AI video shocks Hollywood

Seedance 2.0 shows China’s AI video can rival OpenAI and Google, mixing realism with serious ethical and legal questions.

ByteDance Alibaba

16/02/2616/02/26

Qwen3.5‑397B‑A17B model card on ModelScope.

Qwen 3.5: Faster, Cheaper And More Multimodal AI

Alibaba’s Qwen 3.5 boosts multimodal and agent skills, narrowing the gap with US AI leaders while staying efficient through a sparse MoE design.

Qwen Alibaba

10/02/26

2K Qwen Image 2.0 slide layout with crisp English text

Inside Qwen Image 2.0: Slides, Posters And Photorealism

Qwen Image 2.0 unifies generation and editing with sharper 2K visuals, better humans and typography that finally makes text‑heavy designs workable.

Qwen Alibaba

28/01/26

Qwen3 Max Thinking AI agent visualized with Qwen bear

Qwen3 Max Thinking: New Contender in Deep Reasoning

Alibaba’s reasoning model pairs test‑time scaling with web search and code tools, topping Gemini 3 Pro and GPT‑5.2 on HLE search and math benchmarks.

Alibaba Qwen

26/01/26

Why US platforms are turning to Chinese AI models

Chinese open source AI like DeepSeek and Qwen is winning US clients by mixing solid performance with flexibility and lower operating costs.

Alibaba Qwen DeepSeek

05/01/26

DeepSeek mHC: Smarter Scaling For Giant AI Models

How DeepSeek Trains Powerful Models On A Budget

The mHC idea lets DeepSeek train 3–27B models more efficiently, supporting its push to rival US labs while spending only a fraction on hardware.

Alibaba DeepSeek

02/01/26

Qwen-Image-2512 vs Nano Banana Pro AI generator visual

Building On Qwen-Image-2512 Instead Of Closed Image AI

Qwen-Image-2512 offers open source, Apache licensed image generation that challenges Google’s Nano Banana Pro for flexible, enterprise ready workflows...

Alibaba Qwen

30/12/25

Zhipu’s GLM 4.7 and MiniMax M2.1 boost coding and agent skills, as Doubao and Yuanbao

Chinese Coding Models GLM 4.7 and M2.1 Step Up

Chinese coding models now sit close to Claude tier performance, while Doubao and Yuanbao prove domestic AI platforms can win huge mainstream audiences...

Anthropic Alibaba DeepSeek+1

24/12/25

Qwen and DeepSeek vs closed US AI platforms

How Open Source AI Is Quietly Deflating the Hype Today

As enterprises discover capable open models they can run themselves, the pricing power of closed US platforms starts to look increasingly fragile.

Alibaba Qwen Meta+1

22/12/25

From Raster to Layers: What Qwen-Image-Layered Does

Layered AI Images: Inside Qwen-Image-Layered Editing

Qwen-Image-Layered splits flat images into editable RGBA layers, enabling consistent, Photoshop-like edits with precise control over each element.

Qwen Alibaba

28/11/25

Alibaba’s Z-Image-Turbo excels in photorealistic text image AI

Z-Image-Turbo: Alibaba’s 6B param text-to-image powerhouse

Z-Image-Turbo is a 6B param model by Alibaba for photorealistic, fast text-to-image generation with English and Chinese text support.

Alibaba

21/09/25

DeepSeek R1: $294K training cost, ultra‑low inference rates

DeepSeek’s R1 cost ~$294K to train with 512 H800 GPUs and 80h. Inference costs are ~$0.55 input / $2.19 output per million tokens.

DeepSeek Alibaba

01/09/25

DeepSeek-V3 Unveiled: 236B Parameters and 128K Context for Free

DeepSeek-V3 majorly upgrades from V2 with a massive 236B parameter count, a vastly larger 128K context window, and superior performance, all offered f...

Alibaba DeepSeek

20 / 26