Edit Images with Text Precision: Qwen-Image-Edit Released

Łukasz Grochal

Qwen-Image-Edit is a powerful new image-editing model released by Alibaba’s Qwen Team in August 2025. Built on a 20B-parameter Qwen-Image foundation, it brings advanced capabilities in both semantic and appearance editing thanks to a dual-encoding architecture: one branch (Qwen2.5-VL) controls high-level visual semantics, while a VAE encoder handles low-level appearance details.

The model excels at precise bilingual text editing (English and Chinese), preserving font, size, and style when adding, modifying, or deleting text in images. It supports high-fidelity semantic transformations like style transfer, object rotation (including novel view synthesis), IP generation, as well as localized appearance tweaks such as removing fine details or changing colors without affecting surrounding areas.

Qwen-Image-Edit is accessible via platforms like Qwen Chat, Hugging Face, ModelScope, and Alibaba Cloud’s API, where developers can integrate image editing into their applications with straightforward HTTP or SDK calls. Its open-source Apache 2.0 license ensures enterprise-grade adoption and potential cost savings compared to proprietary tools.

References(2)
Sources
Stitch (stitch.withgoogle.com) experimental Google Labs tool

Google Stitch: From simple prompt to working app UI

Yann LeCun’s AMI vision for physically grounded AI

Yann LeCun’s AMI Lab Pioneers Physical‑World AI

Project Maven Dashboards Visualizing Targets and Risks

Claude, Palantir and Who Controls AI in Modern War

OpenSandbox Logo

OpenSandbox: A Unified Sandbox Layer For AI Agents

Qwen Beats gpt-oss-120B with Laptop Power

Alibaba's Tiny Qwen Beats Big OpenAI Model

QuitChatGPT – Street Art Mural

Is it time to quit ChatGPT? Inside the QuitGPT revolt

OpenAI ChatGPT 5.4

GPT 5.4: Native Computer Use Meets Finance Workflows

Google’s Nano Banana 2: Fast, Pro‑Level AI Image Generation

Nano Banana 2 Delivers Pro‑Grade Images at Flash Speed

Cloud AI agents orchestrating workflows in a browser UI

How Perplexity Computer Orchestrates 19 Models For You

AI Distillation Attack: Anthropic vs DeepSeek Claude Theft Illustration

Claude Distillation Drama: Anthropic vs Chinese AI Labs