my-server
← Wiki

Qwen

Qwen (also known as Tongyi Qianwen, ; pinyin: Tōngyì Qiānwèn) is a family of large language models developed by Alibaba Cloud. Many Qwen variants are distributed as open‑weight models under the Apache‑2.0 license, while others are served through Alibaba Cloud.

In July 2024, South China Morning Post reported that benchmarking platform SuperCLUE ranked Qwen2‑72B‑Instruct behind OpenAI's GPT‑4o and Anthropic’s Claude 3.5 Sonnet and ahead of other Chinese models.

Models

Alibaba launched a beta of Qwen in April 2023 under the name Tongyi Qianwen, then opened it for public use in September 2023 after regulatory clearance.

The model's architecture was based on the Llama architecture developed by Meta AI. In December 2023, it released its 72B and 1.8B models for download, while Qwen 7B weights were released in August. Their models are sometimes described as open source, but the training code has not been released nor has the training data been documented, and they do not meet the terms of either the Open Source AI Definition or the Model Openness Framework from the Linux Foundation.

Qwen2 was released in June 2024, and in September it released some of its models with open weights, while keeping its most advanced models proprietary. Qwen2 contains both dense and sparse models.

In November 2024, QwQ-32B-Preview, a model focusing on reasoning similar to OpenAI's o1, was released under the Apache 2.0 License, although only the weights were released, not the dataset or training method. QwQ has a 32K token context length and performs better than o1 on some benchmarks. It was also in November 2024 that the Accio application was launched. Accio is an AI native application that is built upon Qwen and is used to generate market insights and answer sourcing questions for Alibaba's business to business e-commerce site. The tool is able to automate labor intensive tasks like data collection and trend tracking.

The Qwen-VL series is a line of visual language models that combines a vision transformer with an LLM. Alibaba released Qwen2-VL with variants of 2 billion and 7 billion parameters.

In January 2025, Qwen2.5-VL was released with variants of 3, 7, 32, and 72 billion parameters. All models except the 72B variant are licensed under the Apache 2.0 license. Qwen-VL-Max is Alibaba's flagship vision model as of 2024, and is sold by Alibaba Cloud at a cost of US$0.41 per million input tokens.

Alibaba has released several other model types such as Qwen-Audio and Qwen2-Math. In total, it has released more than 100 open weight models, with its models having been downloaded more than 40 million times. Fine-tuned versions of Qwen have been developed by enthusiasts, such as "Liberated Qwen", developed by San Francisco-based Abacus AI, which is a version that responds to any user request without content restrictions.

On January 29, 2025, Alibaba launched Qwen2.5-Max.

On March 24, 2025, Alibaba launched Qwen2.5-VL-32B-Instruct as a successor to the Qwen2.5-VL model. It was released under the Apache 2.0 license.

On March 26, 2025, Qwen2.5-Omni-7B was released under the Apache 2.0 license and made available through chat.qwen.ai, as well as platforms like Hugging Face, GitHub, and ModelScope. The Qwen2.5-Omni model accepts text, images, videos, and audio as input and can generate both text and audio as output, allowing it to be used for real-time voice chatting, similar to OpenAI's GPT-4o.

On April 28, 2025, the Qwen3 model family was released, with all models licensed under the Apache 2.0 license. The Qwen3 model family includes both dense (0.6B, 1.7B, 4B, 8B, 14B, and 32B parameters) and sparse models (30B with 3B activated parameters, 235B with 22B activated parameters). They were trained on 36 trillion tokens in 119 languages and dialects.

On September 5, 2025, Alibaba launched Qwen3-Max.

On September 10, 2025, Qwen3-Next was released under the Apache 2.0 license and made available through chat.qwen.ai, as well as platforms like Hugging Face and Model Scope.

On September 22, 2025, Qwen3-Omni was release under the Apache 2.0 license and made available through chat.qwen.ai, as well as platforms like Hugging Face and Model Scope. Qwen3-Omni is a mixed/multimodal model that can generate text, images, audio, and video.

On 27 January 2026, Qwen3-Max-Thinking was released. The model can generate text, pictures, or video.The Qwen-3.5 model was released on 17 February 2026.

On February 16, 2026, Qwen3.5 and Qwen3.5-Plus were released. Qwen3.5 is open-weights. Several Qwen executives resigned in early 2026, including Lin ⁠Junyang, who led develop of Qwen3-Max and Qwen3.5. Amid concern that this could mean a shift away from research and open-source artificial intelligence, Alibaba said it will continue its focus on open source.

See also

References

External links