Alibaba introduces Qwen3, a family of 'hybrid' AI reasoning models
Alibaba introduces competitive Qwen3 AI models, blending efficiency with advanced reasoning.

Alibaba has introduced Qwen3, a family of AI models they describe as 'hybrid,' indicating their ability to perform both complex reasoning over time and simpler tasks quickly. The models range from 0.6 billion to a remarkable 235 billion parameters and, according to Alibaba, are competitive with top models from rivals like Google and OpenAI. Most of these models will be released under an open license, available via platforms such as Hugging Face and GitHub for researchers and developers.
Qwen3 supports an impressive 119 languages and is trained on a gargantuan dataset covering nearly 36 trillion tokens. To put this into perspective, a million tokens equate to around 750,000 words. The diversity of the dataset spans textbooks, coding snippets, and question-answer pairs, offering a rich foundation for comprehensive AI capabilities. Moreover, Alibaba asserts significant upgrades over its predecessor, Qwen2, in terms of speed and problem-solving prowess.
The new Qwen3 models incorporate a mixture of experts (MoE) architecture, amplifying computational efficiency by delegating tasks to specialized 'expert' models. This architectural approach allows handling large-scale data without overwhelming computation resources. Alibaba highlights that Qwen3 models outperform in several benchmarks, notably edging out OpenAI's o3-mini and Google's Gemini 2.5 Pro in evaluations like AIME, a mathematical reasoning benchmark, and other reasoning assessments like BFCL.
While the largest model, Qwen-3-235B-A22B, demonstrates exceptional performance, it is not yet public. However, the most substantial publicly available model, Qwen3-32B, still competes robustly against both proprietary and open AI models, even surpassing OpenAI's o1 on several benchmarks, including LiveCodeBench for coding tasks. In addition, Alibaba details plans to offer these models through cloud services like Fireworks AI and Hyperbolic, highlighting the practical applications of Qwen3 in real-world AI deployments.
Industry perspectives, such as those from Tuhin Srivastava of Baseten, underscore the strategic significance of open models like Qwen3 in keeping pace with closed-source AI systems. Despite U.S. restrictions on technology trades with China, models such as Qwen3 are poised to thrive domestically, pointing to a broader trend of hybrid AI adoption, integrating both bespoke and commercially available systems.
Sources: TechCrunch, X