TheSequence

TheSequence

The Sequence AI of the Week #717: First Trillion Among the Majors: Qwen-Max

One of the most impressive frontier models of the new wave.

Sep 10, 2025
∙ Paid
Created Using GPT-5

Alibaba’s newest release Qwen3-Max is one of the most impressive frontier AI models ever created boosting 1 trillion parameters! Altough we are still learning about the details behind Qwen3-Max, I wanted to share some initial findings and impressions.

Qwen3‑Max sits at the intersection of three trends: very large‑scale pretraining, sparse (Mixture‑of‑Experts) computation for throughput and cost, and aggressive post‑training that pushes reasoning, coding, and long‑context behavior. It’s delivered as a cloud model with an OpenAI‑compatible API and aims squarely at the front of the pack—reporting strong results on capability suites like Arena‑Hard, LiveBench, LiveCodeBench, and GPQA‑Diamond, and offering a “Max” endpoint you can hit with the usual chat‑completion semantics.

This post is for paid subscribers

Already a paid subscriber? Sign in
© 2025 Jesus Rodriguez
Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture