Qwen 2.5 72B Instruct

Qwen 2.5 72B Instruct is Alibaba Cloud's September 2024 open-weights flagship: a 72B-parameter dense transformer that matched Llama 3.1 405B on several benchmarks at roughly one-sixth the parameter count. Released under the Qwen license (the 72B size is not Apache 2.0, unlike most of the Qwen 2.5 family) and with broad multilingual coverage, it established the Qwen line's credibility in Western open-source benchmarking.

Model specs

Vendor: Alibaba
Family: Qwen 2.5
Released: 2024-09
Context window: 131,072 tokens
Modalities: text
Input price: $0.35/M tokens
Output price: $0.40/M tokens
Pricing as of: 2026-04-20
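
The listed prices translate into a per-request cost with simple arithmetic. The following is a minimal sketch that assumes billing is linear in token count at the rates above; the function name `estimate_cost` and the sample workload are hypothetical, and real providers may round or bill differently.

```python
from decimal import Decimal

# Listed prices from the spec table (USD per million tokens, as of 2026-04-20).
INPUT_PRICE_PER_M = Decimal("0.35")
OUTPUT_PRICE_PER_M = Decimal("0.40")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request at the listed prices.

    Assumes purely linear per-token billing (an assumption, not a
    statement about any specific provider's invoicing rules).
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical workload: 100,000 prompt tokens and 2,000 completion tokens.
    print(estimate_cost(100_000, 2_000))
```

Decimal arithmetic is used here so that small per-request amounts do not pick up binary floating-point error when summed over many requests.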

Strengths

  • Open weights under the Qwen license, which permits commercial use with conditions (not Apache 2.0, unlike most other Qwen 2.5 sizes)
  • Matches or beats Llama 3.1 70B on most public benchmarks
  • 128K context with strong needle-in-haystack retrieval
  • Best-in-class Chinese, Japanese, Korean language quality

Limitations

  • Superseded by Qwen 3 dense and MoE variants in 2025
  • Trails DeepSeek V3 and Llama 4 Maverick on top-end reasoning
  • Enterprise adoption in US / EU slowed by geopolitical caution
  • Tool-use and agentic fine-tuning weaker than closed frontier

Use cases

  • Multilingual production assistants and chatbots
  • Base model for fine-tuning, with dozens of community derivatives
  • RAG pipelines with 128K context
  • Research baselines for open-weights mid-flagship comparisons
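
For the RAG use case, retrieved passages have to fit inside the 131,072-token window alongside the prompt and the space reserved for the answer. The sketch below shows one way to pack ranked chunks into that budget. The helper names `rough_token_count` and `pack_chunks` are hypothetical, and the 4-characters-per-token estimate is a crude assumption that stands in for the model's real tokenizer.

```python
from typing import Callable, List, Sequence, Tuple

CONTEXT_WINDOW = 131_072  # tokens, from the spec table


def rough_token_count(text: str) -> int:
    """Crude estimate of ~4 characters per token.

    This is an assumption for illustration only; use the model's actual
    tokenizer for production budgeting.
    """
    return max(1, (len(text) + 3) // 4)


def pack_chunks(
    chunks: Sequence[str],
    prompt_tokens: int,
    reserved_output_tokens: int,
    count_tokens: Callable[[str], int] = rough_token_count,
    window: int = CONTEXT_WINDOW,
) -> Tuple[List[str], int]:
    """Select chunks in ranked order until the context budget is used up.

    Returns the selected chunks and the number of tokens they consume.
    Chunks are assumed to be sorted by relevance, best first, so packing
    stops at the first chunk that does not fit.
    """
    budget = window - prompt_tokens - reserved_output_tokens
    if budget < 0:
        raise ValueError("prompt and reserved output exceed the context window")
    selected: List[str] = []
    used = 0
    for chunk in chunks:
        cost = count_tokens(chunk)
        if used + cost > budget:
            break
        selected.append(chunk)
        used += cost
    return selected, used
```

Stopping at the first chunk that does not fit keeps the selection in strict relevance order; a variant that skips oversized chunks and keeps scanning would use the window more fully at the cost of that ordering guarantee.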

Benchmarks

Benchmark    Score    As of
MMLU         ≈85%     2024-09
HumanEval    ≈86%     2024-09
MATH         ≈83%     2024-09

Frequently asked questions

Is Qwen 2.5 72B still worth using?

Yes, as a stable, well-supported open-weights flagship from 2024, especially for multilingual workloads or self-hosted deployments where the Qwen license terms are acceptable. For bleeding-edge quality, look at Qwen 3 or DeepSeek V3.

How does Qwen 2.5 72B compare to Llama 3.1 70B?

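Qwen 2.5 72B typically beats Llama 3.1 70B on math and Chinese-language benchmarks and matches it on English reasoning. Both ship under custom licenses with commercial-use conditions: the 72B model uses the Qwen license, not the Apache 2.0 license that covers most smaller Qwen 2.5 sizes, while Llama 3.1 uses the Llama 3.1 Community License.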

What sizes does the Qwen 2.5 family cover?

Qwen 2.5 comes in 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B dense variants, plus specialized Qwen 2.5 Coder and Qwen 2.5 Math models.

Sources

  1. Qwen — Qwen 2.5 announcement — accessed 2026-04-20
  2. Hugging Face — Qwen/Qwen2.5-72B-Instruct — accessed 2026-04-20