Curiosity · AI Model

Mistral Small 3

Mistral Small 3 is Mistral AI's January 2025 efficiency-tier open-weights model — 24B parameters under a pure Apache 2.0 license with no community-license caveats. It targets low-latency applications where you want strong quality per dollar and full commercial freedom without Meta-style revenue thresholds.

Model specs

Vendor
Mistral AI
Family
Mistral Small
Released
2025-01
Context window
32,768 tokens
Modalities
text
Input price
$0.1/M tok
Output price
$0.3/M tok
Pricing as of
2026-04-20

Strengths

  • Apache 2.0 license — no revenue cap or community-license caveats
  • 24B dense — fits on a single H100 at BF16, laptop-class at 4-bit
  • Competitive with Llama 3.3 70B on several benchmarks at a third the size
  • Fast inference — 150+ tokens/sec on a single H100

Limitations

  • 32K context window — short compared to Llama 3.1/3.3 at 128K
  • Text-only — see Pixtral for vision and Mistral Large for flagship
  • Trails DeepSeek V3 and Llama 4 Maverick on top-end reasoning
  • Smaller community than Llama — fewer off-the-shelf fine-tunes

Use cases

  • Low-latency chat and voice-assistant backends
  • High-throughput batch pipelines where cost dominates
  • On-prem deployments needing a permissive license
  • Fine-tuning on proprietary data without license friction

Benchmarks

BenchmarkScoreAs of
MMLU≈81%2025-01
HumanEval≈84%2025-01

Frequently asked questions

What makes Mistral Small 3 different from Llama 3.3 70B?

Size and license. Mistral Small 3 is 24B under Apache 2.0 with no community-license caveats, so it's fully permissive. Llama 3.3 70B is larger and stronger but ships under a conditional Meta license.

Is Mistral Small 3 good for production?

Yes — it was explicitly designed as a production model. Latency is its selling point: under 100ms first-token on a single GPU, with competitive instruction-following quality.

Where do I get Mistral Small 3?

Weights are on Hugging Face under mistralai/Mistral-Small-24B-Instruct-2501. Hosted inference is available via Mistral's own API, Together, Fireworks, and OpenRouter.

Sources

  1. Mistral AI — Mistral Small 3 announcement — accessed 2026-04-20
  2. Hugging Face — mistralai/Mistral-Small-24B-Instruct — accessed 2026-04-20