Curiosity · AI Model

Phi-3.5 Mini

Phi-3.5 Mini is Microsoft Research's August 2024 tiny open-weights LLM — a 3.8B dense transformer trained on a high-quality filtered and synthetic curriculum. MIT-licensed and small enough to run on a modern smartphone via ONNX Runtime, it's the default starting point for on-device intelligent features.

Model specs

Vendor
Microsoft
Family
Phi
Released
2024-08
Context window
128,000 tokens
Modalities
text
Input price
$0.03/M tok
Output price
$0.03/M tok
Pricing as of
2026-04-20

Strengths

  • Open weights under MIT license — fully permissive commercial use
  • 3.8B size — fits in ~2GB quantized, runs on phones
  • 128K context window — unusual for tiny models
  • Optimized for ONNX Runtime and DirectML on Windows

Limitations

  • Small size limits complex reasoning and long-form generation
  • Trails Gemma 2 9B and Llama 3.1 8B on many benchmarks
  • Limited multilingual coverage relative to mid-tier models
  • Best for narrow, well-specified tasks — not open-ended chat

Use cases

  • On-device assistants for Windows, iOS, Android via ONNX Runtime
  • Low-latency classification, routing, and intent detection
  • Edge IoT inference on Raspberry Pi and similar devices
  • Draft model for speculative decoding with larger targets

Benchmarks

BenchmarkScoreAs of
MMLU≈69%2024-08
GSM8K≈86%2024-08
HumanEval≈62%2024-08

Frequently asked questions

What is Phi-3.5 Mini?

Phi-3.5 Mini is Microsoft's 3.8B open-weights small language model released August 2024 under MIT license. Part of the Phi-3 family, it targets on-device AI with a heavy synthetic-data training curriculum.

Can Phi-3.5 Mini run on a phone?

Yes — quantized to INT4 it fits in ~2GB and runs on modern flagship phones via ONNX Runtime. That's the primary design target along with edge Windows devices.

Should I use Phi-3.5 Mini or Phi-4?

Phi-4 is much stronger on reasoning but 14B — it needs a real GPU. Phi-3.5 Mini is designed for phones and edge devices where 3.8B is the ceiling. Pick based on deployment target.

Sources

  1. Microsoft — Phi-3.5 family blog — accessed 2026-04-20
  2. Hugging Face — microsoft/Phi-3.5-mini-instruct — accessed 2026-04-20