Curiosity · AI Model

Claude Sonnet 4.6

Claude Sonnet 4.6 is the production workhorse in Anthropic's 2026 Claude 4 family. It sits below Opus on cost but keeps nearly all of Opus's coding and tool-use reliability, with a 1M-token context and extended thinking. For most developers using Claude in production, Sonnet 4.6 is the default model, with Opus reserved for the hardest tasks.

Model specs

Vendor
Anthropic
Family
Claude 4
Released
2026-01
Context window
1,000,000 tokens
Modalities
text, vision, code
Input price
$3/M tok
Output price
$15/M tok
Pricing as of
2026-04-20

Strengths

  • Best cost/quality ratio in Anthropic's lineup
  • Very reliable tool use and structured outputs
  • 1M context plus prompt caching cuts long-document costs sharply
  • Follows XML-tag prompts precisely — great for templated pipelines

Limitations

  • Trails Opus 4.7 on frontier reasoning and hardest coding tasks
  • Slower than Haiku 4.5 for ultra-low-latency UX
  • No audio modality — pair with OpenAI Realtime or Gemini Live for voice

Use cases

  • Production coding agents — the default Claude Code model
  • Customer support and product assistants at scale
  • Multi-step agents with MCP servers and tool use
  • Content generation and summarisation for long documents

Benchmarks

BenchmarkScoreAs of
SWE-bench Verified≈70%2026-04
MMLU-Pro≈84%2026-04
GPQA Diamond≈66%2026-04

Frequently asked questions

What is Claude Sonnet 4.6?

Claude Sonnet 4.6 is the mid-tier model in Anthropic's 2026 Claude 4 family. It balances cost, latency, and quality — typically used as the default production model while Opus 4.7 is reserved for the hardest workloads.

When should I pick Sonnet 4.6 over Opus 4.7?

Choose Sonnet 4.6 for nearly all production workloads — coding copilots, support agents, RAG, tool-calling. Pick Opus 4.7 only when quality noticeably improves your end-user result and cost is tolerable.

How much does Claude Sonnet 4.6 cost?

As of April 2026, Sonnet 4.6 is priced at roughly USD 3 per million input tokens and USD 15 per million output tokens on the Anthropic API. Prompt caching further reduces effective cost.

Does Sonnet 4.6 support extended thinking?

Yes. Sonnet 4.6 supports Anthropic's extended-thinking mode, where the model produces a long internal reasoning trace before its final answer. This is toggled via an API parameter.

Sources

  1. Anthropic — Models overview — accessed 2026-04-20
  2. Anthropic — Pricing — accessed 2026-04-20