Curiosity · AI Model

Claude 3 Opus

Claude 3 Opus is Anthropic's March 2024 flagship and the model that pulled Claude into the frontier tier alongside GPT-4. It set a new bar for long-context recall, measured near-perfect on needle-in-a-haystack at 200K tokens, and established Anthropic as a top-three LLM provider before the Claude 3.5 family rebalanced the lineup.

Model specs

Vendor
Anthropic
Family
Claude 3
Released
2024-03
Context window
200,000 tokens
Modalities
text, vision, code
Input price
$15/M tok
Output price
$75/M tok
Pricing as of
2026-04-20

Strengths

  • Near-perfect needle-in-a-haystack recall across 200K tokens
  • Strong nuanced writing and refusal behaviour
  • The model that re-established Anthropic as a frontier lab
  • Clean, predictable output style

Limitations

  • No tool use or JSON-mode stability compared to 2025 models
  • Slower and pricier than newer Sonnet / Opus tiers
  • Superseded on nearly every benchmark by Claude 3.5 Sonnet and later
  • No extended thinking or MCP support

Use cases

  • Long-document analysis — contracts, research, books
  • Multi-step reasoning and research assistants (2024 era)
  • High-stakes content where hallucination cost was high
  • Translation and high-context instruction following

Benchmarks

BenchmarkScoreAs of
MMLU≈86.8%2024-03
HumanEval≈84.9%2024-03
GPQA (graduate)≈50.4%2024-03

Frequently asked questions

What is Claude 3 Opus?

Claude 3 Opus is Anthropic's original Opus-tier flagship, released March 2024. It was the first Claude model to compete with GPT-4 on general benchmarks and introduced Anthropic's 200K-token long-context capability.

Is Claude 3 Opus still worth using?

For new projects, no — Claude Opus 4.7 or Sonnet 4.6 is meaningfully better and often cheaper per unit of quality. Claude 3 Opus is offered mainly for legacy pipelines that can't migrate.

How much does Claude 3 Opus cost?

As of April 2026, Claude 3 Opus is priced at USD 15 per million input tokens and USD 75 per million output tokens — the same headline pricing as Opus 4.7 but with older-generation quality.

What was Claude 3 Opus best known for?

Its near-perfect long-context recall — the famous needle-in-a-haystack demos where the model found one sentence in a 200K-token document with high accuracy. It reset community expectations about what long context could actually do.

Sources

  1. Anthropic — Claude 3 family — accessed 2026-04-20
  2. Anthropic — Pricing — accessed 2026-04-20