Claude 4
Anthropic's first model family explicitly designed for sustained autonomous work — running for hours on multi-step tasks without human intervention. Opus 4 became the dominant model behind "AI software engineer" products in 2025, while Sonnet 4 stayed the workhorse mid-tier. Both shipped with native tool use, file editing, and computer-use improvements.
How are Intelligence, Speed & Cost bucketed?
Intelligence
- Top 1%: ≤ 1%
- Top 5%: ≤ 5%
- Top 10%: ≤ 10%
- Good: ≤ 25%
- Medium: ≤ 50%
- Below avg: > 50%
Speed
- Top 1%: ≥ 345 tok/s
- Top 5%: ≥ 237 tok/s
- Top 10%: ≥ 196 tok/s
- Good: ≥ 146 tok/s
- Medium: ≥ 90 tok/s
- Slow: < 90 tok/s
Cost
- Free: open weights · self-host
- Low: < $1 / M out
- Moderate: $1–5 / M out
- High: ≥ $5 / M out
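The thresholds in this legend can be read as a simple lookup. The sketch below is one possible reading, not the site's actual implementation: the function names are hypothetical, the intelligence input is assumed to be a rank percentile (lower is better), and an open-weights model is assumed to be signalled by passing `None` as the price.

```python
from typing import Optional

# Thresholds copied from the legend; everything else here is illustrative.
INTELLIGENCE_TIERS = [(1, "Top 1%"), (5, "Top 5%"), (10, "Top 10%"),
                      (25, "Good"), (50, "Medium")]
SPEED_TIERS = [(345, "Top 1%"), (237, "Top 5%"), (196, "Top 10%"),
               (146, "Good"), (90, "Medium")]


def intelligence_bucket(rank_percentile: float) -> str:
    """Assumes a rank percentile where smaller means a better-ranked model."""
    for ceiling, label in INTELLIGENCE_TIERS:
        if rank_percentile <= ceiling:
            return label
    return "Below avg"


def speed_bucket(tokens_per_second: float) -> str:
    """Walk the tiers from fastest to slowest and return the first match."""
    for floor, label in SPEED_TIERS:
        if tokens_per_second >= floor:
            return label
    return "Slow"


def cost_bucket(usd_per_million_output: Optional[float]) -> str:
    """None stands in for open weights (assumption); otherwise USD per 1M output tokens."""
    if usd_per_million_output is None:
        return "Free"
    if usd_per_million_output < 1:
        return "Low"
    if usd_per_million_output < 5:
        return "Moderate"
    return "High"
```

Under this reading a price of exactly $5 per million output tokens lands in High, since the legend gives High as ≥ $5. At Anthropic's launch list prices of $75 (Opus 4) and $15 (Sonnet 4) per million output tokens, both models fall into High.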
Why it matters
Claude 4 is the model that made "let the AI write the code autonomously, then review the PR" a normal workflow at frontier engineering teams. The labor-market implications, including the contraction in entry-level engineering hiring visible in Q3 2025 BLS data, are downstream of this and similar releases.
Capability / Performance
Where this model sits relative to the middle 60% of models in the tree. All scores are 0–10 (higher is better).
What it feels like
- Sustains hours-long agent loops without losing the thread — testers ran Opus 4 on 7+ hour autonomous coding sessions
- Best-in-class for real-world software engineering at release, with higher SWE-bench Verified scores than Claude 3.7 Sonnet
- Native MCP support makes tool-using agents noticeably more reliable in production
- File editing and computer use feel meaningfully better than 3.7 — fewer 'almost works' debugging cycles
- Opus 4 is slower and more expensive than alternatives; reserve for the hardest tasks
- Sonnet 4 is the workhorse — same agentic strengths at a much friendlier price
Best use cases
- AI-software-engineer products (Devin, Cursor, Cline) that need long-horizon autonomy
- Agentic workflows with tool use, file editing, and computer use
- Complex bug fixes that span multiple files / systems
- Multi-hour research or coding sessions where context retention matters
Not ideal for
- High-volume chat or simple Q&A on Opus 4 (use Sonnet 4 or a Haiku model instead)
- Strictly latency-sensitive applications — Opus is slower
- Fully offline / air-gapped deployments (hosted API only; no downloadable weights)
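The guidance in the lists above (reserve Opus 4 for the hardest long-horizon work, default to Sonnet 4 otherwise) can be sketched as a small routing rule. This is a hypothetical illustration only: the `Task` fields and the tier labels `"opus-4"` and `"sonnet-4"` are made up for the example and are not API model identifiers.

```python
from dataclasses import dataclass


@dataclass
class Task:
    # All fields are hypothetical flags a caller would set for each request.
    multi_file: bool = False         # change spans several files or systems
    long_horizon: bool = False       # expected to run as a multi-hour agent loop
    latency_sensitive: bool = False  # a user is waiting on the response
    high_volume: bool = False        # simple chat or Q&A at scale


def pick_tier(task: Task) -> str:
    """Return an illustrative tier label, not a real model ID."""
    # Latency or volume constraints rule out the slower, pricier tier first.
    if task.latency_sensitive or task.high_volume:
        return "sonnet-4"
    # Reserve the top tier for the hardest work: long-running or multi-file tasks.
    if task.long_horizon or task.multi_file:
        return "opus-4"
    return "sonnet-4"
```

Checking the constraints before the difficulty flags is a deliberate choice in this sketch: a task that is both hard and latency-sensitive still routes to the cheaper tier, matching the "Not ideal for" list.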
Model Evolution
Claude is Anthropic's language model family.