Opus 4.7 to 4.8

What actually changed in Anthropic's flagship model, what the new effort control does, and where Max effort earns its tokens. Released 28 May 2026, same price as 4.7.

Shipped 28 May 2026 claude-opus-4-8 1M token context
The one-line read
"A modest but tangible improvement on its predecessor."

Anthropic's own framing. Translation for us: not a re-platform, not a reason to change how we work, but a genuine lift in coding accuracy, maths reasoning, honesty, and how independently it can run a long task. The bigger story is the new controls shipped alongside it.

Where each one sits

4.7 retired / 4.8 live
Predecessor
Opus 4.7
Released mid-April 2026. A tepid reception.
  • Solid flagship, but early users found its adaptive thinking over-eager on simple tasks
  • No manual effort control, the model decided how long to think
  • Strong coding, weaker honesty calibration
  • Same 1M context, same rate card
Current flagship
Opus 4.8
Released 28 May 2026, just 41 days later.
  • Sharper judgement and more independent on long agentic tasks
  • Manual effort control from Low to Max, you decide the spend
  • Around 4x less likely than 4.7 to let its own code flaws pass unremarked
  • Alignment near Mythos Preview level, lower deception rates than 4.7

The numbers that moved

Anthropic benchmarks / 4.7 to 4.8
Agentic codingSWE-Bench Pro
64.3%
69.2%
Beats GPT-5.5 (58.6%) and Gemini 3.1 Pro
Maths reasoningUSAMO 2026
69.3%
96.7%
The biggest single-cycle jump in the Opus line
Multidisciplinary reasoningWith tools
54.7%
57.9%
Steady gain across knowledge work
Code-flaw honestySelf-checking
Baseline
~4x
Less likely to let its own flaws pass unremarked

Honest caveat. These are Anthropic's own tests. 4.8 leads most public benchmarks against GPT-5.5 and Gemini 3.1 Pro, but GPT-5.5 still leads on terminal and CLI coding, and the two are roughly tied on web browsing and graduate-level science. Opus 4.8 sits between 4.7 and the unreleased Mythos Preview on Anthropic's internal capability ladder.

Effort control, and where Max fits

New on claude.ai and Cowork

There is now a control beside the model selector that lets me dial how hard Claude works on a single response. Higher effort means deeper thinking, more tokens, slower output, and better results. Lower effort means faster answers that are lighter on rate limits. This is the direct fix for 4.7's habit of overthinking simple tasks.

Lightest
Low

Fastest, cheapest on your limit. Quick lookups, reformatting, simple drafts, straightforward questions.

Balanced
Medium

Everyday drafting, summaries, moderate analysis. The sensible middle for routine work.

Recommended
High
Default

Anthropic's default. Roughly matches 4.7's token spend on coding while performing better. Most substantive work sits here.

Deepest
Max
Peak reasoning

Slowest, most tokens, deepest reasoning. Hard multi-step problems, high-stakes accuracy, long autonomous runs.

Adaptive thinking

A separate toggle that hands the decision back to the model, it judges how long to think per task. Useful when the workload is mixed and you do not want to keep changing the dial.

Worth confirming in your own selector. Labels differ slightly by surface. In Claude Code the high tier above the default is called "extra" or "xhigh", with "max" above that, and Anthropic has raised Claude Code rate limits to absorb the heavier token use. Early reporting also disagrees on the exact consumer default, so check what your model selector dropdown actually shows on claude.ai before you assume a setting.

Shipped on the same day

The capabilities, not just the model
01

Dynamic Workflows

In Claude Code, Claude plans the work then spawns hundreds of parallel subagents in one session. Anthropic's example is a codebase-scale migration across hundreds of thousands of lines, from kickoff to merge, with the existing test suite as the bar.

Research preview / Enterprise, Team, Max
02

Effort control

The Low to Max dial covered above. The single most useful day-to-day change for how we work in claude.ai and Cowork, because it puts cost and depth in our hands rather than the model's.

claude.ai and Cowork
03

Cheaper fast mode

Fast mode runs Opus 4.8 at about 2.5x the speed and is reportedly three times cheaper than it was on previous models. Standard pricing is held flat at 4.7 levels.

2.5x speed / 3x cheaper
04

Mid-task system entries in the API

The Messages API now accepts system entries inside the messages array. For our Make and MCP builds this means updating Claude's instructions mid-task without breaking the prompt cache or routing the change through a fake user turn. A quiet but genuinely useful change for automation.

Developer / API
05

Mythos on the horizon

Anthropic teased a more capable Mythos-class model for all customers in the coming weeks, pending safeguards. Worth tracking, but nothing to plan around yet. 4.8 is the working tool for now.

Teaser / not yet released

Sources. Anthropic launch announcement and system card (28 May 2026), as reported across Axios, Gizmodo, TechCrunch, VentureBeat, MacRumors, 9to5Mac, The Decoder, and Thurrott. Benchmark figures are Anthropic's own. Effort-level labels and the consumer default vary by surface and across early reporting, confirm in your model selector.