Opus 4.7 to 4.8

What actually changed in Anthropic's flagship model, what the new effort control does, and where Max effort earns its tokens. Released 28 May 2026, same price as 4.7.

Shipped 28 May 2026 claude-opus-4-8 1M token context

The one-line read

"A modest but tangible improvement on its predecessor."

Anthropic's own framing. Translation for us: not a re-platform, not a reason to change how we work, but a genuine lift in coding accuracy, maths reasoning, honesty, and how independently it can run a long task. The bigger story is the new controls shipped alongside it.

Where each one sits

4.7 retired / 4.8 live

Predecessor

Opus 4.7

Released mid-April 2026. A tepid reception.

Solid flagship, but early users found its adaptive thinking over-eager on simple tasks
No manual effort control, the model decided how long to think
Strong coding, weaker honesty calibration
Same 1M context, same rate card

Current flagship

Opus 4.8

Released 28 May 2026, just 41 days later.

Sharper judgement and more independent on long agentic tasks
Manual effort control from Low to Max, you decide the spend
Around 4x less likely than 4.7 to let its own code flaws pass unremarked
Alignment near Mythos Preview level, lower deception rates than 4.7

The numbers that moved

Anthropic benchmarks / 4.7 to 4.8

Agentic codingSWE-Bench Pro

64.3%

→

69.2%

Beats GPT-5.5 (58.6%) and Gemini 3.1 Pro

Maths reasoningUSAMO 2026

69.3%

→

96.7%

The biggest single-cycle jump in the Opus line

Multidisciplinary reasoningWith tools

54.7%

→

57.9%

Steady gain across knowledge work

Code-flaw honestySelf-checking

Baseline

→

~4x

Less likely to let its own flaws pass unremarked

Honest caveat. These are Anthropic's own tests. 4.8 leads most public benchmarks against GPT-5.5 and Gemini 3.1 Pro, but GPT-5.5 still leads on terminal and CLI coding, and the two are roughly tied on web browsing and graduate-level science. Opus 4.8 sits between 4.7 and the unreleased Mythos Preview on Anthropic's internal capability ladder.

Effort control, and where Max fits

New on claude.ai and Cowork

There is now a control beside the model selector that lets me dial how hard Claude works on a single response. Higher effort means deeper thinking, more tokens, slower output, and better results. Lower effort means faster answers that are lighter on rate limits. This is the direct fix for 4.7's habit of overthinking simple tasks.

Lightest

Low

Fastest, cheapest on your limit. Quick lookups, reformatting, simple drafts, straightforward questions.

Balanced

Medium

Everyday drafting, summaries, moderate analysis. The sensible middle for routine work.

Recommended

High

Default

Anthropic's default. Roughly matches 4.7's token spend on coding while performing better. Most substantive work sits here.

Deepest

Max

Peak reasoning

Slowest, most tokens, deepest reasoning. Hard multi-step problems, high-stakes accuracy, long autonomous runs.

Adaptive thinking

A separate toggle that hands the decision back to the model, it judges how long to think per task. Useful when the workload is mixed and you do not want to keep changing the dial.

Worth confirming in your own selector. Labels differ slightly by surface. In Claude Code the high tier above the default is called "extra" or "xhigh", with "max" above that, and Anthropic has raised Claude Code rate limits to absorb the heavier token use. Early reporting also disagrees on the exact consumer default, so check what your model selector dropdown actually shows on claude.ai before you assume a setting.

Shipped on the same day

The capabilities, not just the model

01

Dynamic Workflows

In Claude Code, Claude plans the work then spawns hundreds of parallel subagents in one session. Anthropic's example is a codebase-scale migration across hundreds of thousands of lines, from kickoff to merge, with the existing test suite as the bar.

Research preview / Enterprise, Team, Max

02

Effort control

The Low to Max dial covered above. The single most useful day-to-day change for how we work in claude.ai and Cowork, because it puts cost and depth in our hands rather than the model's.

claude.ai and Cowork

03

Cheaper fast mode

Fast mode runs Opus 4.8 at about 2.5x the speed and is reportedly three times cheaper than it was on previous models. Standard pricing is held flat at 4.7 levels.

2.5x speed / 3x cheaper

04

Mid-task system entries in the API

The Messages API now accepts system entries inside the messages array. For our Make and MCP builds this means updating Claude's instructions mid-task without breaking the prompt cache or routing the change through a fake user turn. A quiet but genuinely useful change for automation.

Developer / API

05

Mythos on the horizon

Anthropic teased a more capable Mythos-class model for all customers in the coming weeks, pending safeguards. Worth tracking, but nothing to plan around yet. 4.8 is the working tool for now.

Teaser / not yet released

Sources. Anthropic launch announcement and system card (28 May 2026), as reported across Axios, Gizmodo, TechCrunch, VentureBeat, MacRumors, 9to5Mac, The Decoder, and Thurrott. Benchmark figures are Anthropic's own. Effort-level labels and the consumer default vary by surface and across early reporting, confirm in your model selector.