sakutto
Generative AI· Claude Sonnet 5

What Is Claude Sonnet 5? Near-Opus Performance and Pricing Explained

ClaudeAnthropicClaude Sonnet 5
What Is Claude Sonnet 5? Near-Opus Performance and Pricing Explained

What Is Claude Sonnet 5?

Where Claude Sonnet 5 fits

Position
The latest model in the Sonnet series, described as the "most agentic Sonnet"
Performance
Achieves a level close to the flagship Opus 4.8 at a lower price
Length it can handle
1 million tokens at once on the API and major cloud platforms
Available plans
Default model for Free and Pro; also available on Max, Team, and Enterprise

Claude Sonnet 5 is the latest model in the Sonnet series of Claude, released by Anthropic on June 30, 2026. Sonnet was originally positioned as a mid-tier model with a good balance of performance and price. This Sonnet 5, however, goes beyond that frame: its biggest feature is offering performance that approaches the flagship Opus 4.8 while keeping the price down. Let's look at what it is and its features in order.

The Most Agentic Sonnet

Claude Sonnet 5 is a model with enhanced ability as an "agent" that plans on its own and works autonomously using tools such as browsers and terminals. By agent here, we mean a use case where it takes an instruction just once, decides the intermediate steps on its own, and sees the task through to the end. Anthropic describes Sonnet 5 as the most agentic Sonnet yet.

View official source →
Claude Sonnet 5 is built to be the most agentic Sonnet model yet. / It can make plans, use tools like browsers and terminals, and run autonomously at a level that, just a few months ago, required larger and more expensive models. — From the Claude Sonnet 5 announcement page (introduction)

Until recently, this kind of autonomous work required much larger and more expensive models. The significance of Sonnet 5 is that it delivers work at that level in a mid-tier price range.

Near-Opus 4.8 Performance at a Low Price

Another feature of Sonnet 5 is how close it is to the flagship model. Anthropic explains that Sonnet 5's performance has narrowed to a level close to Opus 4.8, and yet its price is lower.

View official source →
Sonnet 5 narrows the gap: its performance is close to that of Opus 4.8, but at lower prices. — From the Claude Sonnet 5 announcement page (introduction)

In other words, the key point this time is that much of the work that previously required a flagship model can now be handled by the more affordable Sonnet 5. We'll confirm the specific differences with numbers in the benchmarks section that follows.

1M-Token Support

Sonnet 5 can also handle a large amount of text at once, supporting 1 million tokens on the API and major cloud platforms. A token is a small unit that text is broken into, and 1 million tokens is roughly equivalent to one or two books.

View official source →
Claude Opus 4.8, Claude Opus 4.7, Claude Opus 4.6, Claude Sonnet 5, and Claude Sonnet 4.6 have a 1M-token context window on the Claude API, Amazon Bedrock, Google Cloud, and Microsoft Foundry. / Claude Sonnet 5, Claude Sonnet 4.6, Claude Sonnet 4.5, and Claude Haiku 4.5 have context awareness: these models track their remaining context window (their "token budget") throughout a conversation. — From Context windows, "Context window sizes by model" and "Context awareness"

It is also equipped with a mechanism to keep track of its remaining capacity (its remaining tokens) as it proceeds through a long exchange, making it well suited to loading lengthy materials all at once or being entrusted with long-running work.

Sonnet 5's Performance and Benchmarks

Comparison of key benchmarks (unit: %; higher is better)

SWE-bench Pro (agentic coding)

Sonnet 563.2 Sonnet 4.658.1 Opus 4.869.2

Terminal-Bench 2.1 (command-line work)

Sonnet 580.4 Sonnet 4.667.0 Opus 4.882.7

OSWorld-Verified (PC operation)

Sonnet 581.2 Sonnet 4.678.5 Opus 4.883.4

Anthropic presents benchmark results across multiple areas, including coding, reasoning, and PC operation. In each benchmark, the bars are listed from top to bottom in the order Sonnet 5, the previous-generation Sonnet 4.6, and the flagship Opus 4.8 (reference value). In every case you can see that it has clearly improved over Sonnet 4.6 and has come very close to Opus 4.8.

Agentic Coding Performance

Sonnet 5 has greatly improved its scores from the previous generation on benchmarks that simulate real software development. On SWE-bench Pro, which evaluates difficult code fixes, it scores 63.2%, beating Sonnet 4.6's 58.1%. On Terminal-Bench 2.1, which measures command-line work that requires planning, iteration, and tool integration, it scores 80.4%, a big step forward from Sonnet 4.6's 67.0%.

View official source →
It can make plans, use tools like browsers and terminals, and run autonomously at a level that, just a few months ago, required larger and more expensive models. — From the Claude Sonnet 5 announcement page (each score is a figure from the official benchmark table on the same page 〈the three columns Sonnet 5/Sonnet 4.6/Opus 4.8〉)

Scores in Reasoning, PC Operation, and Knowledge Work

It has also advanced beyond coding. On Humanity's Last Exam, which gathers difficult cross-domain questions; OSWorld-Verified, which operates a PC screen; and GDPval-AA v2, which measures practical knowledge work, it surpassed Sonnet 4.6. The table below summarizes the main scores presented by Anthropic.

BenchmarkClaude Sonnet 5Sonnet 4.6Opus 4.8 (reference)
SWE-bench Pro (coding)63.2%58.1%69.2%
Terminal-Bench 2.1 (command-line)80.4%67.0%82.7%
Humanity's Last Exam (no tools)43.2%34.6%49.8%
Humanity's Last Exam (with tools)57.4%46.8%57.9%
OSWorld-Verified (PC operation)81.2%78.5%83.4%
GDPval-AA v2 (knowledge work, score)161813951615

GDPval-AA v2, which measures knowledge work, is shown as a score rather than a percentage, and Sonnet 5's 1618 slightly exceeds Opus 4.8's 1615. Depending on the area, there are now cases where Sonnet 5 stands shoulder to shoulder with, or even overtakes, the flagship model.

Improvement from Sonnet 4.6 and the Gap with Opus

Lining up the numbers makes the trend clear. Compared with the previous-generation Sonnet 4.6, Sonnet 5 has improved on every item by anywhere from a few points to more than ten points. On the other hand, the gap with the flagship Opus 4.8 stays within a few points on many items. Sonnet 5's real strength is that it has been reliably raised up from the old Sonnet and has come to just short of the flagship. How you view this gap becomes the basis for choosing a model in the second half.

Sonnet 5's Pricing and Available Plans

Availability of Claude Sonnet 5

Chat
Default model for Free and Pro; also available on Max, Team, and Enterprise
For developers
Claude Code, Claude API (model ID: claude-sonnet-5)
Cloud platforms
Amazon Bedrock, Google Cloud, Microsoft Foundry
API introductory price
$2 input / $10 output per million tokens (through August 31, 2026)

Pricing is easier to grasp if you think of it in two ways of using it: chat under a monthly plan, and the API where you pay for what you use. Let's look at each in turn.

What You Can Use in Chat, and the Default Model

First, chat, which most people use. Claude Sonnet 5 has become the default model on the free plan (Free) and the paid Pro plan. That means you can converse with Sonnet 5 as standard, with no special setup. It is also available on the Max, Team, and Enterprise plans, and within a monthly plan, there is no per-model additional charge for using chat.

View official source →
Claude Sonnet 5 is available across all plans: it is the default model for Free and Pro plans, and is available to Max, Team, and Enterprise users. — From the Claude Sonnet 5 announcement page (Availability and pricing)

Now that Sonnet 5 has become the standard for Free and Pro, many users are naturally using a higher-performing model than before. For how to use Claude overall and how to think about plans, see also the explainer article What Is Claude? Pricing and How to Choose a Model.

API Pricing and the Introductory Price

The API, which you build into your own app or program, is usage-based according to the amount used (tokens). Sonnet 5 has an introductory price set: through August 31, 2026, it is $2 per million input tokens and $10 per million output tokens, and after that it becomes $3 for input and $15 for output.

View official source →
It's also available in Claude Code and on the Claude Platform, where it launches with introductory pricing of $2 per million input tokens and $10 per million output tokens through August 31, 2026, after which it will be priced at $3 per million input tokens and $15 per million output tokens. — From the Claude Sonnet 5 announcement page (Availability and pricing)
PeriodInput (per million tokens)Output (per million tokens)
Introductory price (through August 31, 2026)$2$10
Standard price (from September 1, 2026)$3$15

For the time being, it's an introductory-price period during which you can use it at a relative bargain for its performance. For how to compare pricing with the Opus series, we lay it out in detail in the explainer article Claude Opus 4.8 / 4.7 / 4.6 Pricing Comparison.

Availability in Claude Code and the API

For developers, it is available from Claude Code, the AI coding environment, and the Claude API. The model ID (the model's identifier) to specify in the API is claude-sonnet-5. In addition, it can be used from major cloud platforms such as Amazon Bedrock, Google Cloud, and Microsoft Foundry.

View official source →
Developers can use `claude-sonnet-5` via the Claude API. — From the Claude Sonnet 5 announcement page

How to Choose Between Sonnet 5 and Opus

Choosing between Sonnet 5 and Opus 4.8

Claude Sonnet 5
Broadly, from everyday consultation and document creation to coding and processing long materials. Suited to high-volume work and cost-conscious situations
Claude Opus 4.8
The most difficult analysis and development, where the last few percent of accuracy decides the outcome. Suited to situations where quality is the top priority

With the arrival of Sonnet 5, choosing between it and the flagship Opus 4.8 has become clearer. The benchmark gap is small and Sonnet 5 is cheaper, so the axis of judgment narrows down to "how much accuracy do you need."

When Sonnet 5 Is a Good Fit

Most everyday work can be handled just fine with Sonnet 5. It broadly suits uses where the balance of performance and cost is effective, such as writing and summarizing text, research, programming assistance, and loading long materials. In particular, for work that runs many similar processes or situations where you want to keep costs down, Sonnet 5 — which offers near-Opus 4.8 performance cheaply — is the first choice.

When You Should Choose Opus 4.8

On the other hand, for work where the last slight bit of accuracy greatly affects the outcome, Opus 4.8 still has the edge. For example, on SWE-bench Pro, Opus 4.8 scores 69.2%, beating Sonnet 5's 63.2%. For the most difficult code fixes, or analysis where you want to prioritize quality above all, Opus 4.8 is worth choosing. The features of Opus 4.8 itself are explained in the article Claude Opus 4.8 Release Explained.

The Bottom Line on Choosing a Model When Unsure

To sum up, the sensible choice, both in terms of performance and price, is to first try the standard Sonnet 5 and switch to Opus 4.8 only for the tough spots where accuracy is simply not enough. Because the Free and Pro default has become Sonnet 5, most users can enjoy this benefit without changing any settings in particular. If you want to compare with other companies' latest models, see also the article GPT-5.6 Sol, Terra, and Luna Explained.

Claude's official announcements are often long texts in English, and there are times when you want to quickly grasp the key points. If you tidy up a web page into markdown format as-is, the structure of headings and tables is preserved, and it also tends to improve the accuracy when you have an AI read and organize it.

Free ToolURL to Markdown ConverterConvert any public web page URL to Markdown. Preserves headings, tables, lists, and links — perfect for LLM and RAG preprocessing, research notes, and archiving web articles.Try it now →

FAQ

Q. Is Claude Sonnet 5 free to use?
Yes. Claude Sonnet 5 is the default model on the free plan (Free) and the paid Pro plan, and it is also available on the Max, Team, and Enterprise plans. An easy way to approach it is to start within the free tier and move up to a higher plan once you need more usage or accuracy.
Anthropic 公式ニュース — Claude Sonnet 5(Availability and pricing)
Claude Sonnet 5 is available across all plans: it is the default model for Free and Pro plans, and is available to Max, Team, and Enterprise users. Anthropic 公式ニュース — Claude Sonnet 5(Availability and pricing)
Q. Which is more powerful, Claude Sonnet 5 or Opus 4.8?
The highest benchmark scores still belong to the flagship Opus 4.8, but the gap has narrowed. Anthropic describes Sonnet 5's performance as close to the level of Opus 4.8, with the advantage of being cheaper to use. A practical approach is to rely on Sonnet 5 for everyday work and choose Opus 4.8 only for the toughest tasks where the last few percent of accuracy matters.
Anthropic 公式ニュース — Claude Sonnet 5
Sonnet 5 narrows the gap: its performance is close to that of Opus 4.8, but at lower prices. Anthropic 公式ニュース — Claude Sonnet 5
Q. How much does Claude Sonnet 5 cost?
If you use it in chat under a monthly plan, it is included within your plan at no additional charge. If you use it via the API, pricing is usage-based: as an introductory price, $2 per million input tokens and $10 per million output tokens applies through August 31, 2026, after which it becomes $3 for input and $15 for output.
Anthropic 公式ニュース — Claude Sonnet 5(Availability and pricing)
It's also available in Claude Code and on the Claude Platform, where it launches with introductory pricing of $2 per million input tokens and $10 per million output tokens through August 31, 2026, after which it will be priced at $3 per million input tokens and $15 per million output tokens. Anthropic 公式ニュース — Claude Sonnet 5(Availability and pricing)
Q. How long a document can Claude Sonnet 5 handle?
On the API and major cloud platforms, it can handle up to 1 million tokens at once. A token is a small unit that text is broken into, and 1 million tokens is roughly equivalent to one or two books. This is a specification that shines when you want to load lengthy materials all at once.
Anthropic 公式ドキュメント — Context windows
Claude Opus 4.8, Claude Opus 4.7, Claude Opus 4.6, Claude Sonnet 5, and Claude Sonnet 4.6 have a 1M-token context window on the Claude API, Amazon Bedrock, Google Cloud, and Microsoft Foundry. Anthropic 公式ドキュメント — Context windows

Related Tools

Related Tool Categories

Articles