Claude Sonnet 4.6

https://www.anthropic.com/news/claude-sonnet-4-6

71•meetpateltech•1h ago

Comments

a_void_sky•1h ago

Opus 4.6 but cheaper

mudkipdev•1h ago

What happened to sonnet 5?

hxugufjfjf•40m ago

Those hours that with gentle work did frame The lovely gaze where every eye doth dwell, Will play the tyrants to the very same And that unfair which fairly doth excel:

meetpateltech•30m ago

They're probably saving 5 for a bigger leap.

rvz•59m ago

Anthropic is again running scared of the open weight models, which are rapidly catching up to them. Neither Sonnet nor Opus is going to help with that at all.

It has already happened with the music gen models. It's only a matter of time before the open weight models overtake Anthropic.

Expect them to dial up the scaremongering until they IPO. The Claude family of models is the only AI product keeping them alive.

throwup238•55m ago

What are the latest open music models?

falloon•35m ago

Ace step 1.5 is great, only 1.5b params so very easy to run locally.

https://github.com/ace-step/ACE-Step-1.5

catigula•52m ago

Chinese companies distilling frontier models is certainly a crisis but it isn't one that implies said Chinese companies are anywhere in the 'race'.

bigyabai•51m ago

The "race" matters less than making money. If those Chinese models perform well in price/performance, AGI might as well pound sand.

cube2222•50m ago

So tldr it seems like it's

- a reasonable improvement over sonnet 4.5, esp. with agentic tool use

- generally worse than opus 4.6

Probably not worth it for coding, but a win for anybody building agentic ai assistants of any sort with Sonnet.

Handy-Man•41m ago

It’s similar to or better than Opus 4.5 as per benchmarks, while being 2x-3x cheaper. Definitely worth it over Opus 4.6 if cost/tokens is the concern.

As a reminder, Opus 4.5 was SOTA 2-3 weeks ago.

adastra22•39m ago

Yes but Opus 4.6 is a massive step up. Some applications don’t need that power though.

dchuk•48m ago

curious if the 1m context window will be available by default in claude code. if so, that's a pretty big deal: "Sonnet 4.6’s 1M token context window is enough to hold entire codebases, lengthy contracts, or dozens of research papers in a single request. More importantly, Sonnet 4.6 reasons effectively across all that context."

pkaye•42m ago

Above 200k tokens of context they charge a premium. I think it's $10/M tokens of input.

_ink_•33m ago

Interesting. Is it because they can or is it really more expensive for them to process bigger context?

cube2222•24m ago

Attention is, at its core, quadratic wrt context length. So I'd believe that to be the case, yeah.

pkaye•13m ago

I've read that compute costs for LLMs go up O(n^2) with context window size. But I think it is also a combination of limited compute availability, users' preference for Anthropic models, and Anthropic planning to IPO.
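For a rough sense of scale, here is a minimal sketch of the quadratic argument above. It assumes cost is dominated by the n^2 self-attention term and ignores everything that scales linearly (feed-forward layers, KV-cache reads) as well as any serving-side optimizations, so treat the result as an upper-bound illustration, not a statement about Anthropic's actual costs. The function name `attention_cost_ratio` is made up for this example.

```python
def attention_cost_ratio(n_tokens: int, baseline_tokens: int) -> float:
    """Relative cost of the quadratic attention term versus a baseline context length.

    Assumes cost is proportional to n^2 and nothing else (a simplification).
    """
    return (n_tokens / baseline_tokens) ** 2


# Compare a full 1M-token context against the 200k threshold mentioned in the thread.
ratio = attention_cost_ratio(1_000_000, 200_000)
print(ratio)  # 25.0
```

Under that assumption, 5x the context means 25x the attention compute, which is at least consistent with charging a premium above 200k tokens.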

deanc•40m ago

I really don't get these companies posting disingenuous benchmarks. Every time, they pick and choose who to compare against. Not comparing to the latest 5.3-codex is absurd when it's been out a couple of weeks now. Who are they trying to kid?

AdamConwayIE•37m ago

There aren't really any of the typical benchmark suites targeting Codex 5.3 because it's still not in the API.

SWE-bench, for example, creates a predictions file and evaluates the results in the harness. Without Codex 5.3 being in the API, that can't be done.

rvz•36m ago

> Who are they trying to kid?

People who do not know how reproducible research works.

Any benchmark that is presented by AI labs must be reproduced reliably by someone else independent of that AI lab presenting these results.

Otherwise, not only is it biased, but these numbers could just be made up for marketing purposes.

falloon•33m ago

If you were writing a promotional post for your new model, would you include benchmarks of a competitor that's spanking you across the board? This is marketing.

rishabhaiover•40m ago

I am not seeing it on claude-code yet

devinprater•21m ago

I'm glad I have ChatGPT to turn that image with benchmarks into an accessible table lol. I like Claude Code, but their accessibility in anything other than accidental CLI accessibility is frustrating. Try it. Load a screen reader like VoiceOver for Mac (cause I know most programmers use Macs) and go to claude.ai. In the "write your prompt to Claude" box, type something like "What will the weather be like tomorrow?" and press Enter/Return. Try closing your eyes for a good 30 seconds and within those 30 seconds, tell me how you'd know if a reply has been given by the model. Then try the same thing with ChatGPT. I would /love/ to be proven wrong.
