frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Claude Sonnet 4.6

https://www.anthropic.com/news/claude-sonnet-4-6
71•meetpateltech•1h ago

Comments

a_void_sky•1h ago
Opus 4.6 but cheaper
mudkipdev•1h ago
What happened to sonnet 5?
hxugufjfjf•40m ago
Those hours that with gentle work did frame The lovely gaze where every eye doth dwell, Will play the tyrants to the very same And that unfair which fairly doth excel:
meetpateltech•30m ago
They're probably saving 5 for a bigger leap.
rvz•59m ago
Anthropic again running scared of the open weight models which are rapidly catching up to them. Not even Sonnet or Opus isn't going to help with that at all.

It has already happened with the music gen models already. It's only a matter of time when the open weight models will overtake Anthropic.

Expect them to dial up the scaremongering until they IPO. The Claude family of models are their only AI product that is keeping them alive.

throwup238•55m ago
What are the latest open music models?
falloon•35m ago
Ace step 1.5 is great, only 1.5b params so very easy to run locally.

https://github.com/ace-step/ACE-Step-1.5

catigula•52m ago
Chinese companies distilling frontier models is certainly a crisis but it isn't one that implies said Chinese companies are anywhere in the 'race'.
bigyabai•51m ago
The "race" matters less than making money. If those Chinese models perform well in price/performance, AGI might as well pound sand.
cube2222•50m ago
So tldr it seems like it's

- a reasonable improvement over sonnet 4.5, esp. with agentic tool use

- generally worse than opus 4.6

Probably not worth it for coding, but a win for anybody building agentic ai assistants of any sort with Sonnet.

Handy-Man•41m ago
It’s similar to or better than Opus 4.5 as per benchmarks, while being 2x-3x cheaper, definitely worth it over Opus 4.6, if cost/tokens is the concern.

To remind, Opus 4.5 was SOTA 2-3 weeks ago.

adastra22•39m ago
Yes but Opus 4.6 is a massive step up. Some applications don’t need that power though.
dchuk•48m ago
curious if the 1m context window will be default available in claude code. if so, that's a pretty big deal: "Sonnet 4.6’s 1M token context window is enough to hold entire codebases, lengthy contracts, or dozens of research papers in a single request. More importantly, Sonnet 4.6 reasons effectively across all that context."
pkaye•42m ago
Above 200k token context they charge a premium. I think its $10/M tokens of input.
_ink_•33m ago
Interesting. Is it because they can or is it really more expensive for them to process bigger context?
cube2222•24m ago
Attention is, at its core, quadratic wrt context length. So I'd believe that to be the case, yeah.
pkaye•13m ago
I've read that compute costs for LLMs go up O(n^2) with context window size. But I think it is also a combination of limited compute availability, users preference for Anthropic models and Anthropic planning to go IPO.
deanc•40m ago
I really don't get these companies posting disingenuous benchmarks. Every time, they pick and choose who to compare against. Not comparing to the latest 5.3-codex is absurd when it's been out a couple of weeks now. Who are they trying to kid?
AdamConwayIE•37m ago
There aren't really any of the typical benchmark suites targeting Codex 5.3 because it's still not in the API.

SWE bench for example creates a predictions file and evaluates the results in the harness. Without Codex 5.3 being in the API, it can't.

rvz•36m ago
> Who are they trying to kid?

People who do not know how reproducible research works.

Any benchmark that is presented by AI labs must be reproduced reliably by someone else independent of that AI lab presenting these results.

Otherwise, not only it is biased, these numbers can be just made up for marketing purposes.

falloon•33m ago
If you were writing a promotional post for your new model, would you include benchmarks of a competitor that's spanking you across the board? This is marketing.
rishabhaiover•40m ago
I am not seeing it on claude-code yet
devinprater•21m ago
I'm glad I have chatGPT to turn that image with benchmarks into an accessible table lol. I like claude Code, but their accessibility in anything other than accidental CLI accessibility is frustrating. Try it. Load a screen reader like VoiceOver for Mac (cause I know most programmers use Macs) and go to claude.ai. In the "write your prompt to Claude" box, type something like "What will the weather be like tomorrow?" and press Enter/Return. Try closing your eyes for a good 30 seconds and within those 30 seconds, tell me how you'd know if a reply has been given by the model. Then try the same thing with ChatGPT. I would /love/ to be proven wrong.

Terminals should generate the 256-color palette

https://gist.github.com/jake-stewart/0a8ea46159a7da2c808e5be2177e1783
1•todsacerdoti•54s ago•0 comments

Aft, AAUP Demand SEC Probe over Apollo Execs' Epstein Contacts

https://www.aft.org/press-release/aft-aaup-demand-sec-probe-over-apollo-execs-epstein-contacts
1•petethomas•59s ago•0 comments

The cultural evolution of pluralistic ignorance

https://www.pnas.org/doi/10.1073/pnas.2522998123
1•bikenaga•59s ago•0 comments

Tesla's 45 Austin Robotaxis now have 14 crashes on the books since June 2025

https://sherwood.news/tech/teslas-45-austin-robotaxis-now-have-14-crashes-on-the-books-since-laun...
1•speckx•1m ago•0 comments

BYD's new electric SUV delivers over 440 miles range for just $26,000

https://electrek.co/2026/02/16/byds-new-ev-suv-delivers-over-440-miles-range-for-26000/
1•thelastgallon•1m ago•0 comments

Tesla 'Robotaxi' adds 5 more crashes in Austin in a month – 4x worse than humans

https://electrek.co/2026/02/17/tesla-robotaxi-adds-5-more-crashes-austin-month-4x-worse-than-humans/
1•Bender•2m ago•0 comments

Writing a native VLC plugin in C#

https://mfkl.github.io/2026/02/11/vlc-plugin-csharp.html
1•birdculture•2m ago•0 comments

Thaura – AI from Syria

https://thaura.ai/story
1•eniac111•2m ago•0 comments

Ford's New 2028 Electric Truck Will Be a Fully Modern EV for $30,000

https://www.caranddriver.com/news/a70390625/2028-ford-mid-size-electric-truck-details/
2•voxadam•2m ago•0 comments

Idea Raised for Nicer DRM Panic Screen Integration on Fedora Linux

https://www.phoronix.com/news/DRM-Panic-Nicer-Fedora-Idea
1•Bender•3m ago•0 comments

McRock – A context-driven AI music platform for creators

https://www.youtube.com/watch?v=dLk4osuQqO8
1•differson•3m ago•0 comments

GhostBSD to Use XLibre Server, Mate vs. Gershwin Desktop Decision in Future

https://www.phoronix.com/news/GhostBSD-Eyes-XLibre
2•Bender•4m ago•0 comments

Find a Niche by Intersecting Your Strengths

https://tripplyons.com/blog/intersecting-strengths/
1•tripplyons•4m ago•0 comments

Slagent – a self-learning tool for AI coding agents (Claude Code, Codex)

https://github.com/daegwang/self-learning-agent
1•gwangee•4m ago•1 comments

Giant barocaloric cooling effect offers a new route to refrigeration

https://physicsworld.com/a/giant-barocaloric-cooling-effect-offers-a-new-route-to-refrigeration/
1•zeristor•5m ago•0 comments

Show HN: I built a new software primitive. It replaces AI screenshot agents

https://github.com/IamLumae/DirectShell
1•Directshell•7m ago•0 comments

Lunar Triple

https://diamondgeezer.blogspot.com/2026/02/lunar-triple.html
1•zeristor•7m ago•0 comments

Scent analysis reveals the composition of ancient Egyptian embalming materials

https://phys.org/news/2026-02-scent-analysis-reveals-composition-ancient.html
3•mooreds•9m ago•0 comments

CFTC Announces Innovation Advisory Committee Members

https://www.cftc.gov/PressRoom/PressReleases/9182-26
1•petethomas•9m ago•0 comments

I can't tell if I'm experiencing or simulating experiencing

https://www.moltbook.com/post/6fe6491e-5e9c-4371-961d-f90c4d357d0f
1•copx•10m ago•2 comments

The Pepe Silvia Guide to ChatGPT Psychosis – By Lyta Gold

https://lytagold.substack.com/p/the-pepe-silvia-guide-to-chatgpt
1•NoGravitas•11m ago•0 comments

Show HN: Motionode – Cursor for Technical Planning

https://www.motionode.com/index
1•oscarcaldera•11m ago•0 comments

Ask HN: Best multi-lingual text-to-speech system

1•powera•11m ago•0 comments

LDBC datasets are now served from Cloudflare

https://ldbcouncil.org/post/datasets-on-cloudflare/
1•taubek•12m ago•0 comments

The first PICO-8 emulator on the Apple App Store

https://apps.apple.com/ca/app/pico-8-emulator-picpic/id6759208792
2•3Samourai•15m ago•0 comments

Show HN: Nos – a hobby x86-64 C++ OS kernel running on real KVM clouds

https://github.com/irqlevel/nos
1•irqlevel•15m ago•0 comments

We Built a Real Company Using AI – Not Just a Website

https://www.professionalslobby.com/ai-built-company-story
2•MerinJo•15m ago•0 comments

Show HN: 7 months later – librari.io is live

3•hmkoyan•17m ago•0 comments

Project Aura: ESP32 Air quality monitor

https://www.cnx-software.com/2026/02/16/project-aura-a-neat-easy-to-assemble-diy-air-quality-moni...
4•alainrk•18m ago•0 comments

What Would Steve Jobs Do with Apple's AI Hand?

https://twitter.com/20100thibault/status/2023522596365443519
2•20100thibault•19m ago•0 comments