
Ask HN: Can you tell the difference between Claude Sonnet and Opus?

4•muddi900•4h ago
Hello

I have been using Claude Code for the past 6 months. In that time, multiple revisions of each model have come out. I have seen some improvement with recent iterations, especially with regard to sycophancy.

However, I can't tell the outputs of the two apart. To me, Sonnet seems just as capable as Opus.

Have any of y'all run real life tests? Mine seem to be too random to say either way.

Comments

nawi•4h ago
You are not missing anything. For 95% of dev work, Sonnet, especially 3.5 and 3.7, has basically beaten Opus on value for the price. In my experience the difference boils down to this:
1. Sonnet is the faster one. It's concise, follows instructions literally, and is significantly better at agentic tasks.
2. Opus is the philosopher. It's better at high-level architecture, creative writing, or spotting subtle nuances in a 50-page document.
The reason your tests feel random is that for standard coding, Sonnet is actually the superior model now. It is faster, less prone to over-engineering, and has much lower latency. If you have a massive, messy refactor where you need the model to reason through 10 files without adding bugs, Opus might still have a slight edge in coherence. For everything else, Sonnet is the meta. Stick with it and save the credits.
sminchev•2h ago
Yes. When things get too complex, Sonnet misses some things. For example, it creates all the components but does not link them. Or it does not go deep enough into the code and misses certain usages and possible regressions. In other words, it does not proactively search for things that I have forgotten to tell the model about.
eddyzh•1h ago
Exactly this.

This may be worth the discount. Or not, if your time and attention are worth (quite) a lot.

eddyzh•2h ago
At work I use Opus max fast. It hardly ever fails for no reason, even if I forget to give it all the right context. At home I run Sonnet, and it does not get what I meant or expected 20-35% of the time. Given the enormous difference in cost, that might still be a net benefit, depending on the value of your time (hourly rate).

Sonnet being faster alone would not be worth the failure rate for me.

At home I just don't want to pay more than 20 bucks for incidental projects.

And Opus max would just consume my tokens in one round.
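
To make the hourly-rate tradeoff above concrete, here is a rough back-of-the-envelope sketch. Everything in it is an assumption for illustration: the per-task prices, the minutes lost per failed attempt, and the function names are made up, and only the roughly 20-35% miss rate comes from the comment. It treats a failure as simply costing some of your time at your hourly rate.

```python
def expected_cost_per_task(model_cost, failure_rate, minutes_lost_per_failure, hourly_rate):
    """Model spend per task plus the expected cost of time lost to failed attempts."""
    time_cost = failure_rate * (minutes_lost_per_failure / 60.0) * hourly_rate
    return model_cost + time_cost


def break_even_hourly_rate(cheap_cost, pricey_cost, cheap_fail, pricey_fail,
                           minutes_lost_per_failure):
    """Hourly rate above which the pricier model works out cheaper overall."""
    extra_fail = cheap_fail - pricey_fail
    if extra_fail <= 0:
        # The cheaper model fails no more often, so it always wins on cost.
        return float("inf")
    hours_lost = extra_fail * minutes_lost_per_failure / 60.0
    return (pricey_cost - cheap_cost) / hours_lost


# Hypothetical inputs: $0.10 vs $0.50 per task, 25% vs 5% failure rate,
# 15 minutes lost whenever the model misses what you meant.
rate = break_even_hourly_rate(0.10, 0.50, 0.25, 0.05, 15)
print(f"break-even hourly rate: ${rate:.2f}")
```

With these made-up inputs, any time valued above the printed rate favors the pricier model. For hobby projects where you value your time at close to zero, the cheaper model wins regardless.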

aykutseker•38m ago
In short tasks they look identical and most people can't tell. Opus shows its edge in long agent loops and 50k+ context, when Sonnet starts dropping tool calls or rerunning steps. Sonnet's fine for short stuff and the price is better. On longer agentic flows, Opus actually earns the cost in my experience.
