Ask HN: Claude Opus 4.5 vs. GPT 5.1 Codex Max for coding. Worth the upgrade?

4•terabytest•1d ago
I’m using gpt-5.1-codex-max comfortably for coding and hitting the weekly limit sometimes (but a few extra credits usually cover it).

I’ve heard Opus 4.5 might be better for coding. SWE-bench shows an 8% improvement but I'm having a hard time guessing what kind of effect that maps to in reality. For those who’ve switched, what changes have you seen, and how has it affected your work? Is the $100/month upgrade worth it?

Comments

chaidhat•1d ago
You must have immense patience to daily drive codex. To be honest, I’ve observed better code quality from codex (in terms of separation of concerns, high cohesion, loose coupling, etc.) but Opus has great quality at roughly 1/3rd of the speed. Try it on Cursor maybe, then decide if you want to switch. I’m curious: have you tried gemini pro 3, and do you think it deserves the hype?
dalmo3•12h ago
I'm using Opus through Cursor, but not a heavy user so price is "the same".

Opus is so good I can actually give it a task and move my attention somewhere else. So although the model itself is much slower my general workflow is faster and less frustrating.

sourdoughness•10h ago
Using Opus 4.5 through VScode/CoPilot gives so much better results than anything else I’ve tried that I kept paying when they briefly made it 3x token rate.

I really like the interaction flows better than Gemini 3 or Codex, though I can’t quite quantify why. The amount of explanation/supporting material in Opus’s output feels just right to me.

djinnrutger•9h ago
I have been using VSCode / CoPilot with Opus 4.5 and it has been working the best of any of the systems I have tried. Very happy with it so far. I never really got good results with GPT5.1, though 5.2 seems better... but not by a lot, so I will stick with Opus 4.5 for now.
muzani•8h ago
I'm fine with just Copilot.

Opus 4.5 has excellent tool use, meaning it can jump in and out of a broad undocumented codebase better. It can evaluate what the code is trying to do. It's perfect for PRs: it caught things like people submitting code that looks right but ends up running a poorly documented/incomplete method.

GPT codex just messes up a lot for me. Whatever I'm doing with it, it's not working. The plain GPT-5.2 is good overall, but it confidently makes mistakes and tells you that it's done.

If you have an excellent codebase, GPT 5.2 might actually work better. If you're not sure what you're doing or are using AI to find out how things work, then Opus 4.5 is great.

The Claude models are also very much behind in terms of UI and visuals.

Take note that a lot of the benchmarks are on Python. What I'm finding is all the major ones make mistakes, but they make mistakes differently. OpenAI and Anthropic tend to mimic one another for some reason, while Grok and Gemini tend to give very different answers.

otekengineering•4h ago
I'm impressed with Opus 4.5. It's been useful working on firmware projects where earlier models were of negative value.

Here's an example of a one-shot output; the only change I made was Replace All 'battlezone'->'battleclone':

"build a clone of the classic arcade game battlezone using SVG graphics that are calculated on the fly for the required vector wireframe graphics"

https://omnispect.dev/battleclone00.html
