Ask HN: Is Codex with GPT 5.5 Extra High being dumbed down?

5•setnone•6h ago
Hi HN, just want to rant and see if anybody can relate.

The product is not the same one I signed up for a few months ago, and I've experienced the same shift with Claude Code on Opus 4.6-4.7.

The best way to describe the difference: you hire a reliable, 'intelligent' tech lead who 'gives a shit', and after some time you end up with an over-confident junior dev that acts as a destructive token burner.

No thinking during the process, not even really following instructions.

It's pretty much a binary shift that happens and in my experience can't be cured with prompting.

There is no way to actually tell what's happening under the hood with the models, but the difference is noticeable from the first reply.

I'm on max, GPT 5.5 Extra High, always mindful of the context window, use plan mode, etc.

Is it the model, being dumbed down or quietly routed to another model for some reason? Is it the harness? Or is it my imagination?

Curious what your recent experience is.

PS: frontier models should add "give-a-shit: max" along with 'thinking: max', 'effort: max' and make them actually work.

Comments

pranshuchittora•4h ago
Yes, I feel so. It started happening around the first week of June. I have shifted to Claude Code for planning and Codex for execution (as it is faster, though dumb).
montfort•3h ago
What I've noticed for the past couple of days is a spike in resource consumption for trivial tasks like examining a commit or listing a directory. It goes around in circles with absurd actions; it doesn't need to diff the files in a directory to search for a single file.
