frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Ask HN: What Are You Working On? (April 2026)

216•david927•16h ago•668 comments

Tell HN: Docker pull fails in Spain due to football Cloudflare block

875•littlecranky67•20h ago•325 comments

Tell HN: OpenAI silently removed Study Mode from ChatGPT

170•smokel•19h ago•71 comments

Tell HN: Reddit now demands to know why you won't use their app

15•josephcsible•6h ago•16 comments

Ask HN: What are all the bad things that AI companies have done which we forgot

7•Imustaskforhelp•19h ago•1 comments

Ask HN: Do you trust AI agents with API keys / private keys?

12•devendra116•1d ago•24 comments

Ask HN: Anyone using Nostr as a lightweight back end/DB for rapid prototyping?

6•wasimsk•1d ago•0 comments

Ask HN: What are you building that's not AI related?

148•meander_water•4d ago•204 comments

Ask HN: What's your experience with PoW captchas against form spam?

5•pentacent_hq•19h ago•8 comments

Ask HN: Hiring in the age of AI-assisted coding: what works?

26•nitramm•2d ago•17 comments

Is the pitch deck culture making founders worse at building businesses?

17•chinhqtran•1d ago•6 comments

Ask HN: Best books on building a programming language

17•ezzato•2d ago•8 comments

Ask HN: What should I do with my app? 130 downloads 3 real subscribers

3•oyaa52•1d ago•7 comments

Ask HN: Former grok-code-fast-1 users, what coding model are you using now?

2•whycombinetor•1d ago•3 comments

Any Open Source projects in need of documentation writer?

21•tree666•3d ago•13 comments

Ask HN: Why Databases Instead of Filesystem?

13•uticus•2d ago•20 comments

Ask HN: Agentic Permutation of Testing Paths In A System

4•davidajackson•1d ago•0 comments

Tor Browser on Android leaks IP in desktop mode

13•shchess•1d ago•2 comments

Ask HN: Has anyone reconsidered Antivirus software after recent security news?

6•pants2•1d ago•5 comments

Ask HN: Should AI credits be refunded on mistakes?

19•ed_elliott_asc•4d ago•20 comments

Do founders' political views affect how you see a product?

4•rishikeshs•1d ago•3 comments

I collected startup ideas. It changed how I think about ideas completely

10•vibecoder21•2d ago•11 comments

Is VC the new PMF strategy?

3•networkOne•2d ago•5 comments

Ask HN: How do you manage your digital legacy for after you die?

15•orbanlevi•4d ago•16 comments

Ask HN: Local-first meetings recorder and transcriber?

7•dandaka•3d ago•1 comments

Open Source card game cuttle.cards has its world championship Saturday at 1pm ET

4•aleph_one•2d ago•0 comments

Is it just me, or Opus 4.6 is sounding bit dumb lately

7•rambrrest•2d ago•4 comments

Ask HN: Are you encountering AI-related questions in the hiring market?

7•somthingwrong•3d ago•2 comments

You've reached the end!

Open in hackernews

Is it just me, or Opus 4.6 is sounding bit dumb lately

7•rambrrest•2d ago
Going round and round in one of the harness I use.

Comments

Imustaskforhelp•2d ago
Not using opus 4.6 but I have heard the same things.

https://old.reddit.com/r/LocalLLaMA/comments/1sgd7fp/its_ins...

There was also another post about how the perceived qualities of these models is going insanely down, something not reflected in benchmarks

I feel like it might be because the costs of GPU is reflecting back up and they might be having a more diluted model which makes it more dumb while still taking the 100$

I personally feel like this theory of these models slowing going down in intelligence until a new model which isn't bogged down intentionally might be of more interest than people think because my experience with even claude sonnet 3.7 when it had first launched was genuinely fascinating and gemini 3.1 premium and it really aligns with my personal experience tinkering with these models.

The AI industry feels quite scam-my to be honest and we would all be forced by IPO or index funds bending backwards to be left holding the bags :-/

It really feels like a great deception being played against the masses.

Areena_28•2d ago
Yeah, i noticed it too. Something feels off with the reasoning on complex multi-step tasks compared to a few months ago. hard to tell if it's actual regression or just expectations creeping up as you use it more.

Been mixing Opus with Sonnet depending on the task. Sonnet handles most things well enough and Opus for anything that genuinely needs deeper reasoning. Try it out, may be you find it useful

uberman•2d ago
Isn't it the expected thing that LLMs degrade over time?
UK-Al05•1h ago
Probably reducing its capabilities to make the new model look better.