frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Is GPT-5 a regression, or is it just me?

4•technocratius•1h ago
Context: I have been using GPT5 since its release over a month ago, within my Plus subscription. Before this release, I heavily relied on gpt-o3 for most complex tasks, with 4o for simple question. I use it for a mix of scientific literature websearch for e.g. understanding health related topics, the occasional coding assistance and helping me out with *nix sysadmin related tasks. Note that I have not used its API or integration with an IDE.

Based on a month of GPT5 usage, this model feels like primarily like a regression:

1. It's slow: thinking mode can take ages, and sometimes gets downright stuck. It's auto-assessment of whether or not it needs to think feels poorly tuned to most tasks and defaults too easily to going into deep reasoning mode.

2. Hallucinations are in overdrive: I would assess that in 7/10 tasks, hallucinations continuously clutter the responses and warrant corrections and careful monitoring and steering back. It hallucinates list items from your prompt that weren't there, software package functionalities/capabilities and CLI parameters etc. Even thorough prompting with explicit linking to sources, e.g. also wihtin deep research frequently goes of the rails.

3. Not self critical: even in thinking mode, it frequently spews out incorrect stuff, that a blatant "this is not correct, check your answer" can directly correct.

Note: I am not a super advanced prompt engineer, and this above assessment is mainly wrt the previous generation of models. I would expect that with progression of model capabilities, the need for users to apply careful prompt engineering goes down, not up.

I am very curious to hear your experiences.

Comments

patrakov•1h ago
You are not alone.

Another pet peeve is that it, when asked to provide several possible solutions, sometimes generates two that are identical but with different explanations.

technocratius•1h ago
Ah yes, I've had similar experiences actually. Also the variant where I ask it to provide an alternate solution/answer to the one it gave, where it than proceeds to basically regurgitate its previous answer with slight stylistic (i.e. maintaining content parity) modifications.
lcnPylGDnU4H9OF•27m ago
GPT-5 was an upgrade for investors. The primary feature of it is to use a router that will decide between a stronger model and a weaker one for a given query. The goal is to reduce operating costs without regard to improving the user experience, while they market it as "new and improved".
android521•20m ago
Hallucinations are definitely up at least 5X compared with gpt4 from my personal experience.

Show HN: Chartz.ai – Cursor for Data Analytics

https://chartz.ai
1•daolm•7m ago•0 comments

The Rise of 'Conspiracy Physics'

https://www.wsj.com/science/physics/the-rise-of-conspiracy-physics-dd79fe36
1•nsoonhui•8m ago•0 comments

Google tests forced pagination on SERPs

https://www.demandsphere.com/blog/google-tests-forced-pagination-on-serps/
1•tosh•8m ago•0 comments

Equatorial Guinea enforces yearlong internet outage for island that protested

https://apnews.com/article/equatorial-guinea-internet-shutdown-africa-d7daacc641475743972b33eafff...
1•perihelions•11m ago•0 comments

How to Boost Your Productivity While Managing Multiple Projects

https://freeter.io/blog/boost-your-productivity-while-managing-multiple-projects/
1•AlexKaul•13m ago•1 comments

Is AI taking the joy out of making things?

https://shankarganesh.blog/2025/09/14/is-ai-taking-the-happiness-away-from-making-things/
1•raknahs1991biz•13m ago•0 comments

A new approach could fractionate crude oil using less energy

https://news.mit.edu/2025/new-approach-could-fractionate-crude-oil-using-less-energy-0522
1•DocFeind•14m ago•0 comments

How fast is go? simulating particles on a smart TV

https://dgerrells.com/blog/how-fast-is-go-simulating-millions-of-particles-on-a-smart-tv
1•tweenagedream•16m ago•0 comments

Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit

https://dress-1-to-3.github.io
1•ceolin•16m ago•0 comments

A Clean, Well-Lighted Place on the Internet

https://every.to/context-window/a-clean-well-lighted-place-on-the-internet?ph_email=rbanffy%40gma...
1•rbanffy•17m ago•0 comments

TargetJS: JavaScript UI framework designed to simplify development, enhance UX

https://github.com/livetrails/targetjs
1•thunderbong•17m ago•0 comments

Create 1M AI powered apps a day

https://twitter.com/OfficialLoganK/status/1966643652240851059
1•selvan•30m ago•0 comments

How do AI models generate videos?

https://www.technologyreview.com/2025/09/12/1123562/how-do-ai-models-generate-videos/
1•pseudolus•30m ago•0 comments

Understanding GPU Architecture

https://cvw.cac.cornell.edu/gpu-architecture
1•redbell•32m ago•0 comments

Eternal-Tux: Crafting a Linux Kernel KSMBD 0-Click RCE Exploit from N-Days

https://www.willsroot.io/2025/09/ksmbd-0-click.html
1•skilled•37m ago•0 comments

Thinking in Higher Dimensions – Beautiful Visualizations by Jos Leys (2011)

https://www.youtube.com/playlist?list=PL3C690048E1531DC7
1•vismit2000•38m ago•0 comments

World Newspapers

https://world-newspapers.net/
2•sm-techq•41m ago•1 comments

Repetitive negative thinking is associated with cognitive function decline

https://bmcpsychiatry.biomedcentral.com/articles/10.1186/s12888-025-06815-2
10•redbell•47m ago•0 comments

Sketch2Anim: Transferring Sketch Storyboards into 3D Animation

https://zhongleilz.github.io/Sketch2Anim/
2•ceolin•50m ago•0 comments

The Paradox of Signaling

https://think-twice.me/?p=88
1•zug_zug•53m ago•0 comments

CorentinJ: Real-Time Voice Cloning

https://github.com/CorentinJ/Real-Time-Voice-Cloning
2•redbell•1h ago•0 comments

Desi Arnaz's Revolution Was Televised

https://reason.com/2025/09/14/desi-arnazs-revolution-was-televised/
1•mhb•1h ago•0 comments

Don't Let Your Mocks Mock You

https://revontulet.dev/p/2025-dont-let-your-mocks-mock-you/
2•ingve•1h ago•0 comments

Apple has no one left who can say no

https://world.hey.com/dhh/apple-has-no-one-left-who-can-say-no-1a542329
10•dotcoma•1h ago•2 comments

Test State, Not Interactions

http://rednafi.com/go/test_state_not_interactions/
2•rednafi•1h ago•0 comments

Show HN: Syncwave – MIT-licensed real-time Kanban board

https://github.com/syncwavedev/syncwave
1•tilyupo•1h ago•1 comments

Show HN: EZLive – lightweight serverless self-hosted livestream

https://github.com/mistivia/ezlive
2•mistivia•1h ago•0 comments

LLMs stop talking being usefull? (Thank you NetworkChuck) [video]

https://www.youtube.com/watch?v=GuTcle5edjk
2•asuarezfr•1h ago•1 comments

Why Did Righthand Travel Predominate in Colonial America?

https://highways.dot.gov/highway-history/general-highway-history/right-side-road
3•mhb•1h ago•0 comments

Rolling Stone Publisher Sues Google over AI Summaries

https://www.wsj.com/tech/ai/rolling-stone-publisher-sues-google-over-ai-summaries-3afde408
1•ushakov•1h ago•0 comments