frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Has ChatGPT-5.1 Regressed?

1•pedrozieg•46m ago
I’ve noticed that the quality of ChatGPT-5.1 occasionally drops substantially. I’m talking GPT-3 level hallucinations - wildly making stuff up or randomly inserting words in a language I do not speak.

In my repeat evaluations on the same datasets the scores are all over the place, sometimes scoring really high and sometimes doing very badly.

Has anyone experienced something similar?

I’m guessing this may be because “GPT-5.1” can sometimes choose to use a much smaller model, but for production use this makes it unreliable.

Comments

xXSLAYERXx•26m ago
I'm mainly using it for rewriting or helping me understand legacy code and to me 5.1 is the best yet.
chistev•3m ago
I think ChatGPT as a whole has regressed.

Show HN: Foggo – CLI Tool for Auto Generation of Go's Functional Option Pattern

https://github.com/rikeda71/foggo
1•rikeda71•28s ago•0 comments

Show HN: Gentoro OneMCP – open-source layer for accurate API calls by AI agents

https://github.com/Gentoro-OneMCP/onemcp
1•GentoroAI•37s ago•0 comments

Millions of children and teens lose access to social media accounts in Australia

https://www.theguardian.com/australia-news/2025/dec/09/australia-under-16-social-media-ban-begins...
1•bookofjoe•1m ago•0 comments

Show HN: Milkie – Drop-In Stripe Paywall for Next.js

https://github.com/akcho/milkie
1•akcho•2m ago•0 comments

Bazzite: A Gem for Linux Gamers

https://lwn.net/SubscriberLink/1046228/c2fcc84b4f159b6a/
1•askl•2m ago•0 comments

FedCM Action Required: Upcoming breaking changes

https://groups.google.com/g/fedcm-developer-newsletter/c/B1uzU6MzrYY/m/_6gqvaHzCgAJ
1•mooreds•3m ago•0 comments

Ransomware IAB abuses EDR for stealthy malware execution

https://www.bleepingcomputer.com/news/security/ransomware-iab-abuses-edr-for-stealthy-malware-exe...
1•fleahunter•3m ago•0 comments

Gemini 3 fails at simple geometry

1•zkmon•3m ago•0 comments

The story of Erdős problem #1026

https://terrytao.wordpress.com/2025/12/08/the-story-of-erdos-problem-126/
1•signa11•6m ago•0 comments

Learning to See (2013)

https://ia.net/topics/learning-to-see
1•levmiseri•6m ago•0 comments

Why would anyone still choose MVC over Blazor with server-side rendering

https://www.reddit.com/r/dotnet/s/EW4AN1XwgI
1•stsrki•6m ago•0 comments

Traits of a Good Tech Lead

https://world.hey.com/joaoqalves/traits-of-a-good-tech-lead-b5cac0ae
1•joaoqalves•8m ago•0 comments

Diaphragm-based CO electrolyzers for alkaline multicarbon production

https://dx.doi.org/10.1038/s41467-025-63004-1
1•PaulHoule•9m ago•0 comments

Tracking Phone Numbers via WhatsApp and Signal: Open-Source PoC

https://arxiv.org/abs/2411.11194
1•birdculture•9m ago•0 comments

Anthropic's Claude Code can now read your Slack messages and write code for you

https://venturebeat.com/ai/anthropics-claude-code-can-now-read-your-slack-messages-and-write-code...
1•ywnzzn•9m ago•0 comments

US Job Openings Climbed in October to a Five-Month High

https://www.bloomberg.com/news/articles/2025-12-09/us-job-openings-climbed-in-october-to-a-five-m...
1•toomuchtodo•10m ago•1 comments

Microsoft to invest $17.5B in India, CEO Nadella says

https://www.reuters.com/world/india/microsoft-invest-175-billion-india-ceo-nadella-says-2025-12-09/
1•alephnerd•10m ago•0 comments

America's Grip on Oil Weakens – Saudi Arabia's New Partnership [video]

https://www.youtube.com/watch?v=d9FzXGVHzd4
3•thelastgallon•11m ago•0 comments

Show HN: A simple natural-language scheduling tool for finding a time to meet

https://www.schedulegend.com
1•paul_brook•11m ago•0 comments

American Data Centers

https://tech.marksblogg.com/american-data-centers.html
1•marklit•12m ago•0 comments

Did 37Signals Just Accidentally Make Writebook Open Source?

https://kerrick.blog/posts/2025/did-37signals-just-accidentally-make-writebook-open-source/
2•speckx•12m ago•0 comments

Divyam-LLM-interop:LLM responses,requests translation across APIs and models

1•omkarashish•12m ago•0 comments

The Quest Toward That Perfect Compiler – ACM SPLASH / OOPSLA 2025 Keynote [video]

https://www.youtube.com/watch?v=Af70DptYlYQ
1•matt_d•12m ago•0 comments

What's Next? Clippy Copilot?

2•johnnyballgame•14m ago•1 comments

Social Signals 2025_v5

https://docs.google.com/presentation/d/1aSrP4Pojh7EZde8EZL5kTOH_DAY0SoXzEWF6o2RmFg4/edit?slide=id...
1•ericzawo•14m ago•0 comments

THEA1200 is a full-size working Amiga replica

https://www.theregister.com/2025/11/14/thea1200_fullsize_amiga_replica/
3•rbanffy•15m ago•0 comments

RIP Raul Malo (The Mavericks)

https://apnews.com/article/raul-malo-dies-mavericks-d31ac0c4e1a77c85fcd800f71f079eb7
1•pauseandplay•15m ago•1 comments

Shell permission errors for busy coding agents

https://www.da.vidbuchanan.co.uk/blog/agent-perms.html
2•cirwin•16m ago•0 comments

Show HN: Celeste – The 'Requests' for AI: Any provider, any capability

https://github.com/withceleste/celeste-python
1•Kamilbenkirane•16m ago•1 comments

A new nuclear 'island' where magic numbers break down

https://phys.org/news/2025-12-nuclear-island-magic.html
3•rbanffy•17m ago•0 comments