frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Benchmarking GPT-5

https://www.coderabbit.ai/blog/benchmarking-gpt-5-why-its-a-generational-leap-in-reasoning
9•aravindputrevu•2h ago

Comments

aravindputrevu•2h ago
We put GPT-5 through our Golden PR Dataset.

Here is the TL;DR

- GPT-5 outperformed Opus-4, Sonnet-4, and OpenAI’s O3 across a battery of 300 varying difficulty, error-diverse pull requests.

- GPT-5 scored highest on our comprehensive test and found 254 out of 300 bugs or 85% where other models found between 200 and 207 – 16% to 22% less.

- On our 25 hardest PRs from our evaluation dataset, GPT-5 achieved the highest ever overall pass rate (77.3%), representing a 190% improvement over Sonnet-4, 132% over Opus-4, and 76% over O3.

OpenAI bringing back GPT-4o to ChatGPT Plus users

https://old.reddit.com/r/ChatGPT/comments/1mkae1l/comment/n7nelhh/
2•rob•4m ago•0 comments

Show HN: New Angular OpenAPI Client gen (looking for testers)

https://ng-openapi.dev/
1•tjami•5m ago•0 comments

Ask HN: Does No Response Mean a Bad Idea?

1•samehsbs•5m ago•1 comments

Jim Lovell Has Died

https://en.wikipedia.org/wiki/Jim_Lovell
1•ColinWright•7m ago•1 comments

ChatGPT Will Apologize for Anything

https://www.aiweirdness.com/chatgpt-will-apologize-for-anything/
2•xnx•7m ago•0 comments

Apollo 13 Commander Jim Lovell has passed away

https://www.nasa.gov/news-release/acting-nasa-administrator-reflects-on-legacy-of-astronaut-jim-lovell/
3•LorenDB•8m ago•0 comments

Show HN: HackMaster Pi – A $30 Flipper Zero Alternative Built with Raspberry Pi

https://github.com/1PingSun/HackMaster-Pi
1•1ping•9m ago•0 comments

How to Teach Your Kids to Play Poker: Start with One Card

https://www.bloomberg.com/news/articles/2025-08-08/how-to-teach-your-kids-poker-with-one-card-at-age-four
1•ioblomov•9m ago•1 comments

ChatGPT-5 Can't Do Basic Math

5•MarcellusDrum•13m ago•0 comments

Security alerts in Gmail. What a mess

2•chrisjj•14m ago•0 comments

GPT-5 AMA

https://www.reddit.com/r/ChatGPT/s/37th7HY644
2•IdealeZahlen•15m ago•0 comments

Johns Hopkins is building its AI wargaming tools for DoD

https://breakingdefense.com/2025/08/johns-hopkins-is-building-classified-versions-of-its-ai-wargaming-tools-for-dod-ic/
1•geox•15m ago•0 comments

Fears of population collapse in the US are based on faulty assumptions

https://theconversation.com/fears-that-falling-birth-rates-in-us-could-lead-to-population-collapse-are-based-on-faulty-assumptions-261031
1•PaulHoule•16m ago•0 comments

GPT-5 Rollout Updates

https://twitter.com/sama/status/1953893841381273969
3•tosh•18m ago•0 comments

Cordoomceps – replacing an Amiga's brain with Doom

https://mjg59.dreamwidth.org/73001.html
1•LorenDB•18m ago•0 comments

Millions are flocking to grow virtual gardens in Roblox game created by teenager

https://apnews.com/article/roblox-game-grow-garden-trend-2f5e4368448d57002d08b1b3d4a289ca
1•petethomas•21m ago•1 comments

The Illustrated TLS 1.2 Connection

https://tls12.xargs.org/
1•dmazin•22m ago•0 comments

The surprising economics of the meat industry – Lewis Bollard

https://www.dwarkesh.com/p/lewis-bollard
2•paulpauper•22m ago•0 comments

Job growth has slowed sharply; the question is why

https://stayathomemacro.substack.com/p/job-growth-has-slowed-sharply-the
14•paulpauper•22m ago•5 comments

Campaigning for Extinction:Eradication of Sparrows and the Great Famine in China

https://www.nber.org/papers/w34087
1•paulpauper•23m ago•0 comments

GRETA to Open a New Eye on the Nucleus

https://newscenter.lbl.gov/2025/08/08/greta-to-open-a-new-eye-on-the-nucleus/
1•gnabgib•23m ago•0 comments

HTTP Is Not Simple

https://daniel.haxx.se/blog/2025/08/08/http-is-not-simple/
4•thunderbong•25m ago•1 comments

Looking for Testers for an AI Privacy Platform

https://scanonai.carrd.co
1•lotuslabs•26m ago•1 comments

Three Tiers of Responses to Fact

https://medium.com/on-history/three-tiers-of-responses-to-fact-9b551f2a4fb6
2•wsgeorge•29m ago•0 comments

Toxic convenience: what science tells us about plastic's hidden costs

https://www.rfi.fr/en/international/20250808-toxic-convenience-what-science-tells-us-about-plastic-s-hidden-costs
2•everybodyknows•30m ago•0 comments

ChatGPT users hate GPT-5's overworked secretary energy, miss their GPT-4o buddy

https://arstechnica.com/ai/2025/08/chatgpt-users-outraged-as-gpt-5-replaces-the-models-they-love/
6•rntn•31m ago•0 comments

Welcome to DIY Rich Guy Fantasy Camp

https://www.theglobeandmail.com/arts/article-diy-rich-guy-fantasy-camp-mandle-cheung-bezos-ackman/
2•throw0101a•34m ago•1 comments

FIN - Fish Extensible Text Editor Written in Fish

https://codeberg.org/Digit/fin/
2•ashitlerferad•34m ago•0 comments

json2dir: a JSON-to-directory converter, a fast alternative to home-manager

https://github.com/alurm/json2dir
7•alurm•34m ago•1 comments

M5 MacBook Pro No Longer Coming in 2025

https://www.macrumors.com/2025/07/10/no-m5-macbook-pro-2025/
9•behnamoh•37m ago•0 comments