frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

ChatGPT Will Apologize for Anything

https://www.aiweirdness.com/chatgpt-will-apologize-for-anything/
1•xnx•39s ago•0 comments

Apollo 13 Commander Jim Lovell has passed away

https://www.nasa.gov/news-release/acting-nasa-administrator-reflects-on-legacy-of-astronaut-jim-lovell/
1•LorenDB•1m ago•0 comments

Show HN: HackMaster Pi – A $30 Flipper Zero Alternative Built with Raspberry Pi

https://github.com/1PingSun/HackMaster-Pi
1•1ping•2m ago•0 comments

How to Teach Your Kids to Play Poker: Start with One Card

https://www.bloomberg.com/news/articles/2025-08-08/how-to-teach-your-kids-poker-with-one-card-at-age-four
1•ioblomov•2m ago•1 comments

ChatGPT-5 Can't Do Basic Math

3•MarcellusDrum•6m ago•0 comments

Security alerts in Gmail. What a mess

1•chrisjj•7m ago•0 comments

GPT-5 AMA

https://www.reddit.com/r/ChatGPT/s/37th7HY644
1•IdealeZahlen•8m ago•0 comments

Johns Hopkins is building its AI wargaming tools for DoD

https://breakingdefense.com/2025/08/johns-hopkins-is-building-classified-versions-of-its-ai-wargaming-tools-for-dod-ic/
1•geox•9m ago•0 comments

Fears of population collapse in the US are based on faulty assumptions

https://theconversation.com/fears-that-falling-birth-rates-in-us-could-lead-to-population-collapse-are-based-on-faulty-assumptions-261031
1•PaulHoule•9m ago•0 comments

GPT-5 Rollout Updates

https://twitter.com/sama/status/1953893841381273969
1•tosh•11m ago•0 comments

Cordoomceps – replacing an Amiga's brain with Doom

https://mjg59.dreamwidth.org/73001.html
1•LorenDB•11m ago•0 comments

Millions are flocking to grow virtual gardens in Roblox game created by teenager

https://apnews.com/article/roblox-game-grow-garden-trend-2f5e4368448d57002d08b1b3d4a289ca
1•petethomas•14m ago•1 comments

The Illustrated TLS 1.2 Connection

https://tls12.xargs.org/
1•dmazin•15m ago•0 comments

The surprising economics of the meat industry – Lewis Bollard

https://www.dwarkesh.com/p/lewis-bollard
2•paulpauper•15m ago•0 comments

Job growth has slowed sharply; the question is why

https://stayathomemacro.substack.com/p/job-growth-has-slowed-sharply-the
11•paulpauper•16m ago•3 comments

Campaigning for Extinction:Eradication of Sparrows and the Great Famine in China

https://www.nber.org/papers/w34087
1•paulpauper•16m ago•0 comments

GRETA to Open a New Eye on the Nucleus

https://newscenter.lbl.gov/2025/08/08/greta-to-open-a-new-eye-on-the-nucleus/
1•gnabgib•17m ago•0 comments

HTTP Is Not Simple

https://daniel.haxx.se/blog/2025/08/08/http-is-not-simple/
4•thunderbong•19m ago•1 comments

Looking for Testers for an AI Privacy Platform

https://scanonai.carrd.co
1•lotuslabs•20m ago•1 comments

Three Tiers of Responses to Fact

https://medium.com/on-history/three-tiers-of-responses-to-fact-9b551f2a4fb6
2•wsgeorge•22m ago•0 comments

Toxic convenience: what science tells us about plastic's hidden costs

https://www.rfi.fr/en/international/20250808-toxic-convenience-what-science-tells-us-about-plastic-s-hidden-costs
2•everybodyknows•23m ago•0 comments

ChatGPT users hate GPT-5's overworked secretary energy, miss their GPT-4o buddy

https://arstechnica.com/ai/2025/08/chatgpt-users-outraged-as-gpt-5-replaces-the-models-they-love/
5•rntn•24m ago•0 comments

Welcome to DIY Rich Guy Fantasy Camp

https://www.theglobeandmail.com/arts/article-diy-rich-guy-fantasy-camp-mandle-cheung-bezos-ackman/
2•throw0101a•27m ago•1 comments

FIN - Fish Extensible Text Editor Written in Fish

https://codeberg.org/Digit/fin/
2•ashitlerferad•27m ago•0 comments

json2dir: a JSON-to-directory converter, a fast alternative to home-manager

https://github.com/alurm/json2dir
4•alurm•27m ago•0 comments

M5 MacBook Pro No Longer Coming in 2025

https://www.macrumors.com/2025/07/10/no-m5-macbook-pro-2025/
6•behnamoh•30m ago•0 comments

(Evil)Doggie: An open-source CAN bus research and penetration testing tool

https://www.blackhat.com/us-25/arsenal/schedule/#evildoggie-a-modular-open-source-can-bus-research-and-penetration-testing-tool-45525
1•wslh•32m ago•0 comments

LVFS Sustainability Plan

https://blogs.gnome.org/hughsie/2025/08/08/lvfs-sustainability-plan/
2•Bogdanp•32m ago•0 comments

Query-Mutating Data Race in Go

https://coder.com/blog/query-mutating-data-race-in-go
3•kylecarbs•36m ago•0 comments

How Samsung Missed the AI Moment [video]

https://www.youtube.com/watch?v=wS57SInZt8g
1•mgh2•36m ago•0 comments
Open in hackernews

GPT-5 on SWE-bench: Cost and performance deep-dive

https://mini-swe-agent.com/latest/blog/2024/01/15/gpt-5-on-swe-bench-cost--performance-deep-dive/
4•lieret•2h ago

Comments

lieret•2h ago
We evaluated the new GPT models with a minimal agent on SWE-bench verified. GPT-5 scores 65%, mini 60%, nano 35%. Still behind Opus 5 (68%), on par with Sonnet 4 (65%). But a lot cheaper, especially mini!

Cost is tricky to compare with agents, because agents succeed fast, but fail slowly. If an agent doesn't succeed, it should just continue trying until it succeeds, or hits a run time limit. And that's (almost) what happens.

But even so, it's very clear that

1. GPT-5 is cheaper than Sonnet 4 2. GPT-5-mini is _incredibly_ cheap for what it provides (you only sacrifice some 5%pts, but end up paying maybe 1/5th of the total cost)

All of the code to reproduce our numbers is open-source. There's a box on the bottom with the exact command to run in order to reproduce our numbers.

Also very happy to answer questions here!

techpineapple•2h ago
I'm curious if this might help Cursor's lighting money on fire problem?

https://pivot-to-ai.com/2025/07/09/cursor-tries-setting-less...

is this enough of a price difference to make cursor profitable?

lieret•2h ago
I think gpt-5-mini should really help them. At least from these benchmark scores, there probably shouldn't be a huge performance degradation for letting gpt-5-mini drive most of the workflow. Of course users might still want to just run with latest and greatest (but still gpt-5 will be cheaper I think)