frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Synchron vs. Neuralink: BCI Brain Implants for Thought-Control [video]

https://www.youtube.com/watch?v=BGj4pYfgr5w
1•mgh2•5m ago•0 comments

Consent for Hackers, Negotiating consent based on the HTTP protocol [video]

https://media.ccc.de/v/why2025-3-consent-for-hackers-negotiating-consent-based-on-the-http-protocol
1•marvinborner•6m ago•0 comments

Effects of Anthropogenic Vibratory Noise on Plant Development and Herbivory

https://www.mdpi.com/2624-599X/7/3/45
1•PaulHoule•10m ago•0 comments

Try and

https://ygdp.yale.edu/phenomena/try-and
1•treetalker•11m ago•1 comments

Wall Street and AI Startups Are Fighting over Entry-Level Quants

https://www.bloomberg.com/news/articles/2025-08-08/open-ai-perplexity-make-pitch-to-recruit-quant-traders-from-banks
1•Terretta•11m ago•1 comments

The rise of America's intangible economy

https://www.ft.com/content/38c3ccd8-3aa0-4dbb-a832-00177c40996c
2•hhs•12m ago•0 comments

Ask HN: Advice for someone who wants to try AI-assisted coding?

1•inglor_cz•12m ago•0 comments

Design Patterns for Securing LLM Agents Against Prompt Injections

https://arxiv.org/abs/2506.08837
1•pyman•14m ago•0 comments

The Ottoman Mirror

https://worldhistory.substack.com/p/the-ottoman-mirror
1•crescit_eundo•14m ago•0 comments

DEF CON hackers plug security holes in US water systems amid tsunami of threats

https://www.theregister.com/2025/08/10/def_con_hackers_water_security/
1•rntn•17m ago•0 comments

Buy now pay later for your annual software subscriptions

1•bfayyumii•17m ago•1 comments

Mesmerizing Hypnoloid, a Kinetic Desktop Sculpture

https://www.core77.com/posts/138054/This-Mesmerizing-Hypnoloid-a-Kinetic-Desktop-Sculpture
1•surprisetalk•23m ago•0 comments

I'll pay you $100k to get me married

https://aella.substack.com/p/ill-pay-you-100k-to-get-me-married
1•surprisetalk•24m ago•1 comments

Angus Willows designs a better coat hanger

https://www.core77.com/posts/137963/Product-Design-Students-Self-Project-Leads-to-Massive-Business-Success
1•surprisetalk•24m ago•0 comments

Platform/community for passionate hobby OS enthusiasts

https://oshub.org
3•joexbayer•24m ago•1 comments

Site Scout

https://github.com/hashhooshy/site-scout
1•hashhooshys•27m ago•0 comments

Inside OS/2

https://gitpi.us/article-archive/inside-os2/
16•rbanffy•28m ago•4 comments

Mixing Regolith with Polymer Saves Mass for 3D Printing – Universe Today

https://www.universetoday.com/articles/mixing-regolith-with-polymer-saves-mass-for-3d-printing
1•rbanffy•28m ago•0 comments

Can we just have one day when no one mentions AI?

https://www.ft.com/content/94481c96-03f2-420e-a17c-64394804bd04
1•bookofjoe•30m ago•1 comments

The Tycoons Who Profit from India's Thirst for Russian Oil

https://www.nytimes.com/2025/08/09/business/india-russian-oil-ambani.html
1•ripe•34m ago•0 comments

Computer Networking Resources for DevOps and Software Engineers

https://leandromoreira.com/2021/12/16/computer-networking-resources-for-devops-and-software-engineers/
2•dreampeppers99•37m ago•0 comments

Avatarl: Training langauge models from scratch with pure RL

https://tokenbender.com/post.html?id=avatarl
2•krkartikay•39m ago•0 comments

Show HN: Interactive Map and Smart Search – Explore Wonosobo

https://explorewonosobo.com/
1•harimurti•40m ago•0 comments

Fermi Question of the Day: How many babies were born worldwide in 2024?

https://www.fermiquestions.org/#/2025-08-10
1•danielfetz•45m ago•0 comments

Sex is getting scrubbed from the internet; billionaires can sell you AI nudes

https://www.theverge.com/internet-censorship/756831/grok-spicy-videos-nonconsensual-deepfakes-online-safety
8•microsoftedging•46m ago•2 comments

Show HN: QRCodes Shaped with Text/Images – NitroQR

https://nitroqr.com
1•bhasinanant•46m ago•1 comments

Venomous Lionfish invading the Mediterranean. Best control may be to eat it

https://www.washingtonpost.com/climate-environment/2025/08/09/lionfish-invasive-mediterranean-diet-food/
2•perihelions•50m ago•0 comments

MCP: An (Accidentally) Universal Plugin System

https://worksonmymachine.ai/p/mcp-an-accidentally-universal-plugin
4•azhenley•50m ago•1 comments

How Kentucky bourbon went from boom to bust

https://www.bbc.com/news/articles/ckglnk6yxlko
1•bookofjoe•50m ago•0 comments

Imagining the psychology of aliens evolved from ambush predators (2023)

https://old.reddit.com/r/slatestarcodex/comments/16a9m2h/imagining_the_psychology_of_aliens_evolved_from/
1•optimalsolver•55m ago•0 comments
Open in hackernews

GPT5 is worse than 4.1-mini for text and worse than Sonnet 4 for coding

5•hitradostava•3h ago
It seems that OpenAI have got the PR machine working amazingly. The Cursor CEO said it's the best, as did Simon Willison (https://simonwillison.net/2025/Aug/7/gpt-5/).

But I've found it terrible. For coding (in Cursor), it's slow, fails with tool calls often (no MCP just stock Cursor tools) and stored some new application state in globalThis - something that no model has ever attempted to do in over a year of very heavy Cursor / Claude Code use).

For a summarization/insights API that I work on, it was way worse than gpt-4.1-mini. I tried both mini and full gpt5, with different reasoning settings. It didn't follow instructions, and output was worse across all my evals, even after heavy prompt adjustment. I did a lot of sampling and the results were objectively bad.

Am I the only one? Has anyone seen actual real-world benefits of GPT-5 vs other models?

Comments

cranberryturkey•3h ago
it solved a huge bug i've been struggling with.
hitradostava•3h ago
Had Sonnet 4 not been able to?
revskill•3h ago
Sure.
cranberryturkey•3h ago
No, it kept going in circles....spent like 3 weeks trying to fix it. Got access to gpt5 yesterday and all major bugs are resolved.
wdb•1h ago
Interesting I tried it to fix some unit tests that were failing but made the problem worse. Sonnet was able to fix the failing unit tests and the new problems introduced by GPT5. I used Claude Code for Sonnet and Cursor Agent for GPT-5. Maybe Cursor Agent is just bad?
cranberryturkey•31m ago
I don't know I use roocode.
tim_angus•2h ago
And yet the media keeps using the term "exponential improvement"...
8thcross•1h ago
I tried it with cursor-agent, their cli - and it generated better code than expected. YMMV. It was more thoughtful and strategic than the other frontier models.
hitradostava•1h ago
Planning was ok for me, much slower than Sonnet, but comparable. But some of the code it produces is just terrible. Maybe the routing layer sends some code-generation tasks to a much smaller model- but then I don't get why it's so slow!

The only thing that seems better to me is the parallel tool calling.

canerdogan•1h ago
GPT-5 isn’t really a brand-new model in the way people think. From what I’ve seen, the goal was more about reducing costs and unifying the interface than releasing a totally different architecture. Under the hood it is still routing to models we already know, just picking what it thinks will give the “best” result for the request.

That can be fine for a lot of general use cases, but if you’re working in specific domains like coding agents or high-precision summarization, that routing can actually make results worse compared to sticking with a model you know performs well for your workload.

hitradostava•41m ago
Thats not what OpenAI are claiming. They are claiming that there are two new flagship models and a router that routes between them.

"GPT‑5 is a unified system with a smart, efficient model that answers most questions, a deeper reasoning model (GPT‑5 thinking) for harder problems, and a real‑time router that quickly decides which to use"

softwaredoug•1h ago
I feel like they should have let GPT 5 overlap in experimental mode for a month or so. It took a while to get the kinks out of GPT-4 until people trusted it. Just switching it on is really hurting their brand.

The fact they didn’t do this makes me think their finances are in very bad shape.

hitradostava•1h ago
I agree, I just don't understand how the team at Cursor can say this:

“GPT-5 is the smartest coding model we've used. Our team has found GPT-5 to be remarkably intelligent, easy to steer, and even to have a personality we haven’t seen in any other model. It not only catches tricky, deeply-hidden bugs but can also run long, multi-turn background agents to see complex tasks through to the finish—the kinds of problems that used to leave other models stuck. It’s become our daily driver for everything from scoping and planning PRs to completing end-to-end builds.”

The cynic in me thinks that Cursor had to give positive PR in order to secure better pricing...