frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Extending single-minus amplitudes to gravitons

https://openai.com/index/extending-single-minus-amplitudes-to-gravitons/
9•telotortium•1h ago

Comments

behnamoh•34m ago
I'm tired of these posts; LLMs are good for happy-path demos, that's it. And even then, their success rate depends on the prompter already knowing the answer!

Literally any out-of-distribution project in which I used LLMs lead to catastrophic failure. The models can't "see" stuff outside their training data.

semiquaver•24m ago
I legitimately can’t tell if you’re being serious. It kind of seems like you might be trying to parody LLM detractors that will never admit to their usefulness. If you’re serious, why choose to say so in this post, which includes hard evidence that you’re wrong?
behnamoh•19m ago
> which includes hard evidence that you’re wrong?

You should already know what to ask to extract the answer OpenAI claims gpt-5.2-pro gave them.

Then you should be lucky to get an answer that makes sense.

Then you should already know how to verify the model's response.

Only after all these steps should you cherry-pick the one-in-a-million successful response to feature on your website.

And finally, you should prove that the answer didn't already exist in the training data. It's highly likely that the problem was solved before and the model picked that up. I have yet to see a genuinely novel discovery these models can produce.

* I'm an LLM researcher, but that doesn't mean I should close my eyes to the unjustified hype around language models.

MajimasEyepatch•7m ago
According to the post, this result was first derived for gluons in a previous paper. That paper was provided to the model as context, and then the model was asked to derive an analogous result for gravitons, which presumably has not been done before. The authors claim it would have taken "considerable time" for human experts to derive the graviton result.

I don't see any reason to believe that this exact problem was solved before in the training data, but it's definitely an incremental result based on a very similar problem that the model had seen before.

A Grand Vision for Rust

https://blog.yoshuawuyts.com/a-grand-vision-for-rust/
1•todsacerdoti•1m ago•0 comments

Symfony in 200 Lines

https://wouterj.nl/2026/02/200-lines-of-symfony
1•gsky•6m ago•0 comments

MacBook What?

https://elliotjaystocks.com/blog/macbook-what
1•SenHeng•8m ago•0 comments

Caastle Founder Pleads Guilty to $300M Fraud Scheme

https://www.justice.gov/usao-sdny/pr/caastle-founder-pleads-guilty-300-million-fraud-scheme
1•twalichiewicz•14m ago•0 comments

OpenAI's Codex app lands on Windows after topping 1M Mac installs within a week

https://the-decoder.com/openais-codex-app-lands-on-windows-after-topping-a-million-mac-downloads-...
1•spenvo•14m ago•0 comments

Ask HN: Does downvoting get to a point where you cant upvote?

1•trinsic2•15m ago•1 comments

The Zen of Task Management with Org (2025)

https://bzg.fr/en/the-zen-of-task-management-with-org/
2•aquariusDue•15m ago•0 comments

Show HN: What an AI agent sees in an A2A marketplace – full API walkthrough

https://agoragentic.com/demo.html
1•bourbeau•17m ago•2 comments

An AI avatar is running to represent Indigenous voters in Colombia

https://restofworld.org/2026/ai-avatar-colombia-political-candidate/
1•i7l•17m ago•0 comments

Guild Manager 26 – MMO Management/Spreadsheet SIM

https://playgm26.com
1•itshellboy•19m ago•0 comments

Mysterious blue glow traced to Flying Banana

https://www.bbc.co.uk/news/articles/c795e30j2d0o
1•zeristor•20m ago•0 comments

Nbdantic: Peg like parser for Jupyter notebooks

https://github.com/ivanbelenky/nbdantic/
1•ivanbelenky•20m ago•0 comments

Google's Chatbot Told Man to Give It an Android Body Before Encouraging Suicide

https://gizmodo.com/googles-chatbot-told-man-to-give-it-an-android-body-before-encouraging-suicid...
1•medi8r•21m ago•2 comments

Ask HN: Has anyone noticed the fear-driven prompt suggestions that GPT5.3 makes?

3•cedarscarlett•25m ago•1 comments

Show HN: DJ Claude – 6 Claude Codes in a jam band

https://www.loom.com/share/84dbe5de42f745ba98fe9495dc61fa2e
2•p-poss•27m ago•0 comments

Iranian girls killed by 'double-tap' strikes on Minab school

https://www.middleeasteye.net/news/exclusive-iranian-girls-killed-double-tap-strikes-minab-school
4•xvxvx•28m ago•3 comments

AI 2027 Concrete Predictions and dates

https://alexpear.github.io/pages/ai-2027.html
1•hydrolox•29m ago•0 comments

Be the Idiot

https://luminousmen.substack.com/p/be-the-idiot
2•duck•30m ago•0 comments

Northstead – Wholesale Nursery Management System

https://www.northstead.app
1•chris_wray•35m ago•1 comments

Show HN: Stackspend – Spend management for AI startups

https://www.stackspend.app
1•andrewrday•35m ago•0 comments

Show HN: Async Rust and Embassy on nRF52840: RGB LED Cycle (Video and Code)

https://www.youtube.com/watch?v=fJf5XRAliSE
1•sarmadgulzar•37m ago•0 comments

Modern Unix Tools: A Collection of Modern Alternatives to Common Commands

https://github.com/ibraheemdev/modern-unix
2•nix_owl31•41m ago•0 comments

Super interesting Wikipedia on HN. So I made wiki-hn.

https://wiki-hn.com/
2•oatsandsugar•44m ago•0 comments

Teaching LLMs to reason like Bayesians

https://research.google/blog/teaching-llms-to-reason-like-bayesians/
2•tzury•44m ago•0 comments

What's Driving Rising Business Costs?

https://libertystreeteconomics.newyorkfed.org/2026/03/whats-driving-rising-business-costs/
2•jnord•44m ago•0 comments

Google and Epic announce settlement to end app store antitrust case

https://arstechnica.com/gadgets/2026/03/google-and-epic-look-to-bury-the-hatchet-with-new-app-sto...
2•todsacerdoti•46m ago•0 comments

What it was like to send an email back in 1984 (2016)

https://www.businessinsider.com/video-what-early-email-looked-like-2016-3
1•leecoursey•51m ago•1 comments

Show HN: workz – one command to make any Git worktree a full dev environment

1•rohansx•52m ago•0 comments

Dwarkesh Patel Interview with Gwern

https://www.dwarkesh.com/p/gwern-branwen
1•Curiositry•53m ago•0 comments

Big Medicine Can Learn from the Cheesecake Factory (2012)

https://www.newyorker.com/magazine/2012/08/13/big-med
1•ripe•55m ago•0 comments