frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT 5.5 sets new record in proofreading benchmark

https://revise.io/errata-bench
2•artursapek•1h ago

Comments

artursapek•1h ago
Hi HN - this is a benchmark I developed that tests various models against large samples of text, asking them to find and fix a variety of errors. Its purpose is to evaluate how good models are at proofreading (a common use case of LLMs) and how efficient they are on various axes.

I've been working on this to inform my own decisions about which models to use in my agentic word processor, but I think it's also just useful data.

I just ran GPT 5.5 and it broke Gemini's previous high score of 92.5%!

The code and run artifacts are available on Github: https://github.com/reviseio/errata-bench

Is Italy the new tax haven for the global rich?

https://www.bbc.com/worklife/article/20260421-is-italy-the-new-tax-haven-for-the-global-rich
1•andsoitis•1m ago•0 comments

Jeff Bezos is raising his game in space

https://www.economist.com/business/2026/04/23/jeff-bezos-is-raising-his-game-in-space
1•andsoitis•2m ago•0 comments

Bdelloid Rotifer

https://en.wikipedia.org/wiki/Bdelloidea
1•embedding-shape•3m ago•0 comments

Tim Cook wrote a winning recipe for Apple

https://www.economist.com/leaders/2026/04/23/tim-cook-wrote-a-winning-recipe-for-apple
1•andsoitis•3m ago•0 comments

Peter Sarnak – The Riemann Hypothesis [video]

https://www.youtube.com/watch?v=DtaFyE9BcXw
1•delhanty•6m ago•1 comments

Google is building a Claude Code challenger, Sergey Brin is involved

https://www.indiatoday.in/technology/news/story/google-is-secretly-building-a-claude-code-challen...
1•nsoonhui•12m ago•0 comments

Michael review: 'A bland and barely competent daytime TV movie'

https://www.bbc.com/culture/article/20260421-michael-review
1•dnnddidiej•21m ago•0 comments

Education must go beyond the mere production of words

https://www.ncregister.com/commentaries/schnell-repairing-the-ruins
2•signor_bosco•24m ago•0 comments

Decoupled DiLoCo for Resilient Distributed Pre-Training

https://arxiv.org/abs/2604.21428
1•matt_d•28m ago•0 comments

Serendipity Machines

https://www.shishyko.com/essays/serendipity-machines.html
1•philip1209•33m ago•0 comments

Mac-use: open-source Codex computer-use clone for your OpenClaw on Mac OS

https://github.com/TheGuyWithoutH/mac-computer-use
1•guywithnoh•37m ago•2 comments

ChatGPT ads targeting farmers (YouTube Link) [video]

https://www.youtube.com/watch?v=4rzeW4dbvlQ
1•ki4jgt•39m ago•0 comments

Prop 13 Didn't Shrink Government. It Handed It to Sacramento

https://maxmautner.com/2026/04/23/prop-13-changed-things.html
1•mslate•42m ago•0 comments

Why does the Rainbow have 7 colors?

https://glorify.com/learn/why-does-the-rainbow-have-seven-colors
2•airstrike•43m ago•0 comments

You're about to feel the AI money squeeze

https://www.theverge.com/ai-artificial-intelligence/917380/ai-monetization-anthropic-openai-token...
2•cdrnsf•45m ago•1 comments

Anthropic now requires Pro Plans to enable/purchase extra usage for Opus

https://support.claude.com/en/articles/11940350-claude-code-model-configuration
7•qdot76367•48m ago•3 comments

Context Pricing and Accounting [video]

https://www.youtube.com/watch?v=xcYhV4S7faI
1•journal•50m ago•0 comments

Chinese National Pleads Guilty to Photographing Air Force Base and Equipment

https://www.justice.gov/usao-wdmo/pr/chinese-national-pleads-guilty-unlawfully-photographing-air-...
2•737min•53m ago•3 comments

Databases Were Not Designed for This

https://arpitbhayani.me/blogs/defensive-databases/
1•mooreds•54m ago•0 comments

James Bosworth on the 'Orange Wave' Happening Across Latin America

https://www.bloomberg.com/news/articles/2026-04-24/james-bosworth-on-the-orange-wave-happening-ac...
1•mooreds•55m ago•1 comments

Alex Bores' AI Policy Framework for Congress [pdf]

https://www.alexbores.nyc/files/Bores_AI_Framework.pdf
1•mooreds•1h ago•0 comments

Andrej Karpathy's microgpt as a Triptych

https://karpathy.art/
1•stared•1h ago•0 comments

Chinese National Arrested for Illegally Photographing Military Aircraft at AFB

https://www.justice.gov/opa/pr/chinese-national-arrested-jfk-international-airport-federal-charge...
2•737min•1h ago•1 comments

Exodus, from former Mass Effect devs, couldn't look more like Mass Effect

https://www.pcgamer.com/games/rpg/exodus-the-sci-fi-rpg-from-former-mass-effect-devs-couldnt-look...
2•evo_9•1h ago•0 comments

Ancient amber reveals a true bug equipped with claws, a highly unusual feature

https://phys.org/news/2026-04-ancient-amber-reveals-true-bug.html
2•bookofjoe•1h ago•0 comments

The bull case for graph DBs in law

https://alanyahya.com/writing/bull-case-graph-dbs-law
2•alansaber•1h ago•0 comments

Microsoft offers voluntary employee buyout/retirement for 7% of U.S. workforce

https://www.cnbc.com/2026/04/23/microsoft-plans-first-voluntary-retirement-program-for-us-employe...
3•mgh2•1h ago•0 comments

Show HN: RoboAPI – A unified REST API for robots, like Stripe but for hardware

https://github.com/amitb-quantum/roboapi
1•xmas123•1h ago•1 comments

Lumitime Automata – The Most Amazing Digital Clock Is a Machine [video]

https://www.youtube.com/shorts/LTInOMdjs5o
1•thunderbong•1h ago•0 comments

David Choi's Mars FX Collapse Sparks Global Hunt for Almost $600M

https://www.bloomberg.com/news/articles/2026-04-24/hedge-fund-collapse-sparks-global-hunt-for-alm...
1•simonpure•1h ago•1 comments