frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Can Europe get kids off social media?

https://www.ft.com/content/cf465c21-4789-490b-b328-41f6383567d7
1•thm•1m ago•0 comments

I Built a NAS (Buildlog)

https://arne.me/blog/buildlog-nas
1•abahlo•1m ago•0 comments

Making Software: How do computers store data?

https://www.makingsoftware.com/chapters/how-is-data-stored
1•Garbage•3m ago•0 comments

A timeline of claims about AI/LLMs

https://blog.nethuml.xyz/posts/2026/02/timeline-of-claims-about-ai-llms/
1•nethuml•5m ago•0 comments

Freeciv 3D with hex map tiles and WebGPU renderer

https://freecivworld.net/
1•roschdal•6m ago•0 comments

SpaceX-xAI Merger: Nobody's Talking About the von Neumann Elephant in the Room

1•juanpabloaj•10m ago•0 comments

Smart Homes Are Terrible

https://www.theatlantic.com/ideas/2026/02/smart-homes-technology/685867/
4•aarghh•14m ago•0 comments

Ask HN: Would you use an ESLint-like tool for SEO that fails your CI/CD build?

1•YannBuilds•15m ago•0 comments

Praise for Price Gouging

https://www.grumpy-economist.com/p/praise-for-price-gouging
1•mhb•18m ago•0 comments

Open source infra orchestrator agent clanker CLI

https://github.com/bgdnvk/clanker
1•tekbog•20m ago•0 comments

Lance table format explained simply, stupid (Animated)

https://tontinton.com/posts/lance/
1•tontinton•21m ago•0 comments

Solving Soma

https://anekstein.com/posts/2026-02-01-blocker
1•davidanekstein•21m ago•0 comments

We built a cloud platform for agentic software (our virtualization, etc.)

https://agentuity.com/
1•rblalock•21m ago•2 comments

Show HN: WLM-SLP – A 0D-27D Structural Language for Multi-Agent Alignment

https://github.com/gavingu2255-ai/WLM-Open-Source/blob/main/README.md
1•WujieGuGavin•21m ago•0 comments

Former Tumblr Head Jeff D'Onofrio Steps in as Acting CEO at the Washington Post

https://www.theverge.com/tech/875433/tumblr-jeff-donofrio-ceo-washington-post-layoffs
2•bookofjoe•25m ago•0 comments

Bounded Flexible Arrays in C

https://people.kernel.org/kees/bounded-flexible-arrays-in-c
1•fanf2•25m ago•0 comments

The Invisible Labor Force Powering AI

https://cacm.acm.org/news/the-invisible-labor-force-powering-ai/
1•pseudolus•27m ago•0 comments

Reading Recursion via Pascal

https://journal.paoloamoroso.com/reading-recursion-via-pascal
1•AlexeyBrin•27m ago•0 comments

Show HN: I made a website that finds patterns on your spreadsheet

https://analyzetable.com
1•kouhxp•28m ago•0 comments

Jokes on You AI: Turning the Tables – LLMs for Learning

https://www.dev-log.me/jokes_on_you_ai_llms_for_learning/
1•wazHFsRy•28m ago•0 comments

You don't need RAG in 2026

https://ryanlineng.substack.com/p/you-dont-need-rag-in-2026
1•kareninoverseas•30m ago•0 comments

WatchLLM – Cost kill switch for AI agents (with loop detection)

https://www.watchllm.dev/
1•Kaadz•33m ago•2 comments

I turned myself into an AI-generated deathbot – here's what I found

https://www.bbc.com/news/articles/c93wjywz5p5o
1•cmsefton•44m ago•1 comments

Management style doesn't predict survival

https://orchidfiles.com/management-style-doesnt-predict-survival/
1•theorchid•44m ago•0 comments

One Generation Runs the Country. The Next Cashed in on Crypto

https://www.wsj.com/finance/currencies/trump-sons-crypto-billions-1e7f1414
1•impish9208•46m ago•1 comments

"I Was Wrong": Why the Civil War Is Running Late [video][2h21m]

https://www.youtube.com/watch?v=RDmkKZ7vAkI
1•Bender•47m ago•0 comments

Show HN: A sandboxed execution environment for AI agents via WASM

https://github.com/Parassharmaa/agent-sandbox
1•paraaz•49m ago•0 comments

Wine-Staging 11.2 Brings More Patches to Help Adobe Photoshop on Linux

https://www.phoronix.com/news/Wine-Staging-11.2
2•doener•49m ago•0 comments

The Nature of the Beast

https://cinemasojourns.com/2026/02/07/the-nature-of-the-beast/
1•jjgreen•50m ago•0 comments

From Prediction to Compilation: A Manifesto for Intrinsically Reliable AI

1•JanusPater•50m ago•0 comments
Open in hackernews

The Path to Medical Superintelligence

https://microsoft.ai/new/the-path-to-medical-superintelligence/
10•brandonb•7mo ago

Comments

PaulHoule•7mo ago
I was doing a comparative analysis of the acquistion strategies of various "big tech" firms and was a little startled that I missed Microsoft's 2022 acquistion of Nuance, largely for its speech recognition systems aimed at the medical sector:

https://news.microsoft.com/source/2022/03/04/microsoft-compl...

gm678•7mo ago
> Microsoft AI Diagnostic Orchestrator (MAI-DxO) correctly diagnoses up to 85% of NEJM case proceedings, a rate more than four times higher than a group of experienced physicians.

> Clinicians in our study worked without access to colleagues, textbooks, or even generative AI, which may feature in their normal clinical practice.

1. As I understand, it's very common for doctors to fall back on reference material in their practice, especially for the most complex cases. If all access to resources was cut off (as seems to be implied by the second quote), the comparison seems somewhat unfair.

2. What were the publication dates of the case records? I can't find this information, and it makes a difference if the NEJM case studies were in the LLMs' training data.

miraculixx•7mo ago
Exactly. The study has been set up to produce this exact result. They essentially limited the human doctors to bare essentials, on specialist cases(!), while providing the LLMs with all sorts of help, including discussion among several AIs.

That's like letting one group of students have a strict closed-book exam, while another group can take the test as a group exercise and accessing any material they like, then claiming that closed-book exams lead to worse outcomes.

In a nutshell the study is just slop designed to get attention. The headline result is what they really want people to hear, and that's all the media will be repeating.

miraculixx•7mo ago
As any AI researcher knows, if you have a model that does 4x better than the naive baseline (the humans, in this case), you are likely looking at overfit, not real-life performance. This study is just slop, and you can tell so by the mere fact that they did not submit a paper, but just published a PR article.
LargoLasskhyfv•7mo ago
They didn't? What am I looking at, then?

https://arxiv.org/abs/2506.22405

This appears when you click on 'View Publication' in the article near the end, right before Q&A.

brandonb•7mo ago
In the paper, they say they used the most recent 56 cases (from 2024–2025) as a holdout set. The majority of those cases happened after the o4 training cutoff of May 31, 2024.
miraculixx•7mo ago
Are these 56 cases distinct from all other cases in the data?
FlyingLawnmower•7mo ago
Yes. They are about entirely different patient reports.