frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

The Path to Medical Superintelligence

https://microsoft.ai/new/the-path-to-medical-superintelligence/
7•brandonb•3h ago

Comments

PaulHoule•3h ago
I was doing a comparative analysis of the acquistion strategies of various "big tech" firms and was a little startled that I missed Microsoft's 2022 acquistion of Nuance, largely for its speech recognition systems aimed at the medical sector:

https://news.microsoft.com/source/2022/03/04/microsoft-compl...

gm678•2h ago
> Microsoft AI Diagnostic Orchestrator (MAI-DxO) correctly diagnoses up to 85% of NEJM case proceedings, a rate more than four times higher than a group of experienced physicians.

> Clinicians in our study worked without access to colleagues, textbooks, or even generative AI, which may feature in their normal clinical practice.

1. As I understand, it's very common for doctors to fall back on reference material in their practice, especially for the most complex cases. If all access to resources was cut off (as seems to be implied by the second quote), the comparison seems somewhat unfair.

2. What were the publication dates of the case records? I can't find this information, and it makes a difference if the NEJM case studies were in the LLMs' training data.

miraculixx•1h ago
Exactly. The study has been set up to produce this exact result. They essentially limited the human doctors to bare essentials, on specialist cases(!), while providing the LLMs with all sorts of help, including discussion among several AIs.

That's like letting one group of students have a strict closed-book exam, while another group can take the test as a group exercise and accessing any material they like, then claiming that closed-book exams lead to worse outcomes.

In a nutshell the study is just slop designed to get attention. The headline result is what they really want people to hear, and that's all the media will be repeating.

miraculixx•1h ago
As any AI researcher knows, if you have a model that does 4x better than the naive baseline (the humans, in this case), you are likely looking at overfit, not real-life performance. This study is just slop, and you can tell so by the mere fact that they did not submit a paper, but just published a PR article.
LargoLasskhyfv•38m ago
They didn't? What am I looking at, then?

https://arxiv.org/abs/2506.22405

This appears when you click on 'View Publication' in the article near the end, right before Q&A.

A vision researcher's guide to some RL stuff: PPO and GRPO

https://yugeten.github.io/posts/2025/01/ppogrpo/
1•fzliu•3m ago•0 comments

Insider Trading on SEC Filings

https://www.bloomberg.com/opinion/newsletters/2025-06-30/insider-trading-on-sec-filings
1•ioblomov•3m ago•0 comments

High-Severity Vulnerability in Notepad++

https://www.csa.gov.sg/alerts-and-advisories/alerts/al-2025-063
1•onlinenotepad•6m ago•0 comments

Therapy dogs: stop crafting loopholes to fair, reasonable laws

https://dirtamericana.com/2025/04/therapy-dogs-business-interior-violations/
2•speckx•8m ago•0 comments

Show HN: FastPitchDeck – AI to generate VC-ready pitch decks

https://fastpitchdeckai.vercel.app/
1•ramyavarahagiri•10m ago•0 comments

Writing a Little Gosh

https://flak.tedunangst.com/post/writing-a-gosh
1•dpassens•13m ago•0 comments

Martech Engineer

1•smwbauer•14m ago•1 comments

Can we ever understand our dogs?

https://www.vox.com/explain-it-to-me/418008/dog-pets-perception-science-research-animal-smell
1•lr0•14m ago•0 comments

GenAI – Will Workers Disappear?

https://www.nominalnews.com/p/ai-labor-workers-economy
1•MPLan•15m ago•1 comments

Iran's Internet Blackout Accidentally Revealed Coordinated Narrative in the West

4•Memetic-tracer•16m ago•1 comments

Resources for Disaster Preparedness in Heritage (2024)

https://conserv.io/blog/7-resources-for-disaster-preparedness-in-heritage/
1•mooreds•16m ago•0 comments

Senator Chides FBI for Weak Advice on Mobile Security

https://krebsonsecurity.com/2025/06/senator-chides-fbi-for-weak-advice-on-mobile-security/
2•todsacerdoti•16m ago•0 comments

20 years on, Max Payne is as stylish as ever (2021)

https://www.eurogamer.net/20-years-on-max-payne-is-as-stylish-as-ever
3•Michelangelo11•17m ago•1 comments

Ask HN: Is "ethical AI" possible, or is there a catch?

1•mrdependable•21m ago•2 comments

Nvidia Gr00T - An Improved Open Foundation Model for Generalist Humanoid Robots

https://research.nvidia.com/labs/gear/gr00t-n1_5/
1•bottomotto•22m ago•0 comments

Zuckerberg Announces Meta 'Superintelligence' Effort, More Hires

https://www.bloomberg.com/news/articles/2025-06-30/zuckerberg-announces-meta-superintelligence-effort-more-hires
3•mfiguiere•22m ago•0 comments

Screenshot guide to check AI Search traffic in 2 minutes

https://ottic.ai/blog/how-to-check-your-chatgpt-traffic/
1•rafaepta•22m ago•0 comments

He Thought an Employee Stole Crypto. The FBI Says It Was a North Korean Scammer

https://www.wsj.com/business/he-thought-an-employee-stole-crypto-the-fbi-says-it-was-a-north-korean-scammer-8aa533a8
2•ddlatham•23m ago•0 comments

Show HN: Lunova – Custom rule-based SMS/email alerts for QuickBooks

https://uselunova.com/
1•chidog12•23m ago•0 comments

That XOR Trick (2020)

https://florian.github.io//xor-trick/
1•hundredwatt•23m ago•0 comments

Using Machine Learning to Detect Vault (Anti-Forensic) Apps

https://www.mdpi.com/1999-5903/17/5/186
1•bikenaga•23m ago•0 comments

Books of Bowie

http://www.bowiebookclub.com/david-bowies-100-most-influential-books
1•BruceEel•24m ago•0 comments

Building Scalable Systems While Paying the Bills

https://eric.mann.blog/finding-balance-building-scalable-systems-while-paying-the-bills/
1•eamann•25m ago•0 comments

Show HN: ArcFont – Font Embedding Model

https://github.com/JErnestoMtz/ArcFont
1•jernestomg•25m ago•0 comments

LLMs and Artists

https://blog.brokk.ai/llms-and-artists/
2•jbellis•28m ago•0 comments

Cincinnati's Grand Hall Recreated in PlayCanvas Using 3D Gaussian Splatting

https://www.ryanfellers.com/oldmain/
2•ovenchips•28m ago•0 comments

Giant plankton could help coral fight climate change

https://phys.org/news/2025-06-giant-plankton-coral-climate.html
1•PaulHoule•29m ago•0 comments

Vulnerability Advisory: Sudo Chroot Elevation of Privilege

https://www.stratascale.com/vulnerability-alert-CVE-2025-32463-sudo-chroot
1•eyberg•29m ago•0 comments

What's it like to be Batfished? A new word for mistaking LLMs for true subjects

https://partiallyexaminedlife.com/2025/06/30/what-is-it-like-to-be-batfished/
1•mistidoi•29m ago•0 comments

Trip June 2025 ISO C++ standards meeting (Sofia, Bulgaria)

https://herbsutter.com/2025/06/21/trip-report-june-2025-iso-c-standards-meeting-sofia-bulgaria/
1•klaussilveira•30m ago•0 comments