Show HN: Cancer diagnosis makes for an interesting RL environment for LLMs

17•dchu17•2h ago
Hey HN, this is David from Aluna (YC S24). We work with diagnostic labs to build datasets and evals for oncology tasks.

I wanted to share a simple RL environment I built that gives frontier LLMs a set of tools that let them zoom and pan across a digitized pathology slide to find the relevant regions to make a diagnosis. Here are some videos of the LLM performing diagnosis on a few slides:

(https://www.youtube.com/watch?v=k7ixTWswT5c): traces of an LLM choosing different regions to view before making a diagnosis on a case of small-cell carcinoma of the lung

(https://youtube.com/watch?v=0cMbqLnKkGU): traces of an LLM choosing different regions to view before making a diagnosis on a case of benign fibroadenoma of the breast

Why I built this:

Pathology slides are the backbone of modern cancer diagnosis. Tissue from a biopsy is sliced, stained, and mounted on glass for a pathologist to examine for abnormalities.

Today, many of these slides are digitized into whole-slide images (WSIs) in TIF or SVS format, and each can be several gigabytes in size.

While there exist several pathology-focused AI models, I was curious to test whether frontier LLMs can perform well on pathology-based tasks. The main challenge is that WSIs are too large to fit into an LLM’s context window. The standard workaround, splitting them into thousands of smaller tiles, is inefficient for large frontier LLMs.
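For a sense of scale, here is a back-of-the-envelope sketch of the tiling workaround. The slide dimensions and tile size are hypothetical, chosen only for illustration; real WSIs vary widely.

```python
import math


def tile_count(width_px: int, height_px: int, tile_px: int) -> int:
    """Number of non-overlapping square tiles needed to cover a slide."""
    return math.ceil(width_px / tile_px) * math.ceil(height_px / tile_px)


# Hypothetical values, not taken from the post.
SLIDE_W, SLIDE_H = 100_000, 80_000  # full-resolution slide size in pixels
TILE = 1024                         # tile edge length in pixels

print(tile_count(SLIDE_W, SLIDE_H, TILE))  # 98 columns * 79 rows = 7742
```

Under these assumptions, a single slide becomes several thousand image inputs, most of which are background or irrelevant tissue.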

Inspired by how pathologists zoom and pan under a microscope, I built a set of tools that let LLMs control magnification and coordinates, viewing small regions at a time and deciding where to look next.
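The post does not show the tool definitions, so the following is only a minimal sketch of what a zoom-and-pan viewport could look like. Every name here (`Viewport`, `zoom`, `pan`, `region`), the 40x base magnification, the 0.1x lower bound, and the 1024-pixel view size are assumptions for illustration. It only tracks state and computes which full-resolution pixel box is in view; a real tool would then read those pixels from the WSI with a library such as OpenSlide and return the image to the model.

```python
from dataclasses import dataclass, field


@dataclass
class Viewport:
    """Hypothetical viewport state an LLM could drive through tool calls."""
    slide_w: int                  # full-resolution slide width in pixels
    slide_h: int                  # full-resolution slide height in pixels
    base_mag: float = 40.0        # assumed magnification of the full-resolution scan
    view_px: int = 1024           # assumed edge length of the image shown to the model
    mag: float = 1.0              # current magnification
    cx: float = field(init=False)  # view center, full-resolution x
    cy: float = field(init=False)  # view center, full-resolution y

    def __post_init__(self) -> None:
        # Start centered on the slide.
        self.cx, self.cy = self.slide_w / 2, self.slide_h / 2

    def _span(self) -> float:
        # Full-resolution pixels covered by one edge of the view.
        return self.view_px * self.base_mag / self.mag

    def zoom(self, mag: float) -> None:
        """Set magnification, clamped to an assumed range of [0.1, base_mag]."""
        self.mag = min(max(mag, 0.1), self.base_mag)

    def pan(self, dx: float, dy: float) -> None:
        """Move the center by fractions of the current field of view."""
        span = self._span()
        self.cx = min(max(self.cx + dx * span, 0), self.slide_w)
        self.cy = min(max(self.cy + dy * span, 0), self.slide_h)

    def region(self) -> tuple[int, int, int, int]:
        """Return (left, top, width, height) in full-resolution pixels."""
        span = self._span()
        w, h = min(span, self.slide_w), min(span, self.slide_h)
        left = min(max(self.cx - w / 2, 0), self.slide_w - w)
        top = min(max(self.cy - h / 2, 0), self.slide_h - h)
        return round(left), round(top), round(w), round(h)


view = Viewport(slide_w=100_000, slide_h=80_000)
view.zoom(40)      # jump to the highest magnification
view.pan(1, 0)     # move one field of view to the right
print(view.region())
```

Expressing pan distances as fractions of the current field of view keeps the same tool call meaningful at every magnification, which is one plausible design choice among several.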

This resulted in some interesting behaviors, and actually seemed to yield pretty good results with prompt engineering:

- GPT-5: explored up to ~30 regions before deciding (concurred with an expert pathologist on 4 out of 6 cancer subtyping tasks and 3 out of 5 IHC scoring tasks)

- Claude 4.5: typically used 10–15 views, with accuracy similar to GPT-5 (concurred with the pathologist on 3 out of 6 cancer subtyping tasks and 4 out of 5 IHC scoring tasks)

- Smaller models (GPT-4o, Claude 3.5 Haiku): examined ~8 frames and were less accurate overall (1 out of 6 cancer subtyping tasks and 1 out of 5 IHC scoring tasks)
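To make the comparison easier to read, the counts above can be pooled per model. This is just arithmetic on the numbers reported in the list; the dictionary layout and the `overall` helper are illustrative, and the smaller models are treated as one group because they are reported that way.

```python
# (agreements with the pathologist, number of tasks), as reported above.
results = {
    "GPT-5": {"subtyping": (4, 6), "ihc": (3, 5)},
    "Claude 4.5": {"subtyping": (3, 6), "ihc": (4, 5)},
    "Smaller models": {"subtyping": (1, 6), "ihc": (1, 5)},
}


def overall(tasks: dict[str, tuple[int, int]]) -> tuple[int, int]:
    """Pool agreements and task counts across task types."""
    agree = sum(a for a, _ in tasks.values())
    total = sum(n for _, n in tasks.values())
    return agree, total


for model, tasks in results.items():
    agree, total = overall(tasks)
    print(f"{model}: {agree}/{total} = {agree / total:.0%}")
```

With only 11 tasks per model, a single case moves the pooled figure by about 9 percentage points, so these numbers are indicative at best.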

Obviously, this was a small sample set, so we are working on creating a larger benchmark suite with more cases and types of tasks, but I thought it was cool that it even worked, so I wanted to share it with HN!

Comments

n2d4•51m ago
How would a human classify the cancers? I assume the LLM training data does not include a whole bunch of cancer samples, so presumably there are some rules that it follows?

> While there exists several pathology-focused AI models

Would also be curious how the LLM compares to this and other approaches. What's the performance of the models trained specifically on this task, and random guessing, compared to the expert pathologist? Correct me if I'm wrong but this seems like the sort of task where being right 90% of the time is not good enough, so even if the LLM beats other approaches, it still needs to close the gap to human performance.

dchu17•26m ago
> What's the performance of the models trained specifically on this task, and random guessing, compared to the expert pathologist?

I should probably clarify first: the disease classification tasks are about subtyping the cancer (e.g., classifying a case as invasive ductal carcinoma of the breast) rather than just broad malignant/benign classification, so random guessing is much less likely to succeed, which makes the models' performance more impressive.

> Would also be curious how the LLM compares to this and other approaches.

There aren't a lot of public general-purpose pathology benchmarks. There are some, like (https://github.com/sinai-computational-pathology/SSL_tile_be...), but they focus on just binary benign/malignant classification tasks and binary biomarker detection tasks.

I am currently working on self-hosting the available open-source models.

> this seems like the sort of task where being right 90% of the time is not good enough, so even if the LLM beats other approaches, it still needs to close the gap to human performance

Yep, your intuition is right here, and the expectation is actually probably closer to the mid-to-high 90s, especially for FDA approval (and most AI tools are positioned as co-pilots at the moment). There is obviously a long way to go, but what I find interesting about this approach is that it allows LLMs to generalize across (1) a variety of tissue types and (2) pathology tasks such as IHC H-scoring.
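For readers unfamiliar with the H-score: as commonly defined, it weights the percentage of cells staining at each intensity (weak = 1, moderate = 2, strong = 3) and sums them, giving a value from 0 to 300. The sketch below shows that standard formula only; the thread does not say exactly how the IHC scoring tasks here were defined or graded, and the function name and example percentages are made up.

```python
def h_score(pct_weak: float, pct_moderate: float, pct_strong: float) -> float:
    """Standard H-score: 1*weak% + 2*moderate% + 3*strong%, ranging from 0 to 300."""
    percentages = (pct_weak, pct_moderate, pct_strong)
    if any(p < 0 for p in percentages) or sum(percentages) > 100:
        raise ValueError("percentages must be non-negative and sum to at most 100")
    return 1 * pct_weak + 2 * pct_moderate + 3 * pct_strong


# Made-up example: 10% weak, 20% moderate, 30% strong, 40% unstained.
print(h_score(10, 20, 30))  # 10 + 40 + 90 = 140
```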

ytrt54e•5m ago
You should reach out to Eric Topol...