news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

'Comically bad' datasets used to train clinical models for stroke and diabetes

https://retractionwatch.com/2026/05/18/kaggle-dataset-clinical-models-stroke-diabetes/

13•leephillips•1h ago

Comments

Legend2440•36m ago

A lot of researchers think their job is to build models. They don't want to collect their own data, so they go find whatever dataset they can on kaggle or from a previous paper or wherever.

This is backwards. The model is the easy part. Getting good data is 99% of the job, and nearly any clown can make a good model once you hand them a good dataset.

skvmb•25m ago

As a clown, I can confirm.

If you hand me a clean, well-labeled, representative dataset, I can make the model do a respectable little dance by lunch.

If you hand me a Kaggle CSV with duplicated rows, target leakage, mislabeled outcomes, and columns named final_final_v2_REAL, suddenly I’m not doing ML anymore. I’m doing archaeology with a red nose on.

The model is the balloon animal. The dataset is the elephant you had to drag into the tent.

nradov•21m ago

For a lot of clinical decision support use cases you don't even need fancy AI models to get accurate results. If you have good quality cleansed data you can literally just import it into Excel and run a simple linear regression analysis. But unfortunately that won't get you a reputation as an "AI thought leader".

I’ve built a virtual museum with nearly every operating system you can think of

https://virtualosmuseum.org/

272•andreww591•2h ago•57 comments

Apple unveils new accessibility features

https://www.apple.com/newsroom/2026/05/apple-unveils-new-accessibility-features-and-updates-with-...

434•interpol_p•6h ago•231 comments

I’ve joined Anthropic

https://twitter.com/karpathy/status/2056753169888334312

755•dmarcos•3h ago•285 comments

Gaussian Splat of a Strawberry

https://superspl.at/scene/84df8849

385•danybittel•7h ago•147 comments

Gentoo News: Copy Fail, Dirty Frag, and Fragnesia Kernel Vulnerabilities

https://www.gentoo.org/news/2026/05/19/copy-fail-fragnesia-vulnerabilities.html

59•akhuettel•3h ago•13 comments

Show HN: Superlog (YC P26) – Observability that installs itself and fixes bugs

https://superlog.sh/

28•Magnanten•2h ago•26 comments

Gemini 3.5: frontier intelligence with action

https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-5/

90•meetpateltech•41m ago•36 comments

KV Cache Is Becoming the Memory Hierarchy of Inference

https://touchdown-labs.com/blog/kv-cache-memory-hierarchy-inference.html

10•matt_d•2d ago•0 comments

Intro to TLA+ for the LLM Era: Prompt Your Way to Victory

https://emptysqua.re/blog/intro-to-tla-plus-for-the-llm-era/

67•zdw•2d ago•16 comments

CISA Admin Leaked AWS GovCloud Keys on GitHub

https://krebsonsecurity.com/2026/05/cisa-admin-leaked-aws-govcloud-keys-on-github/

270•LelouBil•10h ago•123 comments

Hanoi’s humble beer glass and the memory of a nation

https://sundaylongread.com/2026/05/15/hanois-humble-beer-glass-and-the-memory-of-a-nation/

83•NaOH•1d ago•13 comments

Google I/O

https://io.google/2026/

132•thanhhaimai•1h ago•155 comments

Cursor Cloud Agents Down

https://forum.cursor.com/t/cloud-agents-broken-ii/161036

7•mopatches•54m ago•1 comments

I Found Ultra-Pure Quantum Crystals in an Abandoned Mine in the Atacama Desert

https://medium.com/@breid.at/ultra-pure-quantum-crystals-from-an-abandoned-mine-in-a-mysterious-d...

220•vi_sextus_vi•2d ago•83 comments

The last six months in LLMs in five minutes

https://simonwillison.net/2026/May/19/5-minute-llms/

653•yakkomajuri•17h ago•524 comments

KV Sharing, MHC, and Compressed Attention

https://magazine.sebastianraschka.com/p/recent-developments-in-llm-architectures

8•gmays•1h ago•0 comments

Mini Shai-Hulud Strikes Again: 314 npm Packages Compromised

https://safedep.io/mini-shai-hulud-strikes-again-314-npm-packages-compromised/

307•theanonymousone•13h ago•227 comments

Peter Neumann has died

https://www.tuhs.org/pipermail/tuhs/2026-May/033748.html

279•pabs3•15h ago•23 comments

Show HN: I made a 3D pose maker for artists

https://setpose.com/

51•augustvdv•4h ago•26 comments

An Apple (II) for Teacher

https://technicshistory.com/2026/05/19/an-apple-ii-for-teacher/

44•cfmcdonald•18h ago•12 comments

Show HN: Haystack – Review the PRs that need human attention

https://haystackeditor.com/

10•akshaysg•1d ago•5 comments

OpenBSD 7.9

https://www.openbsd.org/79.html

279•bradley_taunt•5h ago•193 comments

Polypad

https://polypad.amplify.com/

190•ivank•2d ago•22 comments

Google IO 26 Keynote [video]

https://www.youtube.com/watch?v=wYSncx9zLIU

26•Dinux•1h ago•2 comments

Cursor Introduces Composer 2.5

https://cursor.com/blog/composer-2-5

261•asar•1d ago•195 comments

Kv4p HT – A homebrew 1W radio (VHF or UHF) that plugs into an Android phone

https://www.kv4p.com/

154•krupan•3d ago•66 comments

Gemini Omni

https://deepmind.google/models/gemini-omni/

16•meetpateltech•45m ago•3 comments

Click (2016)

https://clickclickclick.click/

356•andrewzeno•19h ago•91 comments

Nim-Presto – REST API Framework for Nim Language (2024)

https://github.com/status-im/nim-presto

52•TheWiggles•2d ago•10 comments

Anthropic acquires Stainless

https://www.anthropic.com/news/anthropic-acquires-stainless

518•tomeraberbach•1d ago•362 comments