frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

UK Biobank leak: Health details of 500 000 people are offered for sale

https://www.bmj.com/content/393/bmj.s781
131•dberhane•2h ago

Comments

WalterGR•2h ago
Related: https://news.ycombinator.com/item?id=47875843 “UK Biobank health data keeps ending up on GitHub”
blitzar•1h ago
Extremely related - my red string on the wall points to this being the source of the data leak rather the latest heist by Oceans Crew.

Given the whack-a-mole takedowns, its pretty clear everyone involved knew what was going on.

fragmede•1h ago
I want to get my DNA digitized so I can do all sorts of health stuff for myself, but finding a place that won't leak my data is troublesome. 23andme is right out.
grey-area•1h ago
Buy a desktop sequencer?

https://nanoporetech.com/products/sequence/minion

ogundipeore•1h ago
Great suggestion. Thank you for sharing!
fenaer•1h ago
I have the same sentiment as OP, but for me the main benefit of a company doing it is the analysis that comes with it.
odyssey7•49m ago
If we are censoring our daily activities and major life decisions like healthcare due to the data economy, then it is making us less free. But who knows how many generations will pass before a solution shows up. We would need representatives who act collectively towards motives beyond profits.
conception•1h ago
https://sequencing.com/our-difference/privacy-forever seems the best choice these days.
sheiyei•1h ago
I can believe the company does their best to keep the records private.

...until they're inevitably sold.

GistNoesis•1h ago
Similar to https://xcancel.com/SethSHowes ~10k budget based on minION sequencer. (Edit : his dedicated project page https://iwantosequencemygenomeathome.com/ )

But once your data has been digitized even if it is under your control the likelihood that it gets leaked is still high. Specially now with AI agents running everywhere, or people just asking AI services for medical advice.

Today the choice for advice is between low quality local AI advice or higher quality advice but lose your data control, the rational choice is probably losing your data control even if if will almost certainly comes back to bite you.

londons_explore•1h ago
There isn't much difference between giving this data to 20,000 researchers all over the world and simply publishing the data on the web.

I personally would like data like this to simply be published, together with a law that says using the data to make personalized decisions affecting those individuals is punishable with life in prison.

Basically, this data is 'opensource', but not for use to decide insurance premiums, job offers, or the contents of news articles.

Pay08•1h ago
I can't wait for this to be used for assassination by peanut.
keybored•1h ago
“We didn’t make a decision based on that.” Done and dusted?
chii•6m ago
or it's made the onus for the proof that the data wasn't used, so if your decision didn't come with a proof it wasn't, the party making the decision can be sued for it.

Like a clean room implementation requirement.

basisword•1h ago
Which would be fine if that's what the people who gave their data over agreed to.
spacebanana7•1h ago
> together with a law that says using the data to make personalized decisions affecting those individuals is punishable with life in prison.

This works well in theory but is basically unenforceable. It's barely possible, if possible at all, to audit how FB or google make ad targeting decisions - but once stuff gets into the fragmented ecosystem of data brokers and market intelligence consultancies all hope is lost.

To say nothing of state actors, like countries who might deny you a visa based on adverse medical info or otherwise use your information against you.

estearum•29m ago
well you just articulated the difference

licensing it to researchers allows you to create, monitor, and enforce policies like the one you describe

stealing it does not

probably_wrong•24m ago
> There isn't much difference between giving this data to 20,000 researchers all over the world and simply publishing the data on the web.

As a researcher who regularly deals with such data there is a MASSIVE difference. Yes, I have access to the data but I am restricted on how it can be stored (no cloud), what I can and can't do with it, and for some of it I'm even mandated to destroy it once the research project is over. I have the informed consent of every participant, some of which withdrew halfway throughout the collection without any penalty to them. I also don't need a new law because I'm already bound by existing ones, by the contract I signed when I joined, and by the confidentiality agreement I signed when the project started. While I don't know that the leaker(s) will be identified, the existence of the data itself already calls for legal action while giving a starting point for investigation.

Your suggestion, on the other hand, seems to be "let's put this data out there without people's consent and make companies pinky promise that they won't use it in their black boxes in a way that's virtually impossible to detect or prosecute". Those two things are definitely not equivalent.

Aboutplants•1h ago
Gonna wager the US government is the first to purchase
gib444•1h ago
I thought we pay them to have it via Palantir contracts or something?
blitzar•1h ago
I think it is google that we pay to backdoor the data
cbg0•1h ago
The US has over 70 million on Medicare, why would they care about 500K brits?
azan_•1h ago
"Access this article for 1 day for: £50 / $60/ €56 (excludes VAT)" Man, the scientific publishing cartel is something else. Note that author will generally get exactly £0 / $0 / €0 for his text.
scotty79•1h ago
That kind of data should be public anyways.
alt227•1h ago
Yeah, as long as all 500,000 people in the data set agreed for it to be public then thats fine. But how do we verify that?
Ylpertnodi•34m ago
They're on the list, their information is out there. Isn't that what 'opt in' means?
PunchTornado•41m ago
When i signed up as a volunteer they assured me it was not going to be public, only veted researchers allowed to access it.
greg_dc•1h ago
In fairness, is this any worse than what Palantir will do with the whole countries NHS records? And they're being paid by the government to do it!
jjice•55m ago
Both are bad
estearum•30m ago
Is allowing random malicious actors to buy health data worse than allowing NHS's own employees to interact with that data productively?

yes

chromehearts•21m ago
Palantir may not be random but it's certainly a malicious actor
philipallstar•17m ago
The NHS does it so badly that they brought in Palantir.
crimsoneer•26m ago
Well, one is a thing that has happened, and one is a thing that hasn't happened.
Aurornis•20m ago
> In fairness, is this any worse than what Palantir will do with the whole countries NHS records?

I don’t get this trend of seeing bad thing happen and then commenting that other bad thing exists and therefore “in fairness” we should downplay it.

Bad things are bad. Comparing them to other things we don’t like doesn’t make them less bad. I don’t like Palantir either but they’re not intentionally leaking health details so this comparison doesn’t even make any sense.

cassianoleal•15m ago
> they’re not intentionally leaking health details

To many, they are. They're leaking information that has been trusted to the NHS to their own databases.

The fact that it's being done under government contract and (arguably) within the law shouldn't immediately make it any less bad.

mentalgear•1h ago
> Data for sale included people’s gender, age, month and year of birth, socioeconomic status, lifestyle habits, mental health, self-reported medical history, cognitive function, and physical measures.

If this is not traceable back to individuals, it would probably good to be made public. But I assume the UK Biobank only gives access to trusted partners since - as we know in our 'data analytics' day and age - with enough general data quantity you can trace back anything to anyone if you have the resources. And the capitalist-surveillance econonmy certainly provides the profit-motive.

mellosouls•42m ago
Already being discussed:

UK Biobank health data keeps ending up on GitHub

https://news.ycombinator.com/item?id=47875843

UK Biobank health data listed for sale in China, government confirms

https://news.ycombinator.com/item?id=47874732

noname120•19m ago
How can the fulltext be accessed?
jonathanstrange•8m ago
In the same way as the "UK Biobank" software accesses it.

UK Biobank leak: Health details of 500 000 people are offered for sale

https://www.bmj.com/content/393/bmj.s781
137•dberhane•2h ago•49 comments

S. Korea police arrest man over AI image of runaway wolf that misled authorities

https://www.bbc.com/news/articles/c4gx1n0dl9no
161•giuliomagnifico•4h ago•93 comments

Spinel: Ruby AOT Native Compiler

https://github.com/matz/spinel
140•dluan•5h ago•33 comments

How to be anti-social – a guide to incoherent and isolating social experiences

https://nate.leaflet.pub/3mk4xkaxobc2p
98•calcifer•2h ago•76 comments

DeepSeek v4

https://api-docs.deepseek.com/
1276•impact_sy•10h ago•903 comments

Mounting tar archives as a filesystem in WebAssembly

https://jeroen.github.io/notes/webassembly-tar/
37•datajeroen•3h ago•5 comments

US special forces soldier arrested after allegedly winning $400k on Maduro raid

https://www.cnn.com/2026/04/23/politics/us-special-forces-soldier-arrested-maduro-raid-trade
379•nkrisc•15h ago•421 comments

Show HN: How LLMs Work – Interactive visual guide based on Karpathy's lecture

https://ynarwal.github.io/how-llms-work/
129•ynarwal__•6h ago•31 comments

An update on recent Claude Code quality reports

https://www.anthropic.com/engineering/april-23-postmortem
800•mfiguiere•19h ago•615 comments

Why I Write (1946)

https://www.orwellfoundation.com/the-orwell-foundation/orwell/essays-and-other-works/why-i-write/
209•RyanShook•11h ago•51 comments

Bitwarden CLI compromised in ongoing Checkmarx supply chain campaign

https://socket.dev/blog/bitwarden-cli-compromised
802•tosh•23h ago•381 comments

Aspartame is not that bad?

https://dynomight.net/aspartame/
61•pHequals7•1h ago•63 comments

GPT-5.5

https://openai.com/index/introducing-gpt-5-5/
1430•rd•19h ago•951 comments

8087 Emulation on 8086 Systems

https://www.os2museum.com/wp/learn-something-old-every-day-part-xx-8087-emulation-on-8086-systems/
8•ingve•2h ago•0 comments

Show HN: Gova – The declarative GUI framework for Go

https://github.com/NV404/gova
72•aliezsid•7h ago•14 comments

Hear your agent suffer through your code

https://github.com/AndrewVos/endless-toil
47•AndrewVos•2h ago•14 comments

The operating cost of adult and gambling startups

https://orchidfiles.com/stigma-is-a-tax-on-every-operational-decision/
48•theorchid•1h ago•44 comments

MeshCore development team splits over trademark dispute and AI-generated code

https://blog.meshcore.io/2026/04/23/the-split
240•wielebny•20h ago•127 comments

Meta tells staff it will cut 10% of jobs

https://www.bloomberg.com/news/articles/2026-04-23/meta-tells-staff-it-will-cut-10-of-jobs-in-pus...
677•Vaslo•18h ago•654 comments

Using the internet like it's 1999

https://joshblais.com/blog/using-the-internet-like-its-1999/
185•joshuablais•17h ago•128 comments

Show HN: Tolaria – Open-source macOS app to manage Markdown knowledge bases

https://github.com/refactoringhq/tolaria
238•lucaronin•15h ago•100 comments

Familiarity is the enemy: On why Enterprise systems have failed for 60 years

https://felixbarbalet.com/familiarity-is-the-enemy/
68•adityaathalye•8h ago•34 comments

UK Biobank health data keeps ending up on GitHub

https://biobank.rocher.lc
167•Cynddl•23h ago•41 comments

Habitual coffee intake shapes the microbiome, modifies physiology and cognition

https://www.nature.com/articles/s41467-026-71264-8
207•scubakid•9h ago•145 comments

TorchTPU: Running PyTorch Natively on TPUs at Google Scale

https://developers.googleblog.com/torchtpu-running-pytorch-natively-on-tpus-at-google-scale/
159•mji•16h ago•14 comments

My phone replaced a brass plug

https://drobinin.com/posts/my-phone-replaced-a-brass-plug/
162•valzevul•21h ago•40 comments

Alberta startup sells no-tech tractors for half price

https://wheelfront.com/this-alberta-startup-sells-no-tech-tractors-for-half-price/
2229•Kaibeezy•1d ago•750 comments

Show HN: Agent Vault – Open-source credential proxy and vault for agents

https://github.com/Infisical/agent-vault
120•dangtony98•1d ago•40 comments

A programmable watch you can actually wear

https://www.hackster.io/news/a-diy-watch-you-can-actually-wear-8f91c2dac682
200•sarusso•3d ago•95 comments

Show HN: Honker – Postgres NOTIFY/LISTEN Semantics for SQLite

https://github.com/russellromney/honker
277•russellthehippo•1d ago•68 comments