frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

AI-generated medical data can sidestep usual ethics review, universities say

https://www.nature.com/articles/d41586-025-02911-1
5•qnleigh•1h ago

Comments

evil-olive•4m ago
> To generate what is called synthetic data, researchers train generative AI models using real human medical information, then ask the models to create data sets with statistical properties that represent, but do not include, human data.

famously, "garbage in, garbage out"

but thanks to AI, we now have the exciting innovation that you can inject garbage into the middle of the process.

you have data from actual humans. it has some statistical properties.

you could look at those statistical properties, and do research on them, looking for hidden correlations or whatever. that's been possible for decades, no need for LLMs.

or, you can take those statistical properties, ask a chatbot to generate synthetic data based on them, and then do research on that synthetic data. but...why?

any valid conclusions from the research will be based on the statistical properties that were already there in the original data. the extra step of using the LLM gains nothing, and adds risk of the research being faulty because it found some correlation that the LLM made up.

this is like taking an image, saving it as a JPEG with 5% quality (or some other lossy process), and then asking an AI to upscale and enhance it for you. in the best-case, all you get is a reconstruction of the original. and realistically you'll almost certainly introduce misleading artifacts and noise.

or, scramble an egg, take a picture, and ask the chatbot to generate a picture for you of what the unbroken egg might have looked like. maybe it'll do a decent job of it...but 5 minutes ago you had the unbroken egg in your hand.

LLMs cannot reverse entropy. they cannot unscramble the egg. you can easily add randomness to a data set, but you cannot easily remove it.

Should LLMs Write FOSS Books?

1•DavidCanHelp•6m ago•3 comments

A Word about Complexity

https://dillo-browser.github.io/complexity.html
2•rodarima•9m ago•0 comments

The Age of Greater Reykjavík

https://www.flother.is/blog/reykjavik-age/
1•alexharri•9m ago•0 comments

Show HN: Freeze Trap

https://deepanwadhwa.github.io/freeze_trap/
1•dwa3592•9m ago•0 comments

How to Scam the Scammers

https://derogab.com/2025/09/14/How-to-Scam-the-Scammers/
2•derogab•10m ago•0 comments

Stop After Current

1•Toby1VC•10m ago•0 comments

The Vibe Coder's Guide to Product Management (Open Source Book)

https://github.com/cloudstreet-dev/The-Vibe-Coder-s-Guide-to-Product-Management/blob/main/chapter...
1•DavidCanHelp•16m ago•0 comments

Njalla Has Silently Changed: A Word of Caution

https://xn--gckvb8fzb.com/njalla-has-silently-changed-a-word-of-caution/
2•Improvement•17m ago•0 comments

CVC acquires majority stake in Namecheap for $1.5B

https://webhosting.today/2025/09/12/cvc-acquires-majority-stake-in-namecheap-for-1-5-billion/
3•ajdude•18m ago•0 comments

The Qweremin

https://linusakesson.net/qweremin/index.php
1•bookofjoe•19m ago•0 comments

Experimental platform using LLMs to generate algorithmic music

https://vibecompose.vercel.app/
1•maddmann•24m ago•1 comments

TLD domain name renewal grace periods

https://www.namecheap.com/support/knowledgebase/article.aspx/9916/2207/tlds-grace-periods/
1•fanf2•27m ago•0 comments

New and simple detection method for nanoplastics

https://www.uni-stuttgart.de/en/university/news/all/New-and-simple-detection-method-for-nanoplast...
2•geox•32m ago•0 comments

My Thoughts on Renting Versus Buying

https://milesbarr.me/posts/my-thoughts-on-renting-versus-buying/
2•milesbarr•33m ago•0 comments

Quote Posts

https://blog.joinmastodon.org/2025/09/introducing-quote-posts/
3•doener•38m ago•0 comments

Microsoft mandates RTO – claims Teams and all remote work solutions are inferior

https://www.windowscentral.com/software-apps/microsoft-mandates-return-to-office-claims-teams-and...
6•taubek•39m ago•1 comments

Riff

https://www.letsriff.ai/
3•wheresclark•46m ago•0 comments

Dev3000 – The browser for AI-based development

https://dev3000.ai/
1•plurby•50m ago•0 comments

Female jumping spiders drive hybridization by favoring red males across species

https://phys.org/news/2025-08-female-spiders-hybridization-favoring-red.html
4•PaulHoule•51m ago•0 comments

AI as Teleportation

https://www.geoffreylitt.com/2025/09/10/ai-as-teleportation.html
1•WhyNotHugo•51m ago•0 comments

How the WSJ Analyzed More Than One Million FAA Reports

https://www.wsj.com/business/airlines/how-the-journal-analyzed-more-than-one-million-faa-reports-...
3•JumpCrisscross•53m ago•0 comments

Interactive Tree of Life Explorer

https://www.onezoom.org
1•downboots•53m ago•0 comments

Ask HN: Why does AWS Route53 price .click domains at only $3/year?

3•michaelstewart•55m ago•3 comments

Elastically Graded Embroidered Tessellations

https://techxplore.com/news/2025-09-machine-embroidery-encodes-skin-tension.html
1•lif•55m ago•0 comments

"We must break with the idea that it is civil liberty to use encrypted apps"

https://mastodon.social/@chatcontrol/115204439983078498
5•nickslaughter02•55m ago•2 comments

Created a map of all the research on Asthma for the past 10 years

https://old.reddit.com/r/Asthma/comments/1na86tz/i_created_an_interactive_map_of_all_the_research/
3•SantiagoVargas•57m ago•1 comments

Show HN: Proxmox-GitOps: Recursive IaC LXC Container Automation

https://github.com/stevius10/Proxmox-GitOps
1•stevius•59m ago•0 comments

Home is where the home server is

https://ounapuu.ee/posts/2025/05/15/home/
3•markyouk•59m ago•0 comments

Show HN: Worried about your pet? Health assessments with instant answers

https://petcheckai.com
2•pcrausaz•1h ago•0 comments

Daisugi, the 600-Year-Old Japanese Technique of Growing Trees Out of Other Trees (2020)

https://www.openculture.com/2020/10/daisugi.html
5•coloneltcb•1h ago•0 comments