We Tried Helping Medical AI Startups Get Diverse Data. Here's What We Learned

1•mehashim•3h ago

I’m a medical doctor and founder of a small startup called Craniolabs. A month ago, my cofounders and I started working on a painfully obvious problem in medical AI: the lack of diverse, high-quality, ethically sourced medical imaging data, especially from underrepresented regions like Africa, Asia, and the Middle East.

Everyone in the field talks about bias in datasets, but very few seem to be solving it at the root. So we tried.

We signed memorandums of understanding with hospitals in Egypt and Dubai. We got access to radiology departments and de-identified DICOM archives. We partnered with radiologists. We integrated NVIDIA MONAI to streamline annotation. And then… we reached out to over 30 diagnostic AI startups that we thought would jump at the chance to access better data.

Almost no one replied.

Some opened our emails 3–4 times. A few asked for more info. One or two made it to pricing discussions. But the reality is: most teams either weren’t ready to buy, didn’t have budget, or were hesitant to engage in anything that looked operational.

Here’s what we learned:

• Most diagnostic AI startups are resource-strapped, even if well-funded
• Everyone wants clean, diverse data, but no one wants to manage the plumbing
• Many are still stuck using NIH ChestXray or CheXpert and don’t trust third-party data easily
• Even at small scale, hospitals need very clear legal, ethical, and financial frameworks to move data

We’re now restructuring toward a subscription-based model with ready-to-use curated batches, compliance built in, and optional annotations. But the lesson stuck: getting the first few customers is way harder than building the product.

Curious if others here have faced something similar, whether in healthcare, infra, or AI. If you’re building in this space or just have thoughts, I’d love to hear how you’d approach this.

(More about what we’re doing at: https://craniolabs.tech)
