Biomni: A General-Purpose Biomedical AI Agent

https://github.com/snap-stanford/Biomni
162•GavCo•10h ago

Comments

freedomben•9h ago
Awesome! This is the type of stuff I'm most excited about with AI - improvements to medical research and capabilities. AI can be awesome at identifying patterns in data that humans can't, and there have to be troves of data out there full of patterns that we aren't catching.

Of course there's also the possibility of engineering new drugs/treatments and things, which is also super exciting.

panabee•4h ago
Agreed. There is deep potential for ML in healthcare. We need more contributors advancing research in this space. One opportunity as people look around: many priors merit reconsideration.

For instance, genomic data that may seem identical may not actually be identical. In classic biological representations (FASTA), canonical cytosine and methylated cytosine are both collapsed into the letter "C" even though differences may spur differential gene expression.
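To make the FASTA point concrete, here is a minimal sketch. It assumes an ad hoc marker `M` for methylated cytosine, which is purely hypothetical (in the IUPAC nucleotide codes `M` actually means "A or C"); the helper name `to_fasta_sequence` is also made up for illustration.

```python
# Hypothetical illustration: "M" is an ad hoc marker for methylated cytosine,
# not a standard FASTA symbol for it.
METHYLATED_C = "M"


def to_fasta_sequence(annotated: str) -> str:
    """Collapse the ad hoc methylation marker into a plain 'C'."""
    return annotated.replace(METHYLATED_C, "C")


unmethylated = "ACGTCGA"
methylated = "ACGTMGA"  # same locus, but the cytosine at index 4 is methylated

# After conversion the two sequences are indistinguishable.
same_after_collapse = to_fasta_sequence(unmethylated) == to_fasta_sequence(methylated)
print(same_after_collapse)
```

Any model tokenizing only the collapsed string cannot recover which cytosines were methylated, so that signal has to come from a separate annotation track.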

What's the optimal tokenization algorithm and architecture for genomic models? How about protein binding prediction? Unclear!

There are so many open questions in biomedical ML.

The openness-impact ratio is arguably as high in biomedicine as anywhere else: if you help answer some of these questions, you could save lives.

Hopefully, awesome frameworks like this lower barriers and attract more people.

AIorNot•9h ago
Very cool - passed on to my friend who is working in a CRISPR lab.
Edmond•9h ago
This is nice, a lot of possibilities regarding AI use for scientific research.

There is also the possibility of building intelligent workspaces that could prove useful in aiding scientific research:

https://news.ycombinator.com/item?id=44509078

SalmoShalazar•9h ago
Not to take away from this or its usefulness (not my intent), but it is wild to me how many pieces of software of this type are being developed. We’re seeing endless waves of specialized wrappers around LLM API calls. There’s very little innovation happening beyond specializing around particular niches and invoking LLMs in slightly different ways with carefully directed context and prompts.
gronky_•9h ago
I see it a bit differently - LLMs are an incredible innovation but it’s hard to do anything useful with them without the right wrapper.

A good wrapper has deep domain knowledge baked into it, combined with automation and expert use of the LLM.

It maybe isn’t super innovative, but it’s a bit of an art form and unlocks the utility of the underlying LLM.

mrlongroots•8h ago
Exactly.

To present a potential use case: there's a ridiculous and massive backlog in the Indian judicial system. LLMs can be let loose on the entire workflow: triage cases (simple, complicated, intractable, grouped by legal principles or parties), pull up related caselaw, provide recommendations, throw more LLMs and more reasoning at unclear problems. Now you can't do this with just a desktop and ChatGPT, you need a systemic pipeline of LLM-driven workflows, but doing that unlocks potentially billions of dollars of value that is otherwise elusive.

lawlessone•8h ago
>pull up related caselaw

Or just make some up...

mrlongroots•7h ago
At the token layer an LLM can make things up, but not as part of a structured pipeline that validates an invariant that all suggestions are valid entities in the database.

Can Google Search hallucinate webpages?
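A rough sketch of that kind of invariant check, assuming a hypothetical `cases` table keyed by citation string. The table, the citations, and the function names are all invented for illustration; the point is only that anything the model suggests is dropped unless it matches a real database row.

```python
import sqlite3


def build_case_db() -> sqlite3.Connection:
    """Create a tiny in-memory stand-in for a caselaw database (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE cases (citation TEXT PRIMARY KEY)")
    conn.executemany(
        "INSERT INTO cases (citation) VALUES (?)",
        [("Case A v. B (2001)",), ("Case C v. D (2010)",)],
    )
    return conn


def validate_citations(conn: sqlite3.Connection, suggested: list[str]):
    """Split model suggestions into those that exist in the database and those that do not."""
    valid, rejected = [], []
    for citation in suggested:
        row = conn.execute(
            "SELECT 1 FROM cases WHERE citation = ?", (citation,)
        ).fetchone()
        (valid if row else rejected).append(citation)
    return valid, rejected


conn = build_case_db()
# Pretend these came from an LLM; the second one is made up.
suggestions = ["Case A v. B (2001)", "Case X v. Y (1999)"]
valid, rejected = validate_citations(conn, suggestions)
print(valid, rejected)
```

This only guarantees that a cited case exists, not that it is relevant or correctly summarized, so it narrows the failure mode rather than removing it.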

tedy1996•6h ago
How is something that can't admit it doesn't know, and hallucinates, a good innovation?
knowaveragejoe•3h ago
Modern LLMs frequently do state that they "don't know", for what it's worth. Like everything, it highly depends on the question.
okdood64•9h ago
> We’re seeing endless waves of specialized wrappers around LLM API calls.

AFAIK, doing proper RAG is much, much more than this.

What's your technical background if you don't mind me asking?

SalmoShalazar•8h ago
I’m a software engineer in the biotech space. I haven’t worked with RAG though, maybe I’m underestimating the complexity.
agpagpws•8h ago
I work at a top three lab. RAG is just Mumbai magic. Throwaway. Hi dang.
jjtheblunt•8h ago
What is a top three lab?
zachthewf•8h ago
We know they don't work at OpenAI or Anthropic, but beyond that have no information
epistasis•8h ago
The application of a new technology to new fields always looks like this. SQL databases become widespread, there's a wave of specialized software development for business practices. The internet becomes widespread, and there's a wave of SaaS solving specialized use cases.

We are going to see the same for anything that Claude or similar can't handle out of the box.

mlboss•7h ago
By that argument every SaaS is a db wrapper
goda90•6h ago
Think of it this way: before the internal combustion engine people used animal power, steam power, human power, wind power, etc. to move cargo, passengers, and even specialized loads like water pumps for the fire brigade. Then with internal combustion they did those things faster and at greater scale. That wasn't innovating on the ICE itself, or solving new problems. But it was still useful. Of course they also eventually did innovate on the ICE, and they solved new problems with it (heavier-than-air flight, for example), but it took a while.
ImaCake•6h ago
I suspect it's jumping on the hype train, especially since it's from a big uni. Funding in research is all about marketing and latching onto the right keywords (just like VC really), so the most successful researchers are those who can market themselves effectively. Whether this tool is actually any good is secondary to whether it achieves the real goal of getting future funding for its author.
andy99•8h ago
I'm sure they've thought of this, but I'm curious how it fared on evaluations for supporting biological threats, i.e. elevating threat actor capabilities with respect to making biological weapons.

I'm personally sceptical that LLMs can currently do this (and it's based on Claude, which does test for this), but it would still be interesting to see.

greazy•5h ago
Creating a biological weapon requires a whole bunch of unique and specialised skills, equipment, safety measures (so you don't infect/kill yourself/your people) and even multidisciplinary skill sets. Take for example the Kameido (Japan) incident by the Aum Shinrikyo cult/religious group [1]. Same group which committed the Sarin attack [2].

> The use of an attenuated B. anthracis strain, low spore concentrations, ineffective dispersal, a clogged spray device, and inactivation of the spores by sunlight are all likely contributing factors to the lack of human cases.

Now you may say, that's bacteria, what about viruses? A similar set of problems would arise: how do you successfully grow virus to high titers? Even vaccine companies struggle to do this with certain viruses. Then the issues of dispersal, infectivity and mortality arise (too quick, it kills the host without spreading and authorities will notice; too slow, same problem: authorities will notice). I haven't even mentioned biological engineering, which requires years of technical knowledge and laboratory experience combined with an intimate knowledge of the organism you're working with.

What worries me the most is nature springing a new influenza subtype. Our farming practices, especially in developing countries, are bound to breed a new subtype. It happened in 2009 (H1N1pdm) and it is bound to happen again. We got lucky with H1N1pdm.

1. https://pmc.ncbi.nlm.nih.gov/articles/PMC3322761/
2. https://en.wikipedia.org/wiki/Tokyo_subway_sarin_attack

deepdarkforest•8h ago
Interesting. It's just an agent loop with access to Python exec and web search as standard, but with 150 premade, curated tools like analyze_circular_dichroism_spectra, each with very specific params, that just execute a hardcoded Python function. Also with easy-to-load databases that conform to the tools' standards.

The argument is that if you just ask Claude Code to do niche biomed tasks, it will not have the knowledge to do it like that by just searching PubMed and doing RAG on the fly, which is fair, given the current gen of LLMs. It's an interesting approach. They show some generalization in the paper (with well-known tidy datasets), but real-life data is messier, and the approach here (correct me if I'm wrong) is to identify the correct tool for a task, then use the generic Python exec tool to shape the data into the acceptable format if needed, try the tool, and go again.

It would be useful to use the tools just as guidance to inform a generic code agent imo, but executing the "verified" hardcoded tools narrows the error scope: as long as you can check your data is shaped correctly, the analysis will be correct. Not sure how much of an advantage this is in the long term for working with proprietary datasets, but it's an interesting direction.
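The "check the shape, reshape with generic code, try the tool again" loop could look roughly like the sketch below. This is not Biomni's code: the column names, the `peak_wavelength` tool, and the `reshape` step (a stand-in for whatever the generic Python exec tool would generate) are all hypothetical.

```python
# Columns the hypothetical hardcoded tool insists on.
REQUIRED_COLUMNS = ("wavelength_nm", "ellipticity")


def check_shape(rows: list[dict]) -> list[str]:
    """Return a list of problems; an empty list means the data fits the tool."""
    problems = []
    for i, row in enumerate(rows):
        missing = [c for c in REQUIRED_COLUMNS if c not in row]
        if missing:
            problems.append(f"row {i}: missing {missing}")
    return problems


def peak_wavelength(rows: list[dict]) -> float:
    """Hardcoded 'tool': wavelength with the largest absolute ellipticity."""
    return max(rows, key=lambda r: abs(r["ellipticity"]))["wavelength_nm"]


def reshape(raw: list[dict]) -> list[dict]:
    """Stand-in for the generic code-exec step: rename messy columns."""
    return [{"wavelength_nm": r["wl"], "ellipticity": r["cd"]} for r in raw]


def run_tool(rows: list[dict], max_attempts: int = 2) -> float:
    """Try the tool; if the data is misshapen, reshape it and try again."""
    for _ in range(max_attempts):
        if not check_shape(rows):
            return peak_wavelength(rows)
        rows = reshape(rows)
    raise ValueError("could not shape data for the tool")


messy = [
    {"wl": 208.0, "cd": -12.5},
    {"wl": 222.0, "cd": -10.1},
    {"wl": 195.0, "cd": 8.3},
]
result = run_tool(messy)
print(result)
```

The analysis function itself never changes, so the only place a mistake can enter is the reshaping step, which is the "narrowed error scope" described above.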

epistasis•8h ago
This is great, I've been on the waitlist for their website for a while and am now excited to be able to try it out!
teenvan_1995•7h ago
I wonder if giving 150+ tools is really a good idea considering context limitations. Need to check out if this works IRL.
Herring•7h ago
There's an inner ToolRetriever, which is an LLM call to select the most relevant tools/data/libraries.
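As a rough illustration of why a retrieval step helps with context limits, here is a sketch that picks a few tools per task so only their descriptions need to go into the prompt. It uses plain word overlap as a stand-in for the LLM call and is not Biomni's ToolRetriever; apart from analyze_circular_dichroism_spectra (mentioned upthread), the tool names and all descriptions are hypothetical.

```python
# Hypothetical tool registry: name -> one-line description.
TOOLS = {
    "analyze_circular_dichroism_spectra": "analyze circular dichroism spectra of a protein sample",
    "align_sequences": "align dna or protein sequences",
    "query_variant_database": "look up genetic variants in a database",
}


def select_tools(task: str, tools: dict[str, str], top_k: int = 2) -> list[str]:
    """Rank tools by word overlap with the task and keep the top_k that match at all.

    Word overlap is a crude stand-in for the LLM-based selection step.
    """
    task_words = set(task.lower().split())
    scored = []
    for name, description in tools.items():
        overlap = len(task_words & set(description.lower().split()))
        scored.append((overlap, name))
    # Highest overlap first; ties broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for overlap, name in scored[:top_k] if overlap > 0]


selected = select_tools("align two protein sequences", TOOLS)
print(selected)
```

Only the selected subset would then be described to the agent, which keeps the prompt small even when the full registry has 150+ entries.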
dmezzetti•6h ago
Very interesting work!

If biomedical research and paper analysis is of interest to you, I've been working on a set of open source projects that enable RAG over medical literature for a while.

PaperAI: https://github.com/neuml/paperai

PaperETL: https://github.com/neuml/paperetl

There is also this tool that annotates papers inline.

AnnotateAI: https://github.com/neuml/annotateai
