frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: BioTradingArena – Benchmark for LLMs to predict biotech stock movements

https://www.biotradingarena.com/hn
1•dchu17•1h ago
Hi HN,

My friend and I have been experimenting with using LLMs to reason about biotech stocks. Unlike many other sectors, Biotech trading is largely event-driven: FDA decisions, clinical trial readouts, safety updates, or changes in trial design can cause a stock to 3x in a single day (https://www.biotradingarena.com/cases/MDGL_2023-12-14_Resmet...).

Interpreting these ‘catalysts,’ which comes in the form of a press release, usually requires analysts with previous expertise in biology or medicine. A catalyst that sounds “positive” can still lead to a selloff if, for example: the effect size is weaker than expected

- results apply only to a narrow subgroup

- endpoints don’t meaningfully de-risk later phases,

- the readout doesn’t materially change approval odds.

To explore this, we built BioTradingArena, a benchmark for evaluating how well LLMs can interpret biotech catalysts and predict stock reactions. Given only the catalyst and the information available before the date of the press release (trial design, prior data, PubMed articles, and market expectations), the benchmark tests to see how accurate the model is at predicting the stock movement for when the catalyst is released.

The benchmark currently includes 317 historical catalysts. We also created subsets for specific indications (with the largest in Oncology) as different indications often have different patterns. We plan to add more catalysts to the public dataset over the next few weeks. The dataset spans companies of different sizes and creates an adjusted score, since large-cap biotech tends to exhibit much lower volatility than small and mid-cap names.

Each row of data includes:

- Real historical biotech catalysts (Phase 1–3 readouts, FDA actions, etc.) and pricing data from the day before, and the day of the catalyst

- Linked Clinical Trial data, and PubMed pdfs

Note, there are may exist some fairly obvious problems with our approach. First, many clinical trial press releases are likely already included in the LLMs’ pretraining data. While we try to reduce this by ‘de-identifying each press release’, and providing only the data available to the LLM up to the date of the catalyst, there are obviously some uncertainties about whether this is sufficient.

We’ve been using this benchmark to test prompting strategies and model families. Results so far are mixed but interesting as the most reliable approach we found was to use LLMs to quantify qualitative features and then a linear regression of these features, rather than direct price prediction.

Just wanted to share this with HN. I built a playground link for those of you who would like to play around with it in a sandbox. Would love to hear some ideas and hope people can play around with this!

China Testing Nuclear Weapons and Covering Its Tracks, U.S. Alleges

https://www.twz.com/nuclear/china-secretly-testing-nuclear-weapons-and-covering-its-tracks-u-s-al...
1•DustinEchoes•3m ago•0 comments

Show HN: I let strangers vote on commands to a Claude Code instance with root

https://claudecrowd.clodhost.com/
1•zhoujianfu•4m ago•0 comments

A tenancy controller and experiment in AI driven product building

https://leebriggs.co.uk/blog/2026/02/06/landlord
1•jaxxstorm•4m ago•0 comments

A Refuge from the Sloppocalypse

https://www.cybrsecmedia.com/from-the-editor-a-refuge-from-the-sloppocalypse/
1•ohjeez•5m ago•0 comments

Show HN: Jourdle (Coop Wordle)

https://jourdle.com/
1•furyofantares•5m ago•0 comments

The Rumsfeld Matrix for AI

https://hollisrobbinsanecdotal.substack.com/p/the-rumsfeld-matrix
1•HR01•5m ago•0 comments

Opus of the People | Opus des Volkes

https://christopher-helm.com/opus-des-volkes-endlich-frei-von-kompetenz/
1•chelm•6m ago•0 comments

Microbiome-associated phenotypes that reshape agricultural sustainability

https://www.science.org/doi/10.1126/sciadv.aed3360
1•PaulHoule•6m ago•0 comments

Show HN: Structured devil's advocate code review as a Claude Code slash command

https://github.com/richiethomas/claude-devils-advocate
1•toomanyrichies•7m ago•0 comments

Welcome to the $600B AI era, where Big Tech is spending like it's a Gilded Age

https://www.businessinsider.com/amazon-google-meta-microsoft-boost-ai-spending-stocks-2026-2
2•zerosizedweasle•7m ago•0 comments

Claude Code and What Comes Next

https://www.oneusefulthing.org/p/claude-code-and-what-comes-next
1•speckx•7m ago•0 comments

NASA to Save $1.4B by Insourcing

https://twitter.com/NASAAdmin/status/2019823962465923366
1•trothamel•8m ago•0 comments

Show HN: If you lose your memory, how to regain access to your computer?

https://eljojo.github.io/rememory/
2•eljojo•11m ago•0 comments

Trump says he'll stop health care fraudsters. Last time, he let them walk

https://tennesseelookout.com/2025/04/02/trump-says-hell-stop-health-care-fraudsters-last-time-he-...
2•everybodyknows•12m ago•0 comments

How to effectively write quality code with AI

https://heidenstedt.org/posts/2026/how-to-effectively-write-quality-code-with-ai/
1•i5heu•13m ago•0 comments

Eli_kinsey_prefrontal_cortex.exe

https://ekinsey.dev/
1•Twixes•14m ago•0 comments

Wall Street Is Paywalling Your Kids' Sports

https://www.levernews.com/wall-street-is-paywalling-your-kids-sports/
2•ripe•15m ago•0 comments

MCP Directories or Gateways

https://mcpstatus.io/blog/202601-mcp-server-directories
1•s-xyz•15m ago•0 comments

AMD's 96-core beast with watercooling engraved into CPU

https://www.tomshardware.com/pc-components/cpus/amds-96-core-beast-with-watercooling-engraved-int...
1•rbanffy•15m ago•0 comments

256 TB/s data rates over 200 km distance on single mode fiber optic

https://twitter.com/ID_AA_Carmack/status/2019839335382790342
1•tosh•16m ago•0 comments

Pushing the Intel Panther Lake CPU Performance Further on Linux Review

https://www.phoronix.com/review/intel-panther-lake-linux-power
1•rbanffy•16m ago•0 comments

Quantum Twins: Silicon's Leap in Analog Simulation

https://spectrum.ieee.org/quantum-twins
1•rbanffy•16m ago•0 comments

Fun with Shell Emojis

https://www.lasantha.org/blog/fun-with-shell-emojis/
1•kiriberty•17m ago•1 comments

AI-invented cryptocurrencies and a dark GitHub

https://darksource.ai
2•DarkSource•17m ago•1 comments

Clelp – AI skills directory where AI agents write the reviews

https://clelp.ai
1•jhaugh•18m ago•2 comments

Virtualization Support to Ironclad

https://blog.ironclad-os.org/introducing-virtualization-support-to-ironclad/
1•jaypatelani•19m ago•0 comments

0-Days \ Red.anthropic.com

https://red.anthropic.com/2026/zero-days/
3•msolujic•19m ago•0 comments

An archaeologist's guide to Christopher Nolan's 'The Odyssey' trailer [video]

https://www.youtube.com/watch?v=4gxpQ6G3y6E
2•incomplete•20m ago•0 comments

Agentic NetOps: How to beat the cloud monsters at their own game

https://www.kentik.com/blog/agentic-netops-how-to-beat-the-cloud-monsters-at-their-own-game/
1•oavioklein•20m ago•0 comments

Show HN: Generate Amazon product images from a photo in 30 seconds

https://greenonion.ai
1•yanjiechg•20m ago•0 comments