
Ask HN: Mining Scientific Papers

2•davidbjaffe•1h ago
What are people's experiences with using LLMs to mine information from scientific papers?

My own experience: I first attempted to extract the anti-drug antibody (ADA) rate from each of 3730 clinical-trial papers, all indexed in PubMed. I started from PDFs. Claude Opus 4.7 analyzed each PDF using a written rules doc that we had formulated. Running all the papers took about a week because I kept hitting session limits; the total cost was ~$25 (USD). We got actual rates from 909 papers. The rest were mostly cases where the rate was not present or did not meet our criteria, including administering only one drug at a time.

I read thirty of the papers and re-read those where I got a different answer from Claude, concluding that it had erred one time and I had erred three times.
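A thirty-paper spot check leaves a fair amount of statistical uncertainty. As a rough sketch (not part of the original analysis), a Wilson score interval on the counts above gives a sense of the range. It assumes the thirty papers were a random sample and that answers on which the reader and Claude agreed were correct. The function name `wilson_interval` is illustrative.

```python
import math

def wilson_interval(successes: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (z=1.96 is roughly 95%)."""
    if n <= 0:
        raise ValueError("n must be positive")
    p = successes / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return max(0.0, center - half), min(1.0, center + half)

# Counts from the spot check: 30 papers, Claude wrong once, the human reader wrong three times.
claude_low, claude_high = wilson_interval(29, 30)
human_low, human_high = wilson_interval(27, 30)
print(f"Claude: 29/30 correct, 95% interval {claude_low:.2f} to {claude_high:.2f}")
print(f"Human:  27/30 correct, 95% interval {human_low:.2f} to {human_high:.2f}")
```

The two intervals overlap heavily, so a sample of thirty cannot separate the two error rates; a larger hand-checked sample would be needed for that.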

So this works, but it is not entirely convenient: session limits mean that I can't start it up and walk away, or perhaps I just don't know how to engineer that capability. In addition, I was curious how local models would perform.
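One common way to get "start it and walk away" behavior is a resumable loop that checkpoints each result and backs off when a limit is hit. The following is a minimal sketch under assumptions, not the setup used for the run described above: `extract` stands in for whatever call analyzes one paper, and `RateLimited` is a hypothetical exception that call would raise when the backend asks the caller to wait.

```python
import json
import time
from pathlib import Path
from typing import Callable, Iterable

class RateLimited(Exception):
    """Hypothetical signal that the backend wants the caller to wait."""

def run_batch(
    paper_ids: Iterable[str],
    extract: Callable[[str], dict],
    out_path: Path,
    max_retries: int = 8,
    base_delay: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Process each paper once, appending results to a JSON Lines file.

    Papers already present in out_path are skipped, so the script can be
    killed and restarted without redoing work. Returns the number of
    papers processed in this run.
    """
    done = set()
    if out_path.exists():
        with out_path.open() as f:
            done = {json.loads(line)["paper_id"] for line in f if line.strip()}

    processed = 0
    with out_path.open("a") as out:
        for paper_id in paper_ids:
            if paper_id in done:
                continue
            for attempt in range(max_retries):
                try:
                    result = extract(paper_id)
                    break
                except RateLimited:
                    # Exponential backoff, capped at one hour.
                    sleep(min(base_delay * 2 ** attempt, 3600.0))
            else:
                raise RuntimeError(f"gave up on {paper_id} after {max_retries} retries")
            out.write(json.dumps({"paper_id": paper_id, **result}) + "\n")
            out.flush()  # push each result to the OS before moving on
            processed += 1
    return processed
```

Because finished papers are read back from the output file on startup, rerunning the same command after a crash or a long limit window picks up where the previous run stopped.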

To that end I tried Llama 3.3 70B on my Mac M5 Max (128 GB memory). I used Ollama with Q4_K_M quantization, a 128k context, and ~80k input tokens after pdftotext -layout.
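For readers who want to try a similar local setup, here is a hedged sketch of the pipeline: `pdftotext -layout` to get text, then one non-streaming request to Ollama's local `/api/generate` endpoint with `num_ctx` raised to 128k. The model tag `llama3.3:70b`, the prompt layout, and the function names are assumptions for illustration; the exact prompt and settings used in the run above are not given.

```python
import json
import subprocess
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint

def pdftotext_cmd(pdf_path: str) -> list[str]:
    """Command that writes layout-preserving text to stdout ('-')."""
    return ["pdftotext", "-layout", pdf_path, "-"]

def build_request(paper_text: str, rules: str, model: str = "llama3.3:70b") -> dict:
    """JSON body for a single non-streaming generation (model tag is an assumption)."""
    return {
        "model": model,
        "prompt": f"{rules}\n\n--- PAPER ---\n{paper_text}",
        "stream": False,
        "options": {"num_ctx": 131072},  # 128k-token context window
    }

def extract_from_pdf(pdf_path: str, rules: str) -> str:
    """Convert one PDF to text and ask the local model about it."""
    text = subprocess.run(
        pdftotext_cmd(pdf_path), check=True, capture_output=True, text=True
    ).stdout
    body = json.dumps(build_request(text, rules)).encode("utf-8")
    req = urllib.request.Request(
        OLLAMA_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

Calling `extract_from_pdf("paper.pdf", rules_text)` requires `pdftotext` on the PATH and a running Ollama server with the model already pulled.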

One paper took 18 minutes, and the model was unable to determine the ADA rate even though it is clearly stated in the paper. One paper is not a proper benchmark, but the model is too slow to run a proper test. Clearly part of the speed issue is that Claude has access to a server farm, whereas I'm running on just one Mac. This is part of the practical problem that anyone would face with local computation.
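To put the speed problem in numbers, here is a back-of-the-envelope estimate. It assumes papers are processed one after another and that each takes as long as the single paper measured, which one data point cannot confirm.

```python
MINUTES_PER_PAPER = 18      # the single observed run
TOTAL_PAPERS = 3730         # size of the full corpus
SPOT_CHECK_PAPERS = 30      # size of the hand-checked sample

def wall_clock_days(papers: int, minutes_each: float) -> float:
    """Sequential wall-clock time in days."""
    return papers * minutes_each / 60 / 24

full_days = wall_clock_days(TOTAL_PAPERS, MINUTES_PER_PAPER)
spot_hours = SPOT_CHECK_PAPERS * MINUTES_PER_PAPER / 60
print(f"full corpus: about {full_days:.1f} days")        # about 46.6 days
print(f"30-paper sample: about {spot_hours:.0f} hours")  # about 9 hours
```

Under those assumptions, even rerunning the thirty hand-checked papers would take a full working day on this machine, and the whole corpus would take more than six weeks.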

What is the state of the art on this type of problem, for answering questions one paper at a time or using many papers at once? I'd love to hear success stories!

Comments

smartypant•1h ago
Why don't you use DeepSeek V4 or Kimi K2.5 or K2.6? They are really good models and you will not hit these token limits.

The US Tech Giant Where Employees Wear IDF Uniforms to Work

https://www.donotpanic.news/p/exclusive-the-us-tech-giant-where
2•sosomoxie•58s ago•0 comments

At Protocol: Building the Social Internet

https://atproto.com/
1•resiros•3m ago•0 comments

Codex and ForgeCAD: Generating a Model of the Teenage Engineering KO II

https://twitter.com/theopuslabs/status/2049195007404380244
1•opuslabs•3m ago•0 comments

NASA chief Jared Isaacman says he's fighting for Pluto

https://www.space.com/astronomy/pluto/nasa-chief-jared-isaacman-says-hes-fighting-for-pluto-i-am-...
2•thunderbong•5m ago•0 comments

Better Hardware Could Turn Zeros into AI Heroes

https://spectrum.ieee.org/sparse-ai
1•Brajeshwar•6m ago•0 comments

Anaconda Acquires Outerbounds to Unify AI-Native Development

https://www.anaconda.com/blog/anaconda-acquires-outerbounds
1•htrp•6m ago•0 comments

Potemkin Village

https://en.wikipedia.org/wiki/Potemkin_village
1•rbanffy•7m ago•0 comments

Show HN: VT Code – Rust coding agent with AST-level code intelligence

https://github.com/vinhnx/VTCode
1•vinhnx•7m ago•0 comments

Nikita Bier Runs X. Give Me a Few Hours. Iranian flag change and account purge

https://dannykpolitics.substack.com/p/part-two-the-pattern-nikita-biers
4•logcode•7m ago•0 comments

FastCGI: 30 Years Old and Still the Better Protocol for Reverse Proxies

https://www.agwa.name/blog/post/fastcgi_is_the_better_protocol_for_reverse_proxies
2•agwa•7m ago•0 comments

TI-84 Evo

https://education.ti.com/en/products/calculators/graphing-calculators/ti-84-evo
2•kermatt•7m ago•0 comments

Customer.io told me to delete 80% of my list. Rebuilt it with Claude in 27 days

https://twitter.com/JakeMRuth/status/2049521900464791604
1•hippofluff•8m ago•0 comments

Maximising the Value of Ajinomoto

https://mms.businesswire.com/media/20260331226478/en/2761328/1/EN_Palliser_-_Ajinomoto_Value_Enha...
1•num42•8m ago•0 comments

30 ClawHub skills secretly turn AI agents into a crypto swarm

https://www.theregister.com/2026/04/29/30_clawhub_skills_mine_crypto/
1•Bender•8m ago•0 comments

Ramping Figure 03 Production

https://www.figure.ai/news/ramping-figure-03-production
1•denysvitali•8m ago•0 comments

Superpower for Gemini – Chrome Extension

https://superpowerforai.com/Gemini/Home/
1•Kindly_Revenue•9m ago•0 comments

NASA Boss: Make Pluto a Planet Again

https://www.theregister.com/2026/04/29/nasa_boss_make_pluto_a_planet_again/
1•LorenDB•10m ago•0 comments

Is there any way to stop getting AI made video suggestions in YouTube?

2•tukunjil•12m ago•2 comments

Why Math's Final Axiom Proved So Controversial

https://www.quantamagazine.org/why-maths-final-axiom-proved-so-controversial-20260429/
1•Tomte•13m ago•0 comments

Cyberdeck Design Log #1

https://strangelyentangled.com/blog/cyberdeck-design-log1/
1•abnercoimbre•14m ago•0 comments

Canada Proposes Poet Mission to Hunt Earth-Sized Planets

https://www.universetoday.com/articles/canada-proposes-poet-mission-to-hunt-earth-sized-planets
1•rbanffy•14m ago•0 comments

Session-Surface Protocol v0.1: A draft spec for private surfaces in LUIs

https://www.curatedfuture.com/the-session-surface-protocol/
1•reyperalta•15m ago•0 comments

Show HN: Chrome extension for Gmail/Workspace users to alias emails at signup

https://zaai.com/clean-autofill/
1•manuelgruber•15m ago•0 comments

Court Rules 2nd Amendment Covers Firearms Parts Good News Those Who Build Guns

https://cowboystatedaily.com/2026/04/28/court-rules-2nd-amendment-covers-firearms-parts-good-news...
7•Bender•16m ago•1 comments

Why TVs Are Getting Uncomfortably Bright, and Here's Why

https://www.cnet.com/tech/home-entertainment/tvs-are-getting-brighter-we-tested-them-but-why-is-t...
1•pseudolus•16m ago•0 comments

Show HN: TripBalls – plan road trips to away games (MLB, NFL, NBA, WC2026)

https://tripballs.now/
1•sanjosanjo•17m ago•0 comments

CPanel, WHM emergency update fixes critical auth bypass bug

https://www.bleepingcomputer.com/news/security/cpanel-whm-emergency-update-fixes-critical-auth-by...
1•cdrnsf•17m ago•0 comments

Communicating Our Research with Stakeholders to Achieve Alignment and Trust

https://blog.ptidej.net/ghost/#/editor/post/699bd9175e8d158bfbb87c42
1•Minette•17m ago•1 comments

DESI Completes Its Epic 3D Map, Hinting That Dark Energy Might Be Changing

https://www.universetoday.com/articles/desi-completes-its-epic-3d-map-hinting-that-dark-energy-mi...
1•rbanffy•17m ago•0 comments

Show HN: Ccmeter – local-first cost and cache dashboard for Claude Code

https://github.com/vnmoorthy/ccmeter
1•vnmoorthy•18m ago•0 comments