frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: How to exhaustively search the scientific literature?

3•cossatot•2h ago
I have a need for a comprehensive database of a certain type of event described in the scientific literature. For what it's worth, the event is a 'paleoearthquake', which is a historic or prehistoric earthquake that is found in the geologic record, usually by digging a trench across a fault line and identifying the disturbances in the geologic strata across or adjacent to the fault and, if possible, dating them via radiocarbon or other geochronological methods. However I don't think the specifics are particularly important.

The issue is that these are generally reported in the literature from local investigations of one or two faults, yielding a few events. These studies are done wherever there are earthquakes on land, so we have a global scope and language issues. Even limiting the results to the English peer-reviewed literature, however, it's a huge distributed search.

I estimate that there are on the order of 10,000 published events, and a mean of 2-3 events per publication.

For my immediate use of the database, it is very important for the database to be as complete as possible--I'm not looking for a sort of statistically representative sample. The literature itself is quite incomplete of course, but we're limited to what exists for now.

Starting with the first step of collating publications, what tools would one use? I have access to most journals through various university affiliations. Are there particular APIs? Web scraping tools? LLMs?

Thanks!

Comments

CamperBob2•1h ago
One option that shouldn't be overlooked: get a temporary subscription to an OpenAI model that allows you to run what they originally called "deep research" (nowadays called "Extended Pro" mode.) This isn't available on the freebie chat page, it will require at least a $20/month subscription (and maybe more, not sure.)

Then, basically paste your post into the prompt and let it crunch. It will take up to 30 minutes or so, and will often give you a reasonably comprehensive report in which most of the references actually exist. It is absolutely a better-Google-than-Google class of resource.

I'll do that and see if it comes up with anything meaningful, and also try it on Gemini 3.1. For a query like this I wouldn't expect it to return a list of thousands of individual reports, but it might give you some good leads that you can follow up with your existing journal access.

Edit:

GPT results: https://chatgpt.com/share/699df5db-b3d4-800b-b737-224319593e...

Gemini 3.1 Pro results: https://gemini.google.com/share/bd22eb43c13b

cossatot•1h ago
Thanks. I've got an OpenAI subscription and tried this in the past, and got a handful of results, but nothing comprehensive. Perhaps it is better now, or I could change the way I ask.
CamperBob2•1h ago
No prob, see if there's anything useful in any of the links I added to the post. I'm always interested in good benchmarks and test cases, as I usually don't have enough of my own to justify my expensive pro subscriptions. (I did not review them myself as I don't know what I'm looking at.)

Show HN: MantleDB – Anonymous JSON storage for your side projects

https://mantledb.sh/
1•moonwizard•4m ago•0 comments

Leaks point to Nvidia's N1/N1X launching sometime in the first half of 2026

https://www.tomshardware.com/pc-components/cpus/nvidias-n1-n1x-chips-leak-once-again-this-time-ti...
2•Tuldok•6m ago•0 comments

Perplexity.ai tries to connect via UDP without being open

2•roscas•6m ago•0 comments

nsnotifyd-2.4 released

https://dotat.at/@/2026-02-24-nsnotifyd-2-4-released.html
3•fanf2•8m ago•0 comments

Show HN: I built an iOS app that turns EPUBs into audiobooks

https://apps.apple.com/ua/app/audiobooks-mp3-m4b-player/id6471399965
3•pklym•8m ago•0 comments

Paediatricians' blood used to make new treatments for RSV and colds

https://www.newscientist.com/article/2516079-paediatricians-blood-used-to-make-new-treatments-for...
2•MaysonL•8m ago•0 comments

Basis raises $100M Series B at a $1.15B valuation led by Accel alongside GV

https://www.getbasis.ai/blogs/basis-raises-100m-series-b-led-by-accel-and-google-ventures
2•petethomas•11m ago•0 comments

What Happens When a Neighborhood Is Built Around a Farm?

https://reasonstobecheerful.world/agrihoods-neighborhoods-built-around-farms/
3•PaulHoule•11m ago•0 comments

Software stocks rebound as Anthropic announces new partnerships

https://www.cnbc.com/2026/02/24/software-stocks-anthropic-ai.html
2•kristianp•12m ago•0 comments

An Open Letter Opposing Android Developer Verification

https://f-droid.org/2026/02/24/open-letter-opposing-developer-verification.html
4•lu4p•13m ago•0 comments

Russia opens criminal case into Telegram founder Pavel Durov

https://www.theguardian.com/world/2026/feb/24/russia-criminal-case-telegram-founder-pavel-durov
2•mitchbob•13m ago•0 comments

Show HN: Claude Code Canvas

https://github.com/raulriera/claude-code-canvas
3•raulriera•13m ago•1 comments

The quixotic team trying to build a world in a 20-year-old game

https://arstechnica.com/gaming/2026/02/inside-the-quixotic-team-trying-to-build-an-entire-world-i...
2•nxobject•13m ago•0 comments

Mlx-ONNX: Run your MLX models in the browser using WebGPU

https://github.com/skryl/mlx-onnx
2•skryl•13m ago•1 comments

Looking 4 open-source knowledge base and project management tool 4 personal use

2•TheAlgorist•14m ago•0 comments

Adaptive Data

https://www.adaptionlabs.ai/blog
3•sethbannon•16m ago•0 comments

How to talk to anyone – and why you should (The Guardian)

https://www.theguardian.com/lifeandstyle/2026/feb/24/stranger-secret-how-to-talk-to-anyone-why-yo...
1•Looky1173•16m ago•0 comments

Why demand for beds at WA psychiatric hospitals continues to surge

https://www.seattletimes.com/seattle-news/mental-health/why-demand-for-beds-at-wa-psychiatric-hos...
1•petethomas•16m ago•0 comments

Disrupting 59M Malicious Imps: Inside D-Shortiez Testing Infra and Campaign Mgmt

https://confiant.substack.com/p/disrupting-59m-malicious-impressions
1•prettyblocks•16m ago•0 comments

How we rebuilt Next.js with AI in one week

https://blog.cloudflare.com/vinext/
8•ghostwriternr•17m ago•0 comments

Oblique Strategies

https://en.wikipedia.org/wiki/Oblique_Strategies
3•doruk101•18m ago•0 comments

Graph to Hyperspace: How Daimon Replaced Knowledge Graph with 10k-Bit Vectors

https://blog.brojo.ai/from-graph-to-hyperspace-how-daimon-replaced-its-knowledge-graph-with-10-00...
1•bojo•18m ago•0 comments

Discord delay global rollout of age verification to improve transparency

https://www.gamingonlinux.com/2026/02/discord-delay-global-rollout-of-age-verification-to-improve...
5•speckx•19m ago•1 comments

I Fixed Spotify Shuffle

https://spindles.me/
2•ViktorOsadsky•19m ago•0 comments

HashiCorp limits free tier to 500 managed resources

https://www.hashicorp.com/en/blog/continuing-hcp-terraform-s-enhanced-free-tier-experience
2•alexboden•19m ago•0 comments

Holy Cowtown: On Nadia Lee Cohen's "Holy Ohio"

https://clereviewofbooks.com/holy-cowtown-on-nadia-lee-cohens-holy-ohio/
1•podracingchamp•20m ago•0 comments

OpenAI makes GPT-5.3-Codex available through their API

https://developers.openai.com/api/docs/models/gpt-5.3-codex
1•rbranson•21m ago•0 comments

Éliane Radigue has died at 94

https://cdm.link/eliane-radigue-portraits/
2•NaOH•21m ago•0 comments

We Are Changing Our Developer Productivity Experiment Design

https://metr.org/blog/2026-02-24-uplift-update/
2•ej88•22m ago•2 comments

Dealing with the pressure to adopt AI as a designer

https://www.mynameismartin.co.uk/blog/how-im-dealing-with-the-pressure-to-adopt-ai-as-a-designer
2•pentagrama•23m ago•0 comments