frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

AI for Scientific Search

https://arxiv.org/abs/2507.01903
75•omarsar•7h ago

Comments

mixedmath•5h ago
From the title, I had thought that this would be a new tool for searching science, such as searching the arxiv. But this is actually a survey.

I quote the conclusion of the survey:

---

In conclusion, rapid advancements in artificial intelligence, particularly large language models like OpenAI-o1 and DeepSeek-R1, have demonstrated substantial potential in areas such as logical reasoning and experimental coding. These developments have sparked increasing interest in applying AI to scientific research. However, despite the growing potential of AI in this domain, there is a lack of comprehensive surveys that consolidate current knowledge, hindering further progress. This paper addresses this gap by providing a detailed survey and unified framework for AI4Research. Our contributions include a systematic taxonomy for classifying AI4Research tasks, identification of key research gaps and future directions, and a compilation of open-source resources to support the community. We believe this work will enhance our understanding of AI’s role in research and serve as a catalyst for future advancements in the field.

---

I jumped at this because I'm a mathematician who has been complaining about the lack of effective mathematical search for several years.

Davidzheng•3h ago
How do you view o3? I personally find it superior to google search almost always. Do you find that it often misses key references? (also mathematician)
masterjack•24m ago
Have you found https://sugaku.net/ useful? It’s focused on math research
gavinray•5h ago
I was hoping for this to announce a tool for research.

Anyone know of the best way to do something like:

"Find most relevant papers related to topic XYZ, download them, extract metadata, generate big-picture summary and entity-relationship graph"?

Having a nice workflow for this would be the best thing since sliced bread for hobbyists interested in niche science topics.

Recently found https://minicule.com which is free and lets you search + import, but it focuses more on "concept-extraction" than LLM synthesis/summary.

AustinBGibbons•4h ago
Check out https://elicit.com/
gavinray•3h ago
Seems potentially useful, thanks! Only drawback I can see is the small number of papers provided by the free plan, but that's reasonable I suppose.
hugeBirb•4h ago
I've been trying to tackle this exact problem. Current process is to use exa.ai to collect a wide breadth of research papers. Do a summarization pass and convert to markdown. Search for more specific terms then give the relevant papers/context to Gemini 2.5 pro and say give me a summary. Looking for very specific resources and to be honest it's been a terrible process :|
kianN•3h ago
Linking to a nearby thread in case this is helpful: https://news.ycombinator.com/item?id=44457928
dmezzetti•4h ago
PaperAI is also an option if you prefer open-source: https://github.com/neuml/paperai

Disclaimer: I'm the primary author of this project.

kianN•3h ago
I built a public literature review search tool for some graduate student friends that became pretty popular in the Santa Barbara area. It actually does exactly what you are describing.

It’s not neural network based: it leverages hierarchical mixture models to give a statistical overview of the data. It lets you build these analysis graphs via search or citation networks.

Example: https://platform.sturdystatistics.com/deepdive?search_type=e...

gavinray•3h ago
This is genuinely incredible, tried it using a recent-ish paper on the pharmacology and mechanisms of the Androgen Receptor and my mind is blown:

https://platform.sturdystatistics.com/deepdive?fast=1&q=http...

andjar•3h ago
A while ago, I started working on two R packages for creating 'living reviews': metawoRld and DataFindR, see https://andjar.github.io/metawoRld/articles/conceptual_overv... . You do the broad literature search yourself, but the idea is to use LLMs to select relevant studies and perform data extraction in a structured, reproducible manner. The extracted data is stored in a git repository for collaboration and version tracking, with automated validation and website generation for presenting results.
TechDebtDevin•2h ago
"Structured and Reproducable"
tkuipers•3h ago
I’ve found a lot of success with https://www.undermind.ai/ though I’m not sure it has the graph you’re looking for
gavinray•2h ago
This also looks excellent, thank you!
whattheheckheck•2h ago
Connectedpapers.com
fabmilo•2h ago
I like zotero, I started vibe coding some integration for my workflow, the project is a bit clunky to build and iterate the development specially with gemini & claude. But I think that is the direction to take instead of reinvent from scratch something
scientific_ass•1h ago
Was expecting a product I can try out. But still, not disappointed.
bossyTeacher•17m ago
AI for Scientific Search yes. LLM for Scientific Search I am not sure. AI is not equivalent with LLM. I dislike it when people do it.

AI will have a brand crisis once LLMs get abandoned and researchers need to explain the public that the new AI (not LLM based) is different than the old AI (LLM based) which is different from the old AI (GOFAI)

Lingerie Fighting Championships to Add $2M in Bitcoin to Treasury

https://www.coindesk.com/markets/2025/06/26/lingerie-fighting-championships-to-add-2m-in-bitcoin-to-treasury-ahead-of-expansion-to-uk
1•PaulHoule•1m ago•0 comments

Congress passes 'big, beautiful bill'

https://www.bbc.co.uk/news/articles/cddz3n6vz0go
2•masfuerte•1m ago•0 comments

Coding in Advent

https://panadestein.github.io/blog/posts/aoc24.html
1•todsacerdoti•4m ago•0 comments

Call with Current Continuation (2018)

http://dmitrykandalov.com/call-with-current-continuation
1•gone35•4m ago•0 comments

The Rise of "Context Engineering"

https://blog.langchain.com/the-rise-of-context-engineering/
1•sebg•6m ago•0 comments

The engineer burning through tech by working at 3 to 4 startups simultaneously

https://www.theverge.com/news/697846/soham-parekh-startups-multiple-jobs-founders
1•alienreborn•8m ago•1 comments

Michael Madsen Has Died

https://www.nytimes.com/2025/07/03/movies/michael-madsen-dead.html
2•anigbrowl•9m ago•0 comments

Serial SPI RAM Emulation on Raspberry Pi Pico RP2040 MCU

https://github.com/MichaelBell/spi-ram-emu
1•transpute•15m ago•0 comments

Extending That XOR Trick to Billions of Rows

https://nochlin.com/blog/extending-that-xor-trick
1•hundredwatt•21m ago•0 comments

Create llms.txt files for any website in seconds

https://llmstxtgenerator.org/
1•indigodaddy•22m ago•0 comments

Ask HN: Any purely data driven public policy research think tanks?

1•ishaanbhagwat•24m ago•0 comments

Self-optimisers: why 'microefficiencies' are on the rise

https://www.theguardian.com/lifeandstyle/2025/jul/03/the-secrets-of-self-optimisers-why-microefficiencies-are-on-the-rise
3•devonnull•25m ago•0 comments

Can cheese turn your dreams into nightmares?

https://www.frontiersin.org/journals/psychology/articles/10.3389/fpsyg.2025.1544475/full
1•domofutu•25m ago•0 comments

How to Surf the Web in 2025, and Why You Should

https://www.raptitude.com/2025/06/how-to-surf-the-web-in-2025-and-why-you-should/
1•sebg•25m ago•0 comments

Complected and Orthogonal Persistence

https://jyn.dev/complected-and-orthogonal-persistence/
2•Bogdanp•28m ago•0 comments

Harvesting 5oz/d H2O in Death Valley with meter scale panel using only sunlight

https://www.thebrighterside.news/post/mits-high-tech-hydrogel-window-pulls-clean-drinking-water-from-dry-desert-air/
3•gsf_emergency_2•28m ago•0 comments

Discovering cognitive strategies with tiny recurrent neural networks

https://www.nature.com/articles/s41586-025-09142-4
2•sebg•30m ago•0 comments

Apple gets punchy in Brussels DMA compliance workshop

https://brucelawson.co.uk/2025/up-the-kriek-apple-gets-punchy-in-brussels-dma-compliance-workshop/
3•edent•32m ago•0 comments

Ask HN: Should I use microservices or monolithic architecture?

3•its_kritix•34m ago•2 comments

Garmin patents a sensor that can detect dehydration levels

https://www.yankodesign.com/2025/07/03/garmin-patents-a-sensor-that-can-detect-dehydration-levels-and-remind-you-to-drink-water/
1•exiguus•36m ago•0 comments

Building Organizational Capacity for Large-Scale Change

https://www.psychologytoday.com/us/blog/the-emergency-mind/202507/building-organizational-capacity-for-large-scale-change
1•exiguus•39m ago•0 comments

Satellite evidence points to heightened operations at N. Korean nuclear site

https://www.dailynk.com/english/satellite-evidence-points-heightened-operations-north-korean-nuclear-site/
3•giuliomagnifico•41m ago•0 comments

Performance Profiling on AMD GPUs – Part 1: Foundations

https://rocm.blogs.amd.com/software-tools-optimization/profiling-guide/intro/README.html
1•ibobev•45m ago•0 comments

UFOs, Aliens, and the Unknown Other

https://www.psychologytoday.com/us/blog/evil-deeds/202501/ufos-aliens-and-the-unknown-other
2•exiguus•48m ago•0 comments

US Job Growth Picks Up

https://www.bloomberg.com/news/articles/2025-07-03/us-payroll-growth-beats-forecasts-jobless-rate-drops-to-4-1
3•Redoubts•49m ago•1 comments

Real-Time Importance Deep Shadows Maps with Hardware Ray Tracing

https://diglib.eg.org/items/ff5055b6-be32-414a-8d63-41fdb7296e10
1•ibobev•50m ago•0 comments

LoRA Fine-Tuning Without GPUs

https://arxiv.org/abs/2507.01806
1•elashri•51m ago•0 comments

UPS Offers Buyouts to Drivers, a First in Its 117-Year History

https://www.wsj.com/business/logistics/ups-driver-buyout-first-117-years-f3be772c
4•bookofjoe•52m ago•1 comments

Caddy module to help match against DNS blocklists

https://git.madhouse-project.org/caddy/http.matchers.dnsbl
3•gslin•53m ago•0 comments

Show HN: Claude Code History Viewer for macOS

https://github.com/jhlee0409/claude-code-history-viewer/releases
2•jackleee•53m ago•1 comments