Semantic search engine for ArXiv, biorxiv and medrxiv

https://arxivxplorer.com/
80•0101111101•6h ago

Comments

elliotec•4h ago
This is really cool, and very relevant to something I'm working on. Would you be willing to do a quick explanation of the build?
0101111101•3h ago
Sure! I first computed OpenAI embeddings for all the paper titles, abstracts and authors. When a user submits a search query, I embed the query, find the closest matching papers and return those results. Nothing too fancy involved!

I'm also maintaining a dataset of all the embeddings on Kaggle if you want to use them yourself: https://www.kaggle.com/datasets/tomtum/openai-arxiv-embeddin...
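
A minimal sketch of the "embed the query, find the closest papers" step described above. It assumes the embeddings are already computed and that closeness is measured with cosine similarity; the thread does not say which metric or index the site actually uses. The toy 3-dimensional vectors, the paper IDs, and the names `cosine_similarity` and `top_k` are all hypothetical.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as matching nothing
    return dot / (norm_a * norm_b)

def top_k(query_vec, paper_vecs, k=3):
    """Return the k (paper_id, score) pairs most similar to query_vec."""
    scored = [(pid, cosine_similarity(query_vec, vec))
              for pid, vec in paper_vecs.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]

# Toy vectors standing in for real paper embeddings (hypothetical data).
PAPERS = {
    "paper-a": [1.0, 0.0, 0.0],
    "paper-b": [0.7, 0.7, 0.0],
    "paper-c": [0.0, 0.0, 1.0],
}

# In a real system the query vector would come from the same embedding
# model that produced the paper vectors.
results = top_k([0.9, 0.1, 0.0], PAPERS, k=2)
for paper_id, score in results:
    print(paper_id, round(score, 3))
```

A brute-force scan like this is fine for a sketch; a corpus the size of arXiv would more likely need a vectorized or approximate nearest-neighbor index.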

heisenburgzero•40m ago
So did you combine title + abstract + authors into a single chunk and embed that, or did you embed them individually?
madars•4h ago
Looks great! Could you add eprint.iacr.org (Cryptology ePrint Archive)?
0101111101•3h ago
Do they have a public API/dataset?
madars•2h ago
They have RSS feeds for new/updated papers: https://eprint.iacr.org/rss/
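
A rough sketch of how entries from an RSS feed like the one linked above could be turned into records ready for embedding. It assumes a standard RSS 2.0 layout (`item` elements with `title`, `link` and `description`); the actual structure of the eprint.iacr.org feed is not shown in the thread, so the sample XML and the `parse_items` helper are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample in plain RSS 2.0 form, not real feed content.
SAMPLE_RSS = """<rss version="2.0">
  <channel>
    <title>Example feed</title>
    <item>
      <title>Example paper one</title>
      <link>https://example.org/paper/1</link>
      <description>Abstract of the first example paper.</description>
    </item>
    <item>
      <title>Example paper two</title>
      <link>https://example.org/paper/2</link>
      <description>Abstract of the second example paper.</description>
    </item>
  </channel>
</rss>"""

def parse_items(rss_text):
    """Extract title, link and description from every <item> in the feed."""
    root = ET.fromstring(rss_text)
    items = []
    for item in root.iter("item"):
        items.append({
            "title": item.findtext("title", default="").strip(),
            "link": item.findtext("link", default="").strip(),
            "description": item.findtext("description", default="").strip(),
        })
    return items

items = parse_items(SAMPLE_RSS)
for entry in items:
    print(entry["title"], "->", entry["link"])
```

Since an RSS feed only covers new and updated papers, this would handle incremental updates rather than a full backfill.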
bbor•3h ago
Oh god, there's a medrxiv?? TIL...

Don't forget ChemRxiv!

0101111101•3h ago
Sadly I couldn't find a public API for chemrxiv, but would be happy to be proven wrong!
sitkack•3h ago
Embedding search via https://searchthearxiv.com/ takes either a word vector or an abs or PDF link to an arXiv paper.

https://news.ycombinator.com/item?id=42519487

I just did a spot check, and I think searchthearxiv's search results are superior.

0101111101•3h ago
Looks cool! You can input either a search query or a paper URL on arxiv xplorer. You can even combine paper URLs to search for combinations of ideas by putting + or - before the URL, like `+ 2501.12948 + 1712.01815`
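
One plausible way to read the `+`/`-` syntax above is as signed vector arithmetic over paper embeddings. The sketch below assumes exactly that; the thread does not describe how arxiv xplorer actually combines papers. The helpers `parse_signed_ids` and `combine` and the toy 2-dimensional embeddings are hypothetical, and the parser only handles bare arXiv IDs, not full URLs.

```python
import re

def parse_signed_ids(query):
    """Turn '+ 2501.12948 - 1712.01815' into [(1.0, id), (-1.0, id)]."""
    terms = []
    for sign, paper_id in re.findall(r"([+-])\s*(\d{4}\.\d{4,5})", query):
        terms.append((1.0 if sign == "+" else -1.0, paper_id))
    return terms

def combine(terms, embeddings):
    """Add or subtract each paper's vector according to its sign."""
    dim = len(next(iter(embeddings.values())))
    total = [0.0] * dim
    for weight, paper_id in terms:
        for i, value in enumerate(embeddings[paper_id]):
            total[i] += weight * value
    return total

# Toy embeddings keyed by the IDs from the example query (made-up values).
EMBEDDINGS = {
    "2501.12948": [1.0, 0.0],
    "1712.01815": [0.0, 2.0],
}

terms = parse_signed_ids("+ 2501.12948 + 1712.01815")
query_vec = combine(terms, EMBEDDINGS)
print(terms)
print(query_vec)
```

The combined vector would then be used as the query in an ordinary nearest-neighbor search.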
masterjack•2h ago
There’s also the search and browsing on https://sugaku.net. It’s more focused on math, but it does also have all of the arXiv on it.
nblgbg•1h ago
Just curious, are there any techniques other than using embeddings, computing cosine similarity, and sorting the results based on that? RRF could be used, but again it's very simple as well.
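
For reference, a minimal sketch of reciprocal rank fusion (RRF) as mentioned above: each document scores the sum of 1 / (k + rank) over every ranking it appears in. The two input rankings are hypothetical, standing in for an embedding search and a keyword search, and k = 60 is a commonly used default rather than anything stated in the thread.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one ranked list."""
    scores = {}
    for ranking in rankings:
        # Ranks start at 1, so the top hit contributes 1 / (k + 1).
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=lambda doc_id: scores[doc_id], reverse=True)

# Hypothetical result lists from two different retrievers.
embedding_ranking = ["a", "b", "c"]
keyword_ranking = ["b", "d", "a"]

fused = reciprocal_rank_fusion([embedding_ranking, keyword_ranking])
print(fused)
```

Documents that appear near the top of both lists rise to the top of the fused list, and no score normalization between the retrievers is needed.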

Veo 3 and Imagen 4, and a new tool for filmmaking called Flow

https://blog.google/technology/ai/generative-media-models-io-2025/
506•youssefarizk•10h ago•302 comments

Writing into Uninitialized Buffers in Rust

https://blog.sunfishcode.online/writingintouninitializedbuffersinrust/
26•luu•1d ago•4 comments

Litestream: Revamped

https://fly.io/blog/litestream-revamped/
256•usrme•8h ago•57 comments

“ZLinq”, a Zero-Allocation LINQ Library for .NET

https://neuecc.medium.com/zlinq-a-zero-allocation-linq-library-for-net-1bb0a3e5c749
99•cempaka•5h ago•32 comments

Gemma 3n preview: Mobile-first AI

https://developers.googleblog.com/en/introducing-gemma-3n/
250•meetpateltech•9h ago•86 comments

A Secret Trove of Rare Guitars Heads to the Met

https://www.newyorker.com/magazine/2025/05/26/a-secret-trove-of-rare-guitars-heads-to-the-met
33•bookofjoe•1h ago•6 comments

Clojuring the web application stack: Meditation One

https://www.evalapply.org/posts/clojure-web-app-from-scratch/index.html
11•adityaathalye•14h ago•3 comments

The NSA Selector

https://github.com/wenzellabs/the_NSA_selector
195•anigbrowl•9h ago•59 comments

Deep Learning Is Applied Topology

https://theahura.substack.com/p/deep-learning-is-applied-topology
372•theahura•14h ago•157 comments

Magic of software; what makes a good engineer also makes a good engineering org

https://moxie.org/2024/09/23/a-good-engineer.html
76•kiyanwang•1d ago•15 comments

Instagram Addiction

https://blog.greg.technology/2025/05/19/on-instagram-addiction.html
59•gregsadetsky•5h ago•32 comments

Show HN: apply.coop - Matching people with jobs that fit their values & passions

https://apply.coop
16•blainsmith•3h ago•1 comment

Red Programming Language

https://www.red-lang.org/p/about.html
123•hotpocket777•9h ago•60 comments

My favourite fonts to use with LaTeX (2022)

https://www.lfe.pt/latex/fonts/typography/2022/11/21/latex-fonts-part1.html
78•todsacerdoti•4d ago•21 comments

What if Vintage and Modern got together

https://www.jaydip.me/
3•jdsane•54m ago•0 comments

Show HN: 90s.dev – Game maker that runs on the web

https://90s.dev/blog/finally-releasing-90s-dev.html
242•90s_dev•13h ago•94 comments

Show HN: A Tiling Window Manager for Windows, Written in Janet

https://agent-kilo.github.io/jwno/
214•agentkilo•12h ago•72 comments

AI's energy footprint

https://www.technologyreview.com/2025/05/20/1116327/ai-energy-usage-climate-footprint-big-tech/
128•pseudolus•17h ago•139 comments

Why does the U.S. always run a trade deficit?

https://libertystreeteconomics.newyorkfed.org/2025/05/why-does-the-u-s-always-run-a-trade-deficit/
200•jnord•16h ago•417 comments

Robin: A multi-agent system for automating scientific discovery

https://arxiv.org/abs/2505.13400
122•nopinsight•11h ago•17 comments

New stem cell model sheds light on human amniotic sac development

https://www.crick.ac.uk/news/2025-05-15_new-stem-cell-model-sheds-light-on-human-amniotic-sac-development
24•gmays•4d ago•1 comment

The Dawn of Nvidia's Technology

https://blog.dshr.org/2025/05/the-dawn-of-nvidias-technology.html
138•wmf•10h ago•42 comments

The Value Isn't in the Code

https://jonayre.uk/blog/2022/10/30/the-real-value-isnt-in-the-code/
82•fragmede•4h ago•46 comments

Ashby (YC W19) Is Hiring Engineering Managers

https://www.ashbyhq.com/careers?utm_source=hn&ashby_jid=933570bc-a3d6-4fcc-991d-dc399c53a58a
1•abhikp•10h ago

Show HN: TitleBridge - A FinalCut Workflow Plugin

https://bustin.tech/apps/titlebridge/
7•_morph3ous•2h ago•1 comment

Linguists find proof of sweeping language pattern once deemed a 'hoax'

https://www.scientificamerican.com/article/linguists-find-proof-of-sweeping-language-pattern-once-deemed-a-hoax/
72•bryanrasmussen•1d ago•58 comments

Magnus Carlsen forced into a draw by more than 143000 people playing against him

https://apnews.com/article/chess-magnus-carlsen-match-world-freestyle-grandmaster-963a977765fa02d05a14d701666dfcd7
22•namanyayg•1h ago•6 comments

Gail Wellington, former Commodore executive, has died

https://www.legacy.com/us/obituaries/name/gail-wellington-obituary?id=58418580
85•erickhill•3d ago•32 comments

Ask HN: Conversational AI to Learn a Language

28•edweis•3d ago•14 comments