Don't forget chemrXiv!
https://news.ycombinator.com/item?id=42519487
I just did a spot check, I think searchthearxiv search results are superior.
It would be cool if the "More Like This" had a + button that would append the arxiv id to the search query.
Colbert being a good google-able application of utilizing more embeddings.
Search ends up often being a funnel of techniques. Cheap and high recall for phase 1 and ratchet up the flops and precision in subsequent passes on the previous result set.
I've built similar thing for github stars[1], might implement the same for it.
elliotec•8mo ago
0101111101•8mo ago
I'm also maintaining a dataset of all the embeddings on kaggle if you want to use them yourself: https://www.kaggle.com/datasets/tomtum/openai-arxiv-embeddin...
heisenburgzero•8mo ago
synctext•8mo ago
0101111101•8mo ago
elliotec•8mo ago
cluckindan•8mo ago
0101111101•8mo ago
cluckindan•8mo ago