frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

Erdos 281 solved with ChatGPT 5.2 Pro

https://twitter.com/neelsomani/status/2012695714187325745
93•nl•1h ago•30 comments

Lopado­temacho­selacho­galeo­kranio­leipsano­drim­hypo­trimmato­silphio­karab

https://en.wikipedia.org/wiki/Lopado%C2%ADtemacho%C2%ADselacho%C2%ADgaleo%C2%ADkranio%C2%ADleipsa...
42•firloop•1h ago•14 comments

How scientists are using Claude to accelerate research and discovery

https://www.anthropic.com/news/accelerating-scientific-research
28•gmays•2h ago•10 comments

No knives, only cook knives

https://kellykozakandjoshdonald.substack.com/p/no-knives-only-cook-knives
18•firloop•5h ago•0 comments

Profession by Isaac Asimov

https://www.abelard.org/asimov.php
22•bkudria•3h ago•0 comments

ASCII characters are not pixels: a deep dive into ASCII rendering

https://alexharri.com/blog/ascii-rendering
928•alexharri•18h ago•112 comments

Dark Mode vs. Light Mode: Which Is Better?

https://www.nngroup.com/articles/dark-mode/
12•seanwilson•3h ago•5 comments

Kip: A programming language based on grammatical cases of Turkish

https://github.com/kip-dili/kip
152•nhatcher•8h ago•49 comments

Computer Systems Security 6.566 / Spring 2024

https://css.csail.mit.edu/6.858/2024/
61•barishnamazov•5h ago•7 comments

jQuery 4.0.0 Released

https://blog.jquery.com/2026/01/17/jquery-4-0-0/
40•OuterVale•1h ago•5 comments

The recurring dream of replacing developers

https://www.caimito.net/en/blog/2025/12/07/the-recurring-dream-of-replacing-developers.html
365•glimshe•15h ago•290 comments

Podcasting Could Use a Good Asteroid

https://www.joanwestenberg.com/podcasting-could-use-a-good-asteroid/
9•zdw•2d ago•2 comments

We put Claude Code in Rollercoaster Tycoon

https://labs.ramp.com/rct
407•iamwil•5d ago•229 comments

If you put Apple icons in reverse it looks like someone getting good at design

https://mastodon.social/@heliographe_studio/115890819509545391
357•lateforwork•5h ago•155 comments

Xous Operating System

https://xous.dev/
105•eustoria•3d ago•30 comments

Raising money fucked me up

https://blog.yakkomajuri.com/blog/raising-money-fucked-me-up
194•yakkomajuri•11h ago•64 comments

How London cracked mobile phone coverage on the Underground

https://www.ianvisits.co.uk/articles/how-london-finally-cracked-mobile-phone-coverage-on-the-unde...
33•beardyw•4d ago•7 comments

The Olivetti Company

https://www.abortretry.fail/p/the-olivetti-company
152•rbanffy•6d ago•32 comments

Why Object of Arrays beat interleaved arrays: a JavaScript performance issue

https://www.royalbhati.com/posts/js-array-vs-typedarray
14•howToTestFE•6d ago•5 comments

The relentless rule of my fitness tracker

https://timharford.com/2025/10/the-relentless-rule-of-my-fitness-tracker/
8•Arnt•1h ago•2 comments

Show HN: ChunkHound, a local-first tool for understanding large codebases

https://github.com/chunkhound/chunkhound
66•NadavBenItzhak•8h ago•16 comments

The life of a playboy publisher who shaped 20th-century literature

https://www.washingtonpost.com/books/2026/01/09/bennett-cerf-biography-nothing-random-feldman-boo...
5•benbreen•4h ago•1 comments

Light Mode InFFFFFFlation

https://willhbr.net/2025/10/20/light-mode-infffffflation/
178•Fudgel•7h ago•133 comments

IRISC: An ARMv7 assembly interpreter and computer architecture simulator

https://polysoftit.co.uk/irisc-web/
21•rtybanana•5h ago•2 comments

Below the Surface: Archeological Finds from the Amsterdam Noord/Zuid Metro Line

https://belowthesurface.amsterdam/en/vondsten
73•stefanvdw1•6d ago•10 comments

U.S. Court Order Against Anna's Archive Spells More Trouble for the Site

https://torrentfreak.com/u-s-court-order-against-annas-archive-spells-more-trouble-for-the-site/
27•t-3•1h ago•12 comments

An Elizabethan mansion's secrets for staying warm

https://www.bbc.com/future/article/20260116-an-elizabethan-mansions-secrets-for-staying-warm
128•Tachyooon•12h ago•145 comments

The thing that brought me joy

https://www.stephenlewis.me/blog/the-thing-that-brought-me-joy/
85•monooso•10h ago•39 comments

Show HN: Speed Miners – A tiny RTS resource mini-game

https://speedminers.fun/
18•nickponline•7h ago•2 comments

M8SBC-486 (Homebrew 486 computer)

https://maniek86.xyz/projects/m8sbc_486.php
102•rasz•6d ago•8 comments