frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

The struggle of resizing windows on macOS Tahoe

https://noheger.at/blog/2026/01/11/the-struggle-of-resizing-windows-on-macos-tahoe/
1843•happosai•14h ago•772 comments

Lightpanda migrate DOM implementation to Zig

https://lightpanda.io/blog/posts/migrating-our-dom-to-zig
43•gearnode•1h ago•10 comments

JRR Tolkien reads from The Hobbit for 30 Minutes (1952)

https://www.openculture.com/2026/01/j-r-r-tolkien-reads-from-the-hobbit-for-30-minutes-1952.html
125•bookofjoe•4d ago•27 comments

Ai, Japanese chimpanzee who counted and painted dies at 49

https://www.bbc.com/news/articles/cj9r3zl2ywyo
25•reconnecting•2h ago•6 comments

CLI agents make self-hosting on a home server easier and fun

https://fulghum.io/self-hosting
576•websku•14h ago•376 comments

39c3: In-house electronics manufacturing from scratch: How hard can it be? [video]

https://media.ccc.de/v/39c3-in-house-electronics-manufacturing-from-scratch-how-hard-can-it-be
130•fried-gluttony•2d ago•49 comments

This game is a single 13 KiB file that runs on Windows, Linux and in the Browser

https://iczelia.net/posts/snake-polyglot/
223•snoofydude•13h ago•57 comments

iCloud Photos Downloader

https://github.com/icloud-photos-downloader/icloud_photos_downloader
473•reconnecting•16h ago•200 comments

Conbini Wars – Map of Japanese convenience store ratios

https://conbini.kikkia.dev/
59•zdw•5d ago•22 comments

XMPP and Metadata

https://blog.mathieui.net/xmpp-and-metadata.html
24•todsacerdoti•5d ago•0 comments

Show HN: DevicePrint – device fingerprinting without cookies

5•silverrump•5d ago•4 comments

I'm making a game engine based on dynamic signed distance fields (SDFs) [video]

https://www.youtube.com/watch?v=il-TXbn5iMA
336•imagiro•4d ago•46 comments

Fossil versus Git

https://fossil-scm.org/home/doc/trunk/www/fossil-v-git.wiki
6•vednig•1h ago•1 comments

Uncrossy

https://uncrossy.com/
86•dgacmu•10h ago•26 comments

The next two years of software engineering

https://addyosmani.com/blog/next-two-years/
165•napolux•13h ago•147 comments

FUSE is All You Need – Giving agents access to anything via filesystems

https://jakobemmerling.de/posts/fuse-is-all-you-need/
152•jakobem•14h ago•53 comments

Sampling at negative temperature

https://cavendishlabs.org/blog/negative-temperature/
176•ag8•15h ago•52 comments

Perfectly Replicating Coca Cola [video]

https://www.youtube.com/watch?v=TDkH3EbWTYc
235•HansVanEijsden•3d ago•154 comments

Insights into Claude Opus 4.5 from Pokémon

https://www.lesswrong.com/posts/u6Lacc7wx4yYkBQ3r/insights-into-claude-opus-4-5-from-pokemon
91•surprisetalk•5d ago•18 comments

Himalayas bare and rocky after reduced winter snowfall, scientists warn

https://www.bbc.com/news/articles/clyndv7zd20o
123•koolhead17•8h ago•96 comments

Ask HN: What are you working on? (January 2026)

201•david927•18h ago•648 comments

Garbage collection is contrarian

https://trynova.dev/blog/garbage-collection-is-contrarian
51•aapoalas•2d ago•7 comments

Elo – A data expression language which compiles to JavaScript, Ruby, and SQL

https://elo-lang.org/
90•ravenical•4d ago•25 comments

Don't fall into the anti-AI hype

https://antirez.com/news/158
1040•todsacerdoti•1d ago•1237 comments

Gadget Exposed a Spy Camera [video]

https://www.youtube.com/watch?v=1reman2waLs
56•rib3ye•11h ago•27 comments

Poison Fountain

https://rnsaffn.com/poison3/
207•atomic128•18h ago•122 comments

Xfce is great

https://rubenerd.com/xfce-is-great/
231•mikece•7h ago•153 comments

Erich von Däniken has died

https://daniken.com/en/startseite-english/
96•Kaibeezy•16h ago•164 comments

A set of Idiomatic prod-grade katas for experienced devs transitioning to Go

https://github.com/MedUnes/go-kata
130•medunes•4d ago•23 comments

Show HN: Shellock, a real-time CLI flag explainer for fish shell

https://github.com/ibehnam/shellock
11•behnamoh•5d ago•4 comments