Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•7mo ago

Comments

jbellis•7mo ago
I'm kind of disappointed in this article. Splade is a cool way to improve the results of a TF/IDF index with minimally invasive changes, and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

It's probably easier to demonstrate if you drop down a level to Lucene; I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•7mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields, which allow you to specify the value of every token. Maybe I'll make an update post to try again.
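
To make the point about weights concrete, here is a minimal sketch of what is lost when the SPLADE token weights are discarded. It is not the article's code and not Elasticsearch's or Lucene's scoring: the function names `weighted_score` and `unweighted_score` are hypothetical, the token weights are made-up illustrative numbers, and the weighted score is assumed to be a plain dot product over the tokens shared by query and document.

```python
from typing import Dict

# A sparse vector here is just {token: weight}; all weights below are invented.
SparseVec = Dict[str, float]


def weighted_score(query: SparseVec, doc: SparseVec) -> float:
    """Dot product over the tokens that appear in both query and document."""
    return sum(weight * doc[token] for token, weight in query.items() if token in doc)


def unweighted_score(query: SparseVec, doc: SparseVec) -> int:
    """Count of shared tokens, i.e. what remains once the weights are dropped."""
    return len(query.keys() & doc.keys())


query = {"keyword": 1.5, "search": 1.0}
doc_a = {"keyword": 2.0, "search": 0.5, "index": 0.25}
doc_b = {"keyword": 0.25, "search": 0.25, "semantic": 2.0}

# Both documents share the same two tokens with the query, so token overlap
# alone cannot tell them apart.
print(unweighted_score(query, doc_a), unweighted_score(query, doc_b))

# With weights, doc_a scores higher because its shared tokens carry more weight.
print(weighted_score(query, doc_a), weighted_score(query, doc_b))
```

In this toy setup both documents tie on token overlap, while the weighted score separates them. A field that stores a value per token, as suggested above, is what would let the index keep that information.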
