Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•7mo ago

Comments

jbellis•7mo ago
I'm kind of disappointed in this article. Splade is a cool way to improve the results of a TF-IDF index with minimally invasive changes, and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

It's probably easier to demonstrate if you drop down a level to Lucene; I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1
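
To make the point about weights concrete, here is a minimal sketch of the difference between matching on SPLADE tokens alone and scoring with their weights. It assumes the usual sparse dot-product formulation; the token weights and the function names (`binary_overlap`, `weighted_score`) are made up for illustration and do not come from the article or from a real model.

```python
from typing import Dict

# Hypothetical SPLADE-style expansions: token -> weight.
# The values are invented for illustration, not produced by a real model.
SparseVec = Dict[str, float]


def binary_overlap(query: SparseVec, doc: SparseVec) -> int:
    """Score that discards the weights: count the tokens both sides share."""
    return len(set(query) & set(doc))


def weighted_score(query: SparseVec, doc: SparseVec) -> float:
    """Sparse dot product: sum of query weight * doc weight over shared tokens."""
    return sum(w * doc[token] for token, w in query.items() if token in doc)


query = {"search": 1.5, "semantic": 1.0, "keyword": 0.5}
doc_a = {"search": 2.0, "semantic": 0.5}
doc_b = {"search": 0.25, "keyword": 2.0}

# Both documents share two tokens with the query, so the weightless score ties.
overlap_a = binary_overlap(query, doc_a)
overlap_b = binary_overlap(query, doc_b)

# With the weights kept, the two documents are clearly separated.
score_a = weighted_score(query, doc_a)
score_b = weighted_score(query, doc_b)
```

In this toy case the two documents are indistinguishable without the weights, and only the weighted score ranks `doc_a` above `doc_b`.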

JnBrymn•7mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields, which allow you to specify the value of every token. Maybe I'll make an update post to try again.
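
A rough sketch of what that could look like, written as plain request bodies rather than client calls. It assumes a recent Elasticsearch release where the `sparse_vector` field type and the `sparse_vector` query with a `query_vector` parameter are available; check the documentation for your version before relying on it. The field name `splade` and the helper names are hypothetical.

```python
import json
from typing import Dict

SparseVec = Dict[str, float]


def sparse_vector_mapping(field: str = "splade") -> dict:
    """Index mapping body with one sparse vector field (field name is hypothetical)."""
    return {"mappings": {"properties": {field: {"type": "sparse_vector"}}}}


def to_document(text: str, token_weights: SparseVec, field: str = "splade") -> dict:
    """Document body that keeps each SPLADE token together with its weight."""
    kept = {token: w for token, w in token_weights.items() if w > 0}
    return {"text": text, field: kept}


def sparse_vector_query(token_weights: SparseVec, field: str = "splade") -> dict:
    """Search body that passes precomputed query token weights.

    Assumption: the `sparse_vector` query accepts `field` and `query_vector`.
    """
    return {
        "query": {
            "sparse_vector": {"field": field, "query_vector": token_weights}
        }
    }


# Invented weights for illustration; a real SPLADE model would produce these.
doc_body = to_document(
    "hybrid search with sparse vectors",
    {"search": 2.0, "hybrid": 1.5, "the": 0.0},
)
query_body = sparse_vector_query({"search": 1.5, "semantic": 1.0})

print(json.dumps(sparse_vector_mapping(), indent=2))
print(json.dumps(doc_body, indent=2))
print(json.dumps(query_body, indent=2))
```

The bodies are only built and printed here, not sent anywhere, so the sketch runs without a cluster.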
