frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

The struggle of resizing windows on macOS Tahoe

https://noheger.at/blog/2026/01/11/the-struggle-of-resizing-windows-on-macos-tahoe/
1318•happosai•9h ago•558 comments

CLI agents make self-hosting on a home server easier and fun

https://fulghum.io/self-hosting
425•websku•9h ago•273 comments

Himalayas bare and rocky after reduced winter snowfall, scientists warn

https://www.bbc.com/news/articles/clyndv7zd20o
50•koolhead17•3h ago•11 comments

This game is a single 13 KiB file that runs on Windows, Linux and in the Browser

https://iczelia.net/posts/snake-polyglot/
152•snoofydude•8h ago•45 comments

iCloud Photos Downloader

https://github.com/icloud-photos-downloader/icloud_photos_downloader
393•reconnecting•11h ago•178 comments

Don't fall into the anti-AI hype

https://antirez.com/news/158
825•todsacerdoti•20h ago•993 comments

39c3: In-house electronics manufacturing from scratch: How hard can it be? [video]

https://media.ccc.de/v/39c3-in-house-electronics-manufacturing-from-scratch-how-hard-can-it-be
27•fried-gluttony•2d ago•0 comments

Xfce is great

https://rubenerd.com/xfce-is-great/
105•mikece•2h ago•58 comments

I'm making a game engine based on dynamic signed distance fields (SDFs) [video]

https://www.youtube.com/watch?v=il-TXbn5iMA
255•imagiro•3d ago•33 comments

Sampling at negative temperature

https://cavendishlabs.org/blog/negative-temperature/
141•ag8•10h ago•43 comments

FUSE is All You Need – Giving agents access to anything via filesystems

https://jakobemmerling.de/posts/fuse-is-all-you-need/
101•jakobem•9h ago•48 comments

The next two years of software engineering

https://addyosmani.com/blog/next-two-years/
89•napolux•8h ago•53 comments

Garbage collection is contrarian

https://trynova.dev/blog/garbage-collection-is-contrarian
28•aapoalas•2d ago•1 comments

Which programming languages are most token-efficient?

https://martinalderson.com/posts/which-programming-languages-are-most-token-efficient/
73•tehnub•5h ago•43 comments

Show HN: An LLM-optimized programming language

https://github.com/ImJasonH/ImJasonH/blob/main/articles/llm-programming-language.md
22•ImJasonH•3h ago•6 comments

Perfectly Replicating Coca Cola [video]

https://www.youtube.com/watch?v=TDkH3EbWTYc
176•HansVanEijsden•3d ago•115 comments

Gadget Exposed a Spy Camera [video]

https://www.youtube.com/watch?v=1reman2waLs
25•rib3ye•6h ago•16 comments

Uncrossy

https://uncrossy.com/
32•dgacmu•5h ago•14 comments

Elo – A data expression language which compiles to JavaScript, Ruby, and SQL

https://elo-lang.org/
70•ravenical•4d ago•11 comments

Erich von Däniken has died

https://daniken.com/en/startseite-english/
58•Kaibeezy•11h ago•91 comments

Insights into Claude Opus 4.5 from Pokémon

https://www.lesswrong.com/posts/u6Lacc7wx4yYkBQ3r/insights-into-claude-opus-4-5-from-pokemon
50•surprisetalk•5d ago•11 comments

Ask HN: What are you working on? (January 2026)

170•david927•14h ago•550 comments

I'd tell you a UDP joke…

https://www.codepuns.com/post/805294580859879424/i-would-tell-you-a-udp-joke-but-you-might-not-get
142•redmattred•8h ago•38 comments

1% vs. 67%: What happened when we stopped trusting embeddings alone

https://roampal.ai/blog-context-rot.html
4•roampal•5d ago•0 comments

A set of Idiomatic prod-grade katas for experienced devs transitioning to Go

https://github.com/MedUnes/go-kata
118•medunes•4d ago•17 comments

Moving Scratch generation to Python on browser

https://kushaldas.in/posts/introducing-ektupy.html
32•kushaldas•2d ago•6 comments

Show HN: Engineering Schizophrenia: Trusting yourself through Byzantine faults

61•rescrv•8h ago•11 comments

Poison Fountain

https://rnsaffn.com/poison3/
185•atomic128•13h ago•115 comments

I dumped Windows 11 for Linux, and you should too

https://www.notebookcheck.net/I-dumped-Windows-11-for-Linux-and-you-should-too.1190961.0.html
777•smurda•19h ago•717 comments

I Cannot SSH into My Server Anymore (and That's Fine)

https://soap.coffee/~lthms/posts/i-cannot-ssh-into-my-server-anymore.html
98•TheWiggles•4d ago•76 comments