frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

Gentoo Linux 2025 Review

https://www.gentoo.org/news/2026/01/05/new-year.html
166•akhuettel•5h ago•68 comments

Happy 50th Birthday KIM-1

https://github.com/netzherpes/KIM1-Demo
30•JKCalhoun•2h ago•11 comments

"Food JPEGs" in Super Smash Bros. & Kirby Air Riders

https://sethmlarson.dev/food-jpegs-in-super-smash-bros-and-kirby-air-riders
146•SethMLarson•4d ago•39 comments

Instagram data breach reportedly exposed the personal info of 17.5M users

https://www.engadget.com/cybersecurity/an-instagram-data-breach-reportedly-exposed-the-personal-i...
80•IvanAchlaqullah•1h ago•28 comments

I dumped Windows 11 for Linux, and you should too

https://www.notebookcheck.net/I-dumped-Windows-11-for-Linux-and-you-should-too.1190961.0.html
342•smurda•5h ago•314 comments

C++ std::move doesn't move anything: A deep dive into Value Categories

https://0xghost.dev/blog/std-move-deep-dive/
164•signa11•2d ago•124 comments

BasiliskII Macintosh 68k Emulator Ported to ESP32-P4 / M5Stack Tab5

https://github.com/amcchord/M5Tab-Macintosh
44•rcarmo•4h ago•5 comments

Think of Pavlov

https://boz.com/articles/think-pavlov
70•kiyanwang•5h ago•26 comments

The Concise TypeScript Book

https://github.com/gibbok/typescript-book
169•javatuts•11h ago•33 comments

Show HN: Porting xv6 to HiFive Unmatched board

https://github.com/eyengin/xv6-riscv-unmatched
16•eyengin•1d ago•0 comments

My Home Fibre Network Disintegrated

https://alienchow.dev/post/fibre_disintegration/
202•alienchow•12h ago•174 comments

Vojtux – Unofficial Linux Distribution Aimed at Visually Impaired Users

https://github.com/vojtapolasek/vojtux
96•TheWiggles•4d ago•25 comments

You are not required to close your <p>, <li>, <img>, or <br> tags in HTML

https://blog.novalistic.com/archives/2017/08/optional-end-tags-in-html/
84•jen729w•1d ago•131 comments

HTML-only conditional lazy loading (via preload and media)

https://orga.cat/blog/html-conditional-lazy-loading/
47•netol•5h ago•9 comments

Replace the Retiring Windows XP with Linux

https://www.linux.com/training-tutorials/replace-retiring-windows-xp-linux/
8•righthand•31m ago•2 comments

More than one hundred years of Film Sizes

https://wichm.home.xs4all.nl/filmsize.html
72•exvi•8h ago•17 comments

Finding and fixing Ghostty's largest memory leak

https://mitchellh.com/writing/ghostty-memory-leak-fix
550•thorel•21h ago•121 comments

Show HN: I used Claude Code to discover connections between 100 books

https://trails.pieterma.es/
436•pmaze•23h ago•133 comments

Code and Let Live

https://fly.io/blog/code-and-let-live/
398•usrme•1d ago•152 comments

Show HN: Ferrite – Markdown editor in Rust with native Mermaid diagram rendering

https://github.com/OlaProeis/Ferrite
211•OlaProis•15h ago•120 comments

CPU Counters on Apple Silicon: article + tool

https://blog.bugsiki.dev/posts/apple-pmu/
138•verte_zerg•4d ago•0 comments

Outward Signs of Inner Mysteries

https://lareviewofbooks.org/article/outward-signs-of-inner-mysteries/
13•prismatic•4d ago•0 comments

Large Feeds and RFC 5005

https://alexschroeder.ch/view/2025-09-10-large-feeds
4•8organicbits•4d ago•0 comments

Learning from Sudoku Solvers (2007)

http://ravimohan.blogspot.com/2007/04/learning-from-sudoku-solvers.html
10•forks•5d ago•3 comments

'Bandersnatch': The Works That Inspired the 'Black Mirror' Interactive Feature (2019)

https://www.hollywoodreporter.com/tv/tv-news/black-mirror-bandersnatch-real-life-works-influences...
68•rafaepta•5d ago•27 comments

Open Chaos: A self-evolving open-source project

https://www.openchaos.dev/
402•stefanvdw1•1d ago•83 comments

Max Payne – two decades later – Graphics Critique (2021)

https://darkcephas.blogspot.com/2021/07/max-payne-two-decades-later-graphics.html
107•davikr•13h ago•35 comments

AI is a business model stress test

https://dri.es/ai-is-a-business-model-stress-test
298•amarsahinovic•23h ago•286 comments

Google: Don't make "bite-sized" content for LLMs

https://arstechnica.com/google/2026/01/google-dont-make-bite-sized-content-for-llms-if-you-care-a...
54•cebert•4h ago•31 comments

Don't fall into the anti-AI hype

https://antirez.com/news/158
312•todsacerdoti•6h ago•444 comments