frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•7mo ago

Comments

jbellis•7mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•7mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

Lewis Carroll Computed Determinants

https://www.johndcook.com/blog/2023/07/10/lewis-carroll-determinants/
25•tzury•41m ago•3 comments

Experts Explore New Mushroom Which Causes Fairytale-Like Hallucinations

https://nhmu.utah.edu/articles/experts-explore-new-mushroom-which-causes-fairytale-hallucinations
102•astronads•2h ago•34 comments

Package managers keep using Git as a database, it never works out

https://nesbitt.io/2025/12/24/package-managers-keep-using-git-as-a-database.html
451•birdculture•6h ago•253 comments

LearnixOS

https://www.learnix-os.com
141•gtirloni•6h ago•49 comments

Rob Pike got spammed with an AI slop "act of kindness"

https://simonwillison.net/2025/Dec/26/slop-acts-of-kindness/
88•nabla9•1h ago•22 comments

C/C++ Embedded Files (2013)

https://www.4rknova.com//blog/2013/01/27/cpp-embedded-files
31•ibobev•2h ago•27 comments

FFmpeg has issued a DMCA takedown on GitHub

https://twitter.com/FFmpeg/status/2004599109559496984
100•merlindru•1h ago•9 comments

My insulin pump controller uses the Linux kernel. It also violates the GPL

https://old.reddit.com/r/linux/comments/1puojsr/the_device_that_controls_my_insulin_pump_uses_the/
24•davisr•31m ago•1 comments

Show HN: AutoLISP interpreter in Rust/WASM – a CAD workflow invented 33 yrs ago

https://acadlisp.de/noscript.html
49•holg•3h ago•25 comments

Show HN: Xcc700: Self-hosting mini C compiler for ESP32 (Xtensa) in 700 lines

https://github.com/valdanylchuk/xcc700
45•isitcontent•4h ago•8 comments

Unix "find" expressions compiled to bytecode

https://nullprogram.com/blog/2025/12/23/
76•rcarmo•7h ago•4 comments

ZJIT is now available in Ruby 4.0

https://railsatscale.com/2025-12-24-launch-zjit/
34•ibobev•2h ago•16 comments

The Algebra of Loans in Rust

https://nadrieril.github.io/blog/2025/12/21/the-algebra-of-loans-in-rust.html
149•g0xA52A2A•4d ago•75 comments

Perfect Aircrete, Kitchen Ingredients [video]

https://www.youtube.com/watch?v=z4_GxPHwqkA
10•surprisetalk•6d ago•1 comments

Joan Didion and Kurt Vonnegut had something to say. We have it on tape

https://www.nytimes.com/2025/12/19/books/james-baldwin-joan-didion-92ny-recordings.html
77•tintinnabula•4d ago•14 comments

Sandbox: Run untrusted AI code safely, fast

https://github.com/PwnFunction/sandbox
17•vortex_ape•1w ago•1 comments

High School Student Discovers 1.5M Potential New Astronomical Objects

https://www.smithsonianmag.com/smart-news/high-school-student-discovers-1-5-million-potential-new...
81•mhb•4h ago•76 comments

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

https://github.com/thu-ml/TurboDiffusion
199•meander_water•16h ago•39 comments

Rob Pike Goes Nuclear over GenAI

https://skyview.social/?url=https%3A%2F%2Fbsky.app%2Fprofile%2Frobpike.io%2Fpost%2F3matwg6w3ic2s&...
830•christoph-heiss•5h ago•765 comments

Overlooked No More: Inge Lehmann, Who Discovered the Earth's Inner Core

https://www.nytimes.com/2025/12/20/obituaries/inge-lehmann-overlooked.html
54•Hooke•3d ago•12 comments

A Proclamation Regarding the Restoration of the Em-Dash

https://blog.nawaz.org/posts/2025/Dec/a-proclamation-regarding-the-restoration-of-the-dash/
64•BeetleB•2h ago•72 comments

An 11-qubit atom processor in silicon with all fidelities from 99.10% to 99.99%

https://www.nature.com/articles/s41586-025-09827-w
66•giuliomagnifico•5d ago•42 comments

The First Web Server

https://dfarq.homeip.net/the-first-web-server/
36•giuliomagnifico•8h ago•6 comments

Show HN: Gaming Couch – a local multiplayer party game platform for 8 players

https://gamingcouch.com
370•ChaosOp•5d ago•106 comments

Building an AI agent inside a 7-year-old Rails monolith

https://catalinionescu.dev/ai-agent/building-ai-agent-part-1/
96•cionescu1•12h ago•41 comments

Bedlam Cube Solved (ALL 19,186 solutions)

http://scottkurowski.com/BedlamCube/
20•kristianp•4d ago•3 comments

ChatGPT conversations still lack timestamps after years of requests

https://community.openai.com/t/timestamps-for-chats-in-chatgpt/440107?page=3
199•Valid3840•7h ago•107 comments

Geometric Algorithms for Translucency Sorting in Minecraft [pdf]

https://douira.dev/assets/document/douira-master-thesis.pdf
59•HeliumHydride•10h ago•20 comments

How to Reproduce This Book with LaTeX

https://github.com/BenjaminGor/Latex_Notes_Tutorial
81•nill0•1w ago•11 comments

MiniMax M2.1: Built for Real-World Complex Tasks, Multi-Language Programming

https://www.minimaxi.com/news/minimax-m21
200•110•18h ago•71 comments