Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•7mo ago

Comments

jbellis•7mo ago
I'm kind of disappointed in this article. Splade is a cool way to improve the results of a TF/IDF index with minimally invasive changes, and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

It's probably easier to demonstrate if you drop down a level to Lucene; I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•7mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields, which allow you to specify the value of every token. Maybe I'll make an update post to try again.
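
To make the point about weights concrete, here is a minimal sketch of the difference between matching on SPLADE tokens alone and scoring with their weights. Everything in it is an assumption for illustration: the token weights are made up, the function names are hypothetical, and the plain overlap count is only a crude stand-in for token-only matching (a real Elasticsearch index would apply its own text scoring to those tokens). The weighted version is a sparse dot product over shared tokens.

```python
# Hypothetical SPLADE-style sparse vectors: token -> weight.
# The weights below are invented for illustration, not model output.


def token_overlap_score(query_vec, doc_vec):
    """Count shared tokens, ignoring weights (a crude stand-in for token-only matching)."""
    return len(query_vec.keys() & doc_vec.keys())


def weighted_score(query_vec, doc_vec):
    """Sparse dot product: sum of query_weight * doc_weight over shared tokens."""
    return sum(w * doc_vec[t] for t, w in query_vec.items() if t in doc_vec)


query = {"splade": 2.0, "search": 1.0, "retrieval": 0.5}
doc_a = {"splade": 1.5, "search": 0.5, "lucene": 1.0}
doc_b = {"splade": 0.25, "search": 0.25, "retrieval": 0.25}

for name, doc in (("doc_a", doc_a), ("doc_b", doc_b)):
    print(name, token_overlap_score(query, doc), weighted_score(query, doc))
```

With these made-up numbers, doc_b shares more tokens with the query, but doc_a scores higher once weights are used, because it matches strongly on the tokens the query cares about most. That ranking flip is the information lost when the weights are discarded.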
