frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•7mo ago

Comments

jbellis•7mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•7mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

VPN location claims don't match real traffic exits

https://ipinfo.io/blog/vpn-location-mismatch-report
159•mmaia•2h ago•73 comments

Why Twilio Segment moved from microservices back to a monolith

https://www.twilio.com/en-us/blog/developers/best-practices/goodbye-microservices
56•birdculture•2h ago•37 comments

Recovering Anthony Bourdain's (really) lost Li.st's

https://sandyuraz.com/blogs/bourdain/
37•thecsw•1h ago•4 comments

I fed 24 years of my blog posts to a Markov model

https://susam.net/fed-24-years-of-posts-to-markov-model.html
42•zdw•2h ago•13 comments

I tried Gleam for Advent of Code

https://blog.tymscar.com/posts/gleamaoc2025/
196•tymscar•5h ago•114 comments

The Rise of Computer Games, Part I: Adventure

https://technicshistory.com/2025/12/13/the-rise-of-computer-games-part-i-adventure/
18•cfmcdonald•2h ago•1 comments

Want to sway an election? Here’s how much fake online accounts cost

https://www.science.org/content/article/want-sway-election-here-s-how-much-fake-online-accounts-cost
76•rbanffy•1h ago•26 comments

Some surprising things about DuckDuckGo you probably don't know

https://gabrielweinberg.com/p/some-surprising-things-about-duckduckgo
18•ArmageddonIt•44m ago•8 comments

Useful patterns for building HTML tools

https://simonwillison.net/2025/Dec/10/html-tools/
207•simonw•3d ago•60 comments

Cryptids

https://wiki.bbchallenge.org/wiki/Cryptids
76•frozenseven•1w ago•12 comments

Ask HN: How do you handle release notes for multiple audiences?

33•glidr_dev•2h ago•38 comments

Ask HN: How can I get better at using AI for programming?

139•lemonlime227•7h ago•186 comments

Go Proposal: Secret Mode

https://antonz.org/accepted/runtime-secret/
131•enz•4d ago•58 comments

Are we stuck with the same Desktop UX forever? [video]

https://www.youtube.com/watch?v=1fZTOjd_bOQ
60•joelkesler•4h ago•68 comments

What is the nicest thing a stranger has ever done for you?

https://louplummer.lol/nice-stranger/
257•speckx•2d ago•199 comments

EasyPost (YC S13) Is Hiring

https://www.easypost.com/careers
1•jstreebin•5h ago

A Giant Ball Will Help This Man Survive a Year on an Iceberg

https://www.outsideonline.com/outdoor-adventure/exploration-survival/how-giant-ball-will-help-man...
20•areoform•7h ago•23 comments

Researchers seeking better measures of cognitive fatigue

https://www.nature.com/articles/d41586-025-03974-w
90•bikenaga•3d ago•26 comments

Purdue University Approves New AI Requirement for All Undergrads

https://www.forbes.com/sites/michaeltnietzel/2025/12/13/purdue-university-approves-new-ai-require...
27•rmason•1h ago•20 comments

Photographer built a medium-format rangefinder

https://petapixel.com/2025/12/06/this-photographer-built-an-awesome-medium-format-rangefinder-and...
154•shinryuu•1w ago•34 comments

Will West Coast Jazz Get Some Respect?

https://www.honest-broker.com/p/will-west-coast-jazz-finally-get
60•paulpauper•1w ago•36 comments

From Azure Functions to FreeBSD

https://jmmv.dev/2025/12/from-azure-functions-to-freebsd.html
38•todsacerdoti•5d ago•2 comments

A Lisp Interpreter Implemented in Conway's Game of Life (2021)

https://woodrush.github.io/blog/posts/2022-01-12-lisp-in-life.html
83•pabs3•19h ago•3 comments

Using Python for Scripting

https://hypirion.com/musings/use-python-for-scripting
72•birdculture•5d ago•64 comments

Java FFM zero-copy transport using io_uring

https://www.mvp.express/
92•mands•6d ago•42 comments

GNU Unifont

https://unifoundry.com/unifont/index.html
315•remywang•1d ago•72 comments

A 'toaster with a lens': The story behind the first handheld digital camera

https://www.bbc.com/future/article/20251205-how-the-handheld-digital-camera-was-born
74•selvan•5d ago•41 comments

Beautiful Abelian Sandpiles

https://eavan.blog/posts/beautiful-sandpiles.html
131•eavan0•4d ago•22 comments

Pig Video Arcades Critique Life in the Pen (1997)

https://www.wired.com/1997/06/pig-video-arcades-critique-life-in-the-pen/
3•naryJane•5d ago•0 comments

Dynamic Pong Wars

https://markodenic.tech/dynamic-pong-wars/
31•rendall•1w ago•8 comments