frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

So, You've Hit an Age Gate. What Now?

https://www.eff.org/deeplinks/2026/01/so-youve-hit-age-gate-what-now
79•hn_acker•1h ago•47 comments

Why some clothes shrink in the wash – and how to 'unshrink' them

https://www.swinburne.edu.au/news/2025/08/why-some-clothes-shrink-in-the-wash-and-how-to-unshrink...
236•OptionOfT•3d ago•125 comments

Find a pub that needs you

https://www.ismypubfucked.com/
73•thinkingemote•2h ago•32 comments

Starlink roam 50GB is now 100GB with unlimited slow speed after that

https://starlink.com/support/article/58c9c8b7-474e-246f-7e3c-06db3221d34d
127•bahmboo•2h ago•114 comments

Ask HN: Could you share your personal website here?

41•susam•1h ago•135 comments

The Unbearable Frustration of Figuring Out APIs

https://blog.ar-ms.me/thoughts/translation-cli/
37•ezekg•2h ago•18 comments

Ford F-150 Lightning outsold the Cybertruck and was then canceled for poor sales

https://electrek.co/2026/01/13/ford-f150-lightning-outsold-tesla-cybertruck-canceled-not-selling-...
104•MBCook•1h ago•96 comments

Edge of Emulation: Game Boy Sewing Machines (2020)

https://shonumi.github.io/articles/art22.html
73•mosura•4h ago•6 comments

There's a ridiculous amount of tech in a disposable vape

https://blog.jgc.org/2026/01/theres-ridiculous-amount-of-tech-in.html
672•abnercoimbre•2d ago•583 comments

I built Vector. Now I'm answering the question your observability vendor won't

https://usetero.com/blog/the-question-your-observability-vendor-wont-answer
54•binarylogic•2h ago•23 comments

Show HN: HyTags – HTML as a Programming Language

https://hytags.org
27•lassejansen•1d ago•12 comments

Show HN: A 10KiB kernel for cloud apps

https://github.com/ReturnInfinity/BareMetal-Cloud
30•ianseyler•2h ago•2 comments

Xoscript

https://xoscript.com/history.xo
30•gabordemooij•2h ago•19 comments

Virginia Faulkner: Writer, Editor and Ghostwriter?

https://lithub.com/virginia-faulkner-writer-editor-and-ghostwriter/
8•samclemens•5d ago•1 comments

I’m leaving Redis for SolidQueue

https://www.simplethread.com/redis-solidqueue/
248•amalinovic•9h ago•100 comments

Government drops plans for mandatory digital ID to work in UK

https://www.bbc.com/news/articles/c3385zrrx73o
99•FridayoLeary•3h ago•36 comments

How have prices changed in a year? NPR checked 114 items at Walmart

https://www.npr.org/2026/01/14/nx-s1-5638908/walmart-prices-inflation-affordability-shrinkflation
76•srameshc•1h ago•40 comments

Lago (Open-Source Billing) is hiring across teams and geos

1•Rafsark•6h ago

A Brief Introduction to the Basics of Game Theory

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=1968579
43•7777777phil•2d ago•5 comments

I Hate GitHub Actions with Passion

https://xlii.space/eng/i-hate-github-actions-with-passion/
276•xlii•7h ago•224 comments

Show HN: Tiny FOSS Compass and Navigation App (<2MB)

https://github.com/CompassMB/MBCompass
104•nativeforks•7h ago•32 comments

1000 Blank White Cards

https://en.wikipedia.org/wiki/1000_Blank_White_Cards
328•eieio•15h ago•58 comments

GitHub should charge everyone $1 more per month to fund open source

https://blog.greg.technology/2025/11/27/github-should-charge-1-dollar-more-per-month.html
37•evakhoury•2h ago•52 comments

System Programming in Linux: A Hands-On Introduction "Demo" Programs

https://github.com/stewartweiss/intro-linux-sys-prog
69•teleforce•8h ago•3 comments

4k tons of potatoes to be given away for free in Berlin

https://www.the-berliner.com/english-news-berlin/4000-tons-of-potatoes-to-be-given-away-for-free/
96•mrzool•1h ago•80 comments

A 40-line fix eliminated a 400x performance gap

https://questdb.com/blog/jvm-current-thread-user-time/
342•bluestreak•19h ago•73 comments

Every GitHub object has two IDs

https://www.greptile.com/blog/github-ids
308•dakshgupta•1d ago•67 comments

FBI raids Washington Post reporter's home

https://www.theguardian.com/us-news/2026/jan/14/fbi-raid-washington-post-hannah-natanson
682•echelon_musk•3h ago•401 comments

ASCII Clouds

https://caidan.dev/portfolio/ascii_clouds/
311•majkinetor•16h ago•55 comments

Never-before-seen Linux malware is "more advanced than typical"

https://arstechnica.com/security/2026/01/never-before-seen-linux-malware-is-far-more-advanced-tha...
84•Brajeshwar•3h ago•20 comments