frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

Erdos problem #728 was solved more or less autonomously by AI

https://mathstodon.xyz/@tao/115855840223258103
152•cod1r•2h ago•96 comments

JavaScript Demos in 140 Characters

https://beta.dwitter.net
161•themanmaran•5h ago•35 comments

RTX 5090 and Raspberry Pi: Can It Game?

https://scottjg.com/posts/2026-01-08-crappy-computer-showdown/
117•scottjg•5h ago•59 comments

Caltrain shows why every region should be moving toward regional rail

https://www.hsrail.org/blog/caltrain-shows-why-every-region-should-be-moving-toward-regional-rail/
12•gok•20m ago•3 comments

Flock Hardcoded the Password for America's Surveillance Infrastructure 53 Times

https://nexanet.ai/blog/53-times-flocksafety-hardcoded-the-password-for-americas-surveillance-inf...
149•fuck_flock•7h ago•52 comments

Scientists discover oldest poison, on 60k-year-old arrows

https://www.nytimes.com/2026/01/07/science/poison-arrows-south-africa.html
84•noleary•1d ago•24 comments

How will the miracle happen today?

https://kk.org/thetechnium/how-will-the-miracle-happen-today/
322•zdw•5d ago•178 comments

The (likely?) cheapest home-made Michelson interferometer

https://guille.site/posts/3d-printed-michelson/
76•LolWolf•5d ago•36 comments

QtNat – Open you port with Qt UPnP

http://renaudguezennec.eu/index.php/2026/01/09/qtnat-open-you-port-with-qt/
37•jandeboevrie•4h ago•22 comments

How Markdown took over the world

https://www.anildash.com/2026/01/09/how-markdown-took-over-the-world/
115•zdw•6h ago•73 comments

Show HN: EuConform – Offline-first EU AI Act compliance tool (open source)

https://github.com/Hiepler/EuConform
56•hiepler•5h ago•33 comments

Show HN: Scroll Wikipedia like TikTok

https://quack.sdan.io
126•sdan•6h ago•31 comments

Ragdoll Mayhem Maker – a physics-based level editor for my indie game

https://ragdollmayhemmaker.com/
14•anefiox•2d ago•5 comments

Show HN: I made a memory game to teach you to play piano by ear

https://lend-me-your-ears.specr.net
388•vunderba•7h ago•137 comments

See it with your lying ears

https://lcamtuf.substack.com/p/see-it-with-your-lying-ears
7•fratellobigio•23m ago•0 comments

Turn a single image into a navigable 3D Gaussian Splat with depth

https://lab.revelium.studio/ml-sharp
50•ytpete•6h ago•34 comments

Show HN: Rocket Launch and Orbit Simulator

https://www.donutthejedi.com/
79•donutthejedi•5h ago•26 comments

Washington National Opera Is Leaving the Kennedy Center

https://www.nytimes.com/2026/01/09/arts/music/washington-national-opera-kennedy-center.html
25•mikhael•50m ago•3 comments

Amiga Pointer Archive

https://heckmeck.de/pointers/
36•erickhill•9h ago•15 comments

Design duality and the expression problem (2018)

https://www.tedinski.com/2018/02/27/the-expression-problem.html
4•NeutralForest•6d ago•0 comments

Kagi releases alpha version of Orion for Linux

https://help.kagi.com/orion/misc/linux-status.html
328•HelloUsername•11h ago•233 comments

The Vietnam government has banned rooted phones from using any banking app

https://xdaforums.com/t/discussion-the-root-and-mod-hiding-fingerprint-spoofing-keybox-stealing-c...
404•Magnusmaster•7h ago•495 comments

Show HN: Similarity = cosine(your_GitHub_stars, Karpathy) Client-side

https://puzer.github.io/github_recommender/
113•puzer•3d ago•32 comments

Show HN: I built a tool to create AI agents that live in iMessage

https://tryflux.ai/
46•danielsdk•5d ago•23 comments

Show HN: Various shape regularization algorithms

https://github.com/nickponline/shreg
42•nickponline•22h ago•3 comments

Deno has made its PyPI distribution official

https://github.com/denoland/deno/issues/31254
14•zahlman•3h ago•4 comments

Cloudflare CEO on the Italy fines

https://twitter.com/eastdakota/status/2009654937303896492
384•sidcool•7h ago•559 comments

TextMaze

https://robobunny.com/projects/textmaze/html/?page=0
8•kqr•6d ago•1 comments

Exercise can be nearly as effective as therapy for depression

https://www.sciencedaily.com/releases/2026/01/260107225516.htm
276•mustaphah•6h ago•215 comments

Show HN: Repogen – a static site generator for package repositories

https://github.com/ralt/repogen
19•tlar•3d ago•3 comments