
Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article. Splade is a cool way to improve the results of a TF-IDF index with minimally invasive changes, and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

It's probably easier to demonstrate if you drop down a level to Lucene; I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields, which allow you to specify the value of every token. Maybe I'll make an update post to try again.
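
For readers following the thread, here is a minimal sketch of the difference being discussed: keeping the SPLADE weights and scoring by a sparse dot product, versus discarding the weights and only matching tokens. This is not the article's code. The helper names (`to_sparse`, `weighted_score`, `unweighted_score`), the four-word vocabulary, and all weights are made up for illustration; a real SPLADE vector would come from the model and span its full vocabulary.

```python
# Hypothetical helpers and toy data, for illustration only.

def to_sparse(vec, id_to_token):
    """Keep every non-zero entry together with its weight (token -> weight)."""
    return {id_to_token[i]: float(w) for i, w in enumerate(vec) if w > 0}


def weighted_score(query, doc):
    """Sparse dot product over the tokens shared by query and document."""
    return sum(w * doc[t] for t, w in query.items() if t in doc)


def unweighted_score(query, doc):
    """Score when weights are discarded: just a count of shared tokens."""
    return len(set(query) & set(doc))


# Toy vocabulary standing in for the model's token list (assumption).
vocab = ["search", "keyword", "semantic", "banana"]

query = to_sparse([1.5, 0.0, 2.0, 0.0], vocab)
doc_a = to_sparse([0.5, 0.0, 2.0, 0.0], vocab)   # strong on "semantic"
doc_b = to_sparse([2.0, 0.0, 0.25, 0.0], vocab)  # strong on "search"

print(weighted_score(query, doc_a), weighted_score(query, doc_b))
print(unweighted_score(query, doc_a), unweighted_score(query, doc_b))
```

Both toy documents share the same two tokens with the query, so the unweighted count cannot separate them, while the weighted dot product can. The token-to-weight dictionary returned by `to_sparse` is also roughly the shape a sparse vector field would be expected to store, though the exact mapping and query syntax depend on the Elasticsearch version.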
