news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/

23•softwaredoug•2mo ago

Comments

jbellis•2mo ago

I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•2mo ago

You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

Asynchrony is not concurrency

https://kristoff.it/blog/asynchrony-is-not-concurrency/

150•kristoff_it•4h ago•104 comments

How to write Rust in the Linux kernel: part 3

https://lwn.net/SubscriberLink/1026694/3413f4b43c862629/

21•chmaynard•1h ago•0 comments

Ccusage: A CLI tool for analyzing Claude Code usage from local JSONL files

https://github.com/ryoppippi/ccusage

14•kristianp•44m ago•4 comments

Silence Is a Commons by Ivan Illich (1983)

http://www.davidtinapple.com/illich/1983_silence_commons.html

57•entaloneralie•2h ago•8 comments

Shutting Down Clear Linux OS

https://community.clearlinux.org/t/all-good-things-come-to-an-end-shutting-down-clear-linux-os/10716

11•todsacerdoti•22m ago•2 comments

Broadcom to discontinue free Bitnami Helm charts

https://github.com/bitnami/charts/issues/35164

80•mmoogle•4h ago•43 comments

Wii U SDBoot1 Exploit “paid the beak”

https://consolebytes.com/wii-u-sdboot1-exploit-paid-the-beak/

61•sjuut•3h ago•6 comments

Multiplatform Matrix Multiplication Kernels

https://burn.dev/blog/sota-multiplatform-matmul/

44•homarp•4h ago•16 comments

EPA says it will eliminate its scientific reseach arm

https://www.nytimes.com/2025/07/18/climate/epa-firings-scientific-research.html

53•anigbrowl•1h ago•15 comments

lsr: ls with io_uring

https://rockorager.dev/log/lsr-ls-but-with-io-uring/

290•mpweiher•11h ago•150 comments

Valve confirms credit card companies pressured it to delist certain adult games

https://www.pcgamer.com/software/platforms/valve-confirms-credit-card-companies-pressured-it-to-delist-certain-adult-games-from-steam/

139•freedomben•8h ago•139 comments

Meta says it wont sign Europe AI agreement, calling it growth stunting overreach

https://www.cnbc.com/2025/07/18/meta-europe-ai-code.html

81•rntn•6h ago•117 comments

Trying Guix: A Nixer's impressions

https://tazj.in/blog/trying-guix

131•todsacerdoti•3d ago•38 comments

Replication of Quantum Factorisation Records with a VIC-20, an Abacus, and a Dog

https://eprint.iacr.org/2025/1237

57•teddyh•4h ago•13 comments

AI capex is so big that it's affecting economic statistics

https://paulkedrosky.com/honey-ai-capex-ate-the-economy/

180•throw0101c•4h ago•196 comments

Show HN: Molab, a cloud-hosted Marimo notebook workspace

https://molab.marimo.io/notebooks

61•akshayka•5h ago•8 comments

Mango Health (YC W24) Is Hiring

https://www.ycombinator.com/companies/mango-health/jobs/3bjIHus-founding-engineer

1•zachgitt•5h ago

The year of peak might and magic

https://www.filfre.net/2025/07/the-year-of-peak-might-and-magic/

68•cybersoyuz•6h ago•34 comments

CP/M creator Gary Kildall's memoirs released as free download

https://spectrum.ieee.org/cpm-creator-gary-kildalls-memoirs-released-as-free-download

226•rbanffy•13h ago•118 comments

Sage: An atomic bomb kicked off the biggest computing project in history

https://www.ibm.com/history/sage

10•rawgabbit•3d ago•0 comments

Show HN: I built library management app for those who outgrew spreadsheets

https://www.librari.io/

41•hmkoyan•4h ago•27 comments

A New Geometry for Einstein's Theory of Relativity

https://www.quantamagazine.org/a-new-geometry-for-einsteins-theory-of-relativity-20250716/

71•jandrewrogers•8h ago•1 comments

Cancer DNA is detectable in blood years before diagnosis

https://www.sciencenews.org/article/cancer-tumor-dna-blood-test-screening

151•bookofjoe•5h ago•93 comments

Show HN: Simulating autonomous drone formations

https://github.com/sushrut141/ketu

12•wanderinglight•3d ago•2 comments

How I keep up with AI progress

https://blog.nilenso.com/blog/2025/06/23/how-i-keep-up-with-ai-progress/

165•itzlambda•5h ago•85 comments

Benben: An audio player for the terminal, written in Common Lisp

https://chiselapp.com/user/MistressRemilia/repository/benben/home

45•trocado•3d ago•3 comments

Making a StringBuffer in C, and questioning my sanity

https://briandouglas.ie/string-buffer-c/

24•coneonthefloor•3d ago•13 comments

Hundred Rabbits – Low-tech living while sailing the world

https://100r.co/site/home.html

213•0xCaponte•4d ago•60 comments

How to Get Foreign Keys Horribly Wrong

https://hakibenita.com/django-foreign-keys

49•Bogdanp•3d ago•23 comments

When root meets immutable: OpenBSD chflags vs. log tampering

https://rsadowski.de/posts/2025/openbsd-immutable-system-logs/

126•todsacerdoti•15h ago•41 comments