frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

The 26,000-Year Astronomical Monument Hidden in Plain Sight

https://longnow.org/ideas/the-26000-year-astronomical-monument-hidden-in-plain-sight/
151•mkmk•2h ago•27 comments

Instabridge has acquired Nova Launcher

https://novalauncher.com/nova-is-here-to-stay
45•KORraN•1h ago•24 comments

The Unix Pipe Card Game

https://punkx.org/unix-pipe-game/
105•kykeonaut•3h ago•25 comments

Unconventional PostgreSQL Optimizations

https://hakibenita.com/postgresql-unconventional-optimizations
152•haki•6h ago•17 comments

I'm addicted to being useful

https://www.seangoedecke.com/addicted-to-being-useful/
369•swah•9h ago•188 comments

Show HN: wxpath – Declarative web crawling in XPath

https://github.com/rodricios/wxpath
35•rodricios•6d ago•5 comments

Show HN: Mastra 1.0, open-source JavaScript agent framework from the Gatsby devs

https://github.com/mastra-ai/mastra
20•calcsam•3h ago•5 comments

Nvidia Stock Crash Prediction

https://entropicthoughts.com/nvidia-stock-crash-prediction
235•todsacerdoti•4h ago•188 comments

Linux kernel framework for PCIe device emulation, in userspace

https://github.com/cakehonolulu/pciem
181•71bw•12h ago•70 comments

IP Addresses Through 2025

https://www.potaroo.net/ispcol/2026-01/addr2025.html
130•petercooper•6h ago•82 comments

The Zen of Reticulum

https://github.com/markqvist/Reticulum/blob/master/Zen%20of%20Reticulum.md
76•mikece•6h ago•47 comments

Level S4 solar radiation event

https://www.swpc.noaa.gov/news/g4-severe-geomagnetic-storm-levels-reached-19-jan-2026
576•WorldPeas•1d ago•188 comments

De-dollarization: Is the US dollar losing its dominance? (2025)

https://www.jpmorgan.com/insights/global-research/currencies/de-dollarization
467•andsoitis•4h ago•604 comments

Apple testing new App Store design that blurs the line between ads and results

https://9to5mac.com/2026/01/16/iphone-apple-app-store-search-results-ads-new-design/
581•ksec•1d ago•479 comments

Show HN: Ocrbase – pdf → .md/.json document OCR and structured extraction API

https://github.com/majcheradam/ocrbase
70•adammajcher•7h ago•25 comments

Channel3 (YC S25) Is Hiring

https://www.ycombinator.com/companies/channel3/jobs/3DIAYYY-backend-engineer
1•aschiff1•8h ago

IP over Avian Carriers with Quality of Service (1999)

https://www.rfc-editor.org/rfc/rfc2549.html
59•mig4ng•9h ago•24 comments

Reticulum, a secure and anonymous mesh networking stack

https://github.com/markqvist/Reticulum
309•brogu•20h ago•81 comments

Fast Concordance: Instant concordance on a corpus of >1,200 books

https://iafisher.com/concordance/
4•evakhoury•3d ago•0 comments

The Alignment Game (2023)

https://dmvaldman.github.io/alignment-game/
41•dmvaldman•4d ago•9 comments

Running Claude Code dangerously (safely)

https://blog.emilburzo.com/2026/01/running-claude-code-dangerously-safely/
220•emilburzo•8h ago•182 comments

What came first: the CNAME or the A record?

https://blog.cloudflare.com/cname-a-record-order-dns-standards/
437•linolevan•1d ago•150 comments

Increasing the performance of WebAssembly Text Format parser by 350%

https://blog.gplane.win/posts/improve-wat-parser-perf.html
92•gplane•5d ago•30 comments

The secret medieval tunnels that we still don't understand

https://weirdmedievalguys.substack.com/p/the-secret-medieval-tunnels-that
7•coloneltcb•57m ago•0 comments

The coming industrialisation of exploit generation with LLMs

https://sean.heelan.io/2026/01/18/on-the-coming-industrialisation-of-exploit-generation-with-llms/
236•long•1d ago•146 comments

Prediction markets are ushering in a world in which news becomes about gambling

https://www.theatlantic.com/technology/2026/01/america-polymarket-disaster/685662/
452•krustyburger•2d ago•444 comments

Benchmarking a Baseline Fully-in-Place Functional Language Compiler [pdf]

https://trendsfp.github.io/papers/tfp26-paper-12.pdf
34•matt_d•4d ago•5 comments

Nanolang: A tiny experimental language designed to be targeted by coding LLMs

https://github.com/jordanhubbard/nanolang
215•Scramblejams•22h ago•173 comments

Notes on Apple's Nano Texture (2025)

https://jon.bo/posts/nano-texture/
245•dsr12•1d ago•126 comments

Squishy Go

https://puyogo.app/en/
28•kqr•4d ago•8 comments