frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

Apple Is Fighting for TSMC Capacity as Nvidia Takes Center Stage

https://www.culpium.com/p/exclusiveapple-is-fighting-for-tsmc
148•speckx•1h ago•108 comments

25 Years of Wikipedia

https://wikipedia25.org
113•easton•3h ago•63 comments

Show HN: TinyCity – A tiny city SIM for MicroPython (Thumby micro console)

https://github.com/chrisdiana/TinyCity
49•inflam52•2h ago•2 comments

The URL shortener that makes your links look as suspicious as possible

https://creepylink.com/
614•dreadsword•13h ago•119 comments

The Palantir app helping ICE raids in Minneapolis

https://www.404media.co/elite-the-palantir-app-ice-uses-to-find-neighborhoods-to-raid/
314•fajmccain•1h ago•229 comments

Claude Cowork exfiltrates files

https://www.promptarmor.com/resources/claude-cowork-exfiltrates-files
782•takira•20h ago•344 comments

OBS Studio 32.1.0 Beta 1 available

https://github.com/obsproject/obs-studio/releases/tag/32.1.0-beta1
41•Sean-Der•1h ago•9 comments

The 3D Software Rendering Technology of 1998's Thief: The Dark Project (2019)

https://nothings.org/gamedev/thief_rendering.html
63•suioir•5h ago•26 comments

Impeccable Style

https://impeccable.style
60•noemit•3d ago•34 comments

Programming, Evolved: Lessons and Observations

https://github.com/kulesh/dotfiles/blob/main/dev/dev/docs/programming-evolved.md
20•dnw•3h ago•7 comments

Investing with GIFs: A Visual Guide

https://www.ft.com/content/9b1ff0b8-a1e8-4869-8d61-620c5ed32d35
10•7777777phil•5d ago•3 comments

Sinclair C5

https://en.wikipedia.org/wiki/Sinclair_C5
40•jszymborski•4d ago•20 comments

Z80 Mem­ber­ship Card

https://sunrise-ev.com/z80.htm
72•exvi•3d ago•21 comments

Jiga (YC W21) Is Hiring Full Stack Engineers

https://jiga.io/about-us
1•grmmph•4h ago

Ask HN: Share your personal website

736•susam•23h ago•2014 comments

Ask HN: How are you doing RAG locally?

276•tmaly•1d ago•117 comments

Raspberry Pi's New AI Hat Adds 8GB of RAM for Local LLMs

https://www.jeffgeerling.com/blog/2026/raspberry-pi-ai-hat-2/
196•ingve•8h ago•151 comments

The 500k-ton typo: Why data center copper math doesn't add up

https://investinglive.com/news/the-500000-ton-typo-why-data-center-copper-math-doesnt-add-up-2026...
68•thebeardisred•3h ago•97 comments

San Remo Pasta Measurer

https://www.toxel.com/tech/2025/09/17/san-remo-pasta-measurer/
38•surprisetalk•5d ago•37 comments

Show HN: MailPilot – Freedom to go anywhere while your agents work

30•keepamovin•9h ago•36 comments

Scaling long-running autonomous coding

https://cursor.com/blog/scaling-agents
245•samwillis•18h ago•152 comments

Ask HN: What did you find out or explore today?

155•blahaj•22h ago•268 comments

French Court Orders Popular VPNs to Block More Pirate Sites, Despite Opposition

https://torrentfreak.com/french-court-orders-popular-vpns-to-block-more-pirate-sites-despite-oppo...
75•iamnothere•3h ago•50 comments

Crafting Interpreters

https://craftinginterpreters.com/
184•tosh•18h ago•42 comments

Python: Tprof, a Targeting Profiler

https://adamj.eu/tech/2026/01/14/python-introducing-tprof/
47•jonatron•7h ago•1 comments

The State of OpenSSL for pyca/cryptography

https://cryptography.io/en/latest/statements/state-of-openssl/
181•SGran•18h ago•45 comments

Bubblewrap: A nimble way to prevent agents from accessing your .env files

https://patrickmccanna.net/a-better-way-to-limit-claude-code-and-other-coding-agents-access-to-se...
144•0o_MrPatrick_o0•14h ago•111 comments

Show HN: WebTiles – create a tiny 250x250 website with neighbors around you

https://webtiles.kicya.net/
211•dimden•5d ago•33 comments

Show HN: Sparrow-1 – Audio-native model for human-level turn-taking without ASR

https://www.tavus.io/post/sparrow-1-human-level-conversational-timing-in-real-time-voice
94•code_brian•22h ago•23 comments

Handy – Free open source speech-to-text app

https://github.com/cjpais/Handy
152•tin7in•11h ago•86 comments