
Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article. Splade is a cool way to improve the results of a TF/IDF index with minimally invasive changes, and this article obscures that more than it clarifies it.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

It's probably easier to demonstrate if you drop down a level to Lucene; I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE, which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1
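
To make the point about weights concrete, here is a minimal sketch in Python. It is not the article's code: the token names, the weights, and the helper functions `binary_overlap` and `weighted_score` are made up for illustration. It assumes a SPLADE-style representation is a mapping from token to non-negative weight, and that a match is scored as a sparse dot product.

```python
def binary_overlap(query, doc):
    """Count shared tokens and ignore weights (the simplified approach)."""
    return len(query.keys() & doc.keys())


def weighted_score(query, doc):
    """Sparse dot product: sum query_weight * doc_weight over shared tokens."""
    return sum(weight * doc[token] for token, weight in query.items() if token in doc)


# Hypothetical token weights, chosen only to show the difference.
query = {"search": 2.0, "semantic": 1.0}
doc_a = {"search": 0.5, "semantic": 0.5, "keyword": 1.0}
doc_b = {"search": 2.0, "semantic": 1.0}

# Both documents share the same two tokens with the query,
# so token overlap alone cannot tell them apart.
overlap_a = binary_overlap(query, doc_a)
overlap_b = binary_overlap(query, doc_b)

# With weights, doc_b scores higher because it weights the shared tokens more.
score_a = weighted_score(query, doc_a)
score_b = weighted_score(query, doc_b)
```

Under these assumptions the two documents tie on overlap but separate clearly once the weights are used, which is the information the quoted passage says was discarded.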

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields, which allow you to specify the value of every token. Maybe I'll make an update post to try again.
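
A rough sketch of what that could look like follows. It only builds request bodies as Python dicts and does not talk to a cluster. It assumes a recent Elasticsearch 8.x release where the `sparse_vector` field type and the `sparse_vector` query with a `query_vector` parameter are available; check the documentation for your version. The field names `splade` and `text` and the helper names are hypothetical.

```python
import json


def sparse_vector_mapping(field="splade"):
    """Index mapping with a sparse_vector field next to the raw text."""
    return {
        "mappings": {
            "properties": {
                "text": {"type": "text"},
                field: {"type": "sparse_vector"},
            }
        }
    }


def splade_document(text, token_weights, field="splade"):
    """Document body that keeps each SPLADE token together with its weight."""
    return {"text": text, field: token_weights}


def sparse_vector_query(token_weights, field="splade"):
    """Query body that scores documents using the query's token weights."""
    return {
        "query": {
            "sparse_vector": {
                "field": field,
                "query_vector": token_weights,
            }
        }
    }


# Hypothetical token weights standing in for real SPLADE output.
doc_body = splade_document("keyword and semantic search", {"search": 1.8, "semantic": 1.2})
query_body = sparse_vector_query({"search": 2.0, "retrieval": 0.7})
print(json.dumps(query_body, indent=2))
```

The difference from the approach in the post is that the weights travel with the tokens at both index and query time, rather than only the token strings.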
