frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•7mo ago

Comments

jbellis•7mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•7mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

Public Sans – A strong, neutral typeface

https://public-sans.digital.gov/
150•mhb•2h ago•44 comments

Netflix: Open Content

https://opencontent.netflix.com/
389•tosh•6h ago•66 comments

Non-Zero-Sum Games

https://nonzerosum.games/
199•8organicbits•5h ago•59 comments

The British Empire's Resilient Subsea Telegraph Network

https://subseacables.blogspot.com/2025/12/the-british-empires-resilient-subsea.html
67•giuliomagnifico•3h ago•8 comments

The Legacy of Undersea Cables

https://blog.sciencemuseumgroup.org.uk/the-legacy-of-undersea-cables/
21•teleforce•1h ago•2 comments

Postgres extension complements pgvector for performance and scale

https://github.com/timescale/pgvectorscale
48•flyaway123•5d ago•2 comments

Approachable Swift Concurrency

https://fuckingapproachableswiftconcurrency.com/en/
68•wrxd•3h ago•31 comments

Go away Python

https://lorentz.app/blog-item.html?id=go-shebang
191•baalimago•7h ago•141 comments

Hive (YC S14) Is Hiring a Staff Software Engineer (Data Systems)

https://jobs.ashbyhq.com/hive.co/cb0dc490-0e32-4734-8d91-8b56a31ed497
1•patman_h•2h ago

GOG is getting acquired by its original co-founder

https://www.gog.com/blog/gog-is-getting-acquired-by-its-original-co-founder-what-it-means-for-you/
796•haunter•1d ago•470 comments

Times New American: A Tale of Two Fonts

https://hsu.cy/2025/12/times-new-american/
104•firexcy•3h ago•62 comments

No strcpy either

https://daniel.haxx.se/blog/2025/12/29/no-strcpy-either/
118•firesteelrain•3h ago•55 comments

Stranger Things creator says turn off "garbage" settings

https://screenrant.com/stranger-things-creator-turn-off-settings-premiere/
305•1970-01-01•16h ago•539 comments

Show HN: One clean, developer-focused page for every Unicode symbol

https://fontgenerator.design/symbols
108•yarlinghe•5d ago•45 comments

Tesla's 4680 battery supply chain collapses as partner writes down deal by 99%

https://electrek.co/2025/12/29/tesla-4680-battery-supply-chain-collapses-partner-writes-down-dea/
570•coloneltcb•22h ago•632 comments

Hacking Washing Machines [video]

https://media.ccc.de/v/39c3-hacking-washing-machines
171•clausecker•15h ago•35 comments

Nicolas Guillou, French ICC judge sanctioned by the US and “debanked”

https://www.lemonde.fr/en/international/article/2025/11/19/nicolas-guillou-french-icc-judge-sanct...
225•lifeisstillgood•5h ago•157 comments

ManusAI Joins Meta

https://manus.im/blog/manus-joins-meta-for-next-era-of-innovation
282•gniting•18h ago•171 comments

The future of software development is software developers

https://codemanship.wordpress.com/2025/11/25/the-future-of-software-development-is-software-devel...
324•cdrnsf•21h ago•352 comments

Charm Ruby – Glamorous Terminal Libraries for Ruby

https://charm-ruby.dev/
75•todsacerdoti•9h ago•10 comments

UNIX Fourth Edition

http://squoze.net/UNIX/v4/README
86•dcminter•1w ago•8 comments

Concurrent Hash Table Designs

https://bluuewhale.github.io/posts/concurrent-hashmap-designs/
22•signa11•3d ago•1 comments

AI is forcing us to write good code

https://bits.logic.inc/p/ai-is-forcing-us-to-write-good-code
259•sgk284•21h ago•190 comments

Turning an old Amazon Kindle into a eInk development platform (2021)

https://blog.lidskialf.net/2021/02/08/turning-an-old-kindle-into-a-eink-development-platform/
53•fanf2•4d ago•8 comments

2025 Was Another Exceptionally Hot Year

https://e360.yale.edu/digest/2025-second-hottest-year
17•Brajeshwar•1h ago•5 comments

Singapore Study Links Heavy Infant Screen Time to Teen Anxiety

https://www.bloomberg.com/news/articles/2025-12-30/singapore-study-links-heavy-infant-screen-time...
55•1vuio0pswjnm7•3h ago•27 comments

Graph Algorithms in Rayon

https://davidlattimore.github.io/posts/2025/11/27/graph-algorithms-in-rayon.html
36•PaulHoule•4d ago•0 comments

Google is dead. Where do we go now?

https://www.circusscientist.com/2025/12/29/google-is-dead-where-do-we-go-now/
976•tomjuggler•20h ago•773 comments

MongoDB Server Security Update, December 2025

https://www.mongodb.com/company/blog/news/mongodb-server-security-update-december-2025
99•plorkyeran•16h ago•41 comments

Outside, Dungeon, Town: Integrating the Three Places in Videogames (2024)

https://keithburgun.net/outside-dungeon-town-integrating-the-three-places-in-videogames/
95•vector_spaces•15h ago•44 comments