frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•7mo ago

Comments

jbellis•7mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•7mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

Gemini 3 Flash: frontier intelligence built for speed

https://blog.google/products/gemini/gemini-3-flash/
435•meetpateltech•3h ago•202 comments

How SQLite is tested

https://sqlite.org/testing.html
65•whatisabcdefgh•1h ago•2 comments

AWS CEO says replacing junior devs with AI is 'one of the dumbest ideas'

https://www.finalroundai.com/blog/aws-ceo-ai-cannot-replace-junior-developers
392•birdculture•2h ago•216 comments

A Safer Container Ecosystem with Docker: Free Docker Hardened Images

https://www.docker.com/blog/docker-hardened-images-for-every-developer/
157•anttiharju•2h ago•28 comments

Coursera to combine with Udemy

https://investor.coursera.com/news/news-details/2025/Coursera-to-Combine-with-Udemy-to-Empower-th...
271•throwaway019254•7h ago•161 comments

Tell HN: HN was down

304•uyzstvqs•2h ago•190 comments

Why outcome-billing makes sense for AI Agents

https://www.valmi.io/blog/an-imperative-for-ai-agents-outcome-billing-with-valmi/
18•rajvarkala•1h ago•19 comments

AI capability isn't humanness

https://research.roundtable.ai/capabilities-humanness/
9•mdahardy•2h ago•4 comments

Show HN: High-Performance Wavelet Matrix for Python, Implemented in Rust

https://pypi.org/project/wavelet-matrix/
3•math-hiyoko•11m ago•0 comments

FCC chair suggests agency isn't independent, word cut from mission statement

https://www.axios.com/2025/12/17/brendan-carr-fcc-independent-senate-testimony-website
70•jmsflknr•1h ago•39 comments

Notes on Sorted Data

https://amit.prasad.me/blog/sorted-data
37•surprisetalk•6d ago•2 comments

Flick (YC F25) Is Hiring Founding Engineer to Build Figma for AI Filmmaking

https://www.ycombinator.com/companies/flick/jobs/Tdu6FH6-founding-frontend-engineer
1•rayruiwang•2h ago

Zmij: Faster floating point double-to-string conversion

https://vitaut.net/posts/2025/faster-dtoa/
16•fanf2•3d ago•0 comments

Doublespeed hacked, revealing what its AI-generated accounts are promoting

https://www.404media.co/hack-reveals-the-a16z-backed-phone-farm-flooding-tiktok-with-ai-influencers/
62•grahamlee•1h ago•22 comments

AI will make formal verification go mainstream

https://martin.kleppmann.com/2025/12/08/ai-formal-verification.html
767•evankhoury•22h ago•389 comments

Launch HN: Kenobi (YC W22) – Personalize your website for every visitor

15•sarreph•3h ago•34 comments

alpr.watch

https://alpr.watch/
862•theamk•1d ago•409 comments

Announcing the Beta release of ty

https://astral.sh/blog/ty
765•gavide•22h ago•145 comments

No Graphics API

https://www.sebastianaaltonen.com/blog/no-graphics-api
773•ryandrake•1d ago•147 comments

Put SSH keys in .git to make repos USB-portable

https://dansjots.github.io/posts/per-repo-ssh-key/
13•dansjots•2h ago•9 comments

AI's real superpower: consuming, not creating

https://msanroman.io/blog/ai-consumption-paradigm
163•firefoxd•11h ago•112 comments

I created a publishing system for step-by-step coding guides in Typst

https://press.knowledge.dev/p/new-150-pages-rust-guide-create-a
10•deniskolodin•4d ago•2 comments

Learning the oldest programming language (2024)

https://uncenter.dev/posts/learning-fortran/
30•lioeters•6h ago•25 comments

Is Mozilla trying hard to kill itself?

https://infosec.press/brunomiguel/is-mozilla-trying-hard-to-kill-itself
709•pabs3•10h ago•611 comments

No AI* Here – A Response to Mozilla's Next Chapter

https://www.waterfox.com/blog/no-ai-here-response-to-mozilla/
468•MrAlex94•21h ago•263 comments

TLA+ Modeling Tips

http://muratbuffalo.blogspot.com/2025/12/tla-modeling-tips.html
95•birdculture•11h ago•24 comments

Pricing Changes for GitHub Actions

https://resources.github.com/actions/2026-pricing-changes-for-github-actions/
757•kevin-david•1d ago•790 comments

GPT Image 1.5

https://openai.com/index/new-chatgpt-images-is-here/
498•charlierguo•1d ago•240 comments

Thin desires are eating life

https://www.joanwestenberg.com/thin-desires-are-eating-your-life/
665•mitchbob•1d ago•222 comments

Mozilla appoints new CEO Anthony Enzor-Demeo

https://blog.mozilla.org/en/mozilla/leadership/mozillas-next-chapter-anthony-enzor-demeo-new-ceo/
573•recvonline•1d ago•858 comments