frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

Show HN: ChartGPU – WebGPU-powered charting library (1M points at 60fps)

https://github.com/ChartGPU/ChartGPU
419•huntergemmer•7h ago•133 comments

Challenges in join optimization

https://www.starrocks.io/blog/inside-starrocks-why-joins-are-faster-than-youd-expect
16•HermitX•5h ago•0 comments

Claude's new constitution

https://www.anthropic.com/news/claude-new-constitution
176•meetpateltech•6h ago•118 comments

TeraWave Satellite Communications Network

https://www.blueorigin.com/news/blue-origin-introduces-terawave-space-based-network-for-global-co...
95•T-A•3h ago•58 comments

Show HN: Rails UI

https://railsui.com/
72•justalever•3h ago•53 comments

Skip is now free and open source

https://skip.dev/blog/skip-is-free/
202•dayanruben•6h ago•61 comments

Jerry (YC S17) Is Hiring

https://www.ycombinator.com/companies/jerry-inc/jobs/QaoK3rw-software-engineer-core-automation-ma...
1•linaz•44m ago

The WebRacket language is a subset of Racket that compiles to WebAssembly

https://github.com/soegaard/webracket
55•mfru•4d ago•11 comments

OpenAI API Logs: Unpatched data exfiltration

https://www.promptarmor.com/resources/openai-api-logs-unpatched-data-exfiltration
19•takira•2h ago•5 comments

Waiting for dawn in search: Search index, Google rulings and impact on Kagi

https://blog.kagi.com/waiting-dawn-search
169•josephwegner•4h ago•111 comments

Letting Claude play text adventures

https://borretti.me/article/letting-claude-play-text-adventures
36•varjag•5d ago•13 comments

Show HN: TerabyteDeals – Compare storage prices by $/TB

https://terabytedeals.com
6•vektor888•56m ago•3 comments

Show HN: UltraContext – A simple context API for AI agents with auto-versioning

https://ultracontext.ai/
14•ofabioroma•7h ago•11 comments

Three types of LLM workloads and how to serve them

https://modal.com/llm-almanac/workloads
11•charles_irl•5h ago•0 comments

eBay explicitly bans AI "buy for me" agents in user agreement update

https://www.valueaddedresource.net/ebay-bans-ai-agents-updates-arbitration-user-agreement-feb-2026/
51•bdcravens•1h ago•21 comments

Slouching Towards Bethlehem – Joan Didion (1967)

https://www.saturdayeveningpost.com/2017/06/didion/
42•jxmorris12•4h ago•1 comments

JPEG XL Test Page

https://tildeweb.nl/~michiel/jxl/
142•roywashere•5h ago•100 comments

Scientists find a way to regrow cartilage in mice and human tissue samples

https://www.sciencedaily.com/releases/2026/01/260120000333.htm
210•saikatsg•4h ago•56 comments

SIMD programming in pure Rust

https://kerkour.com/introduction-rust-simd
23•randomint64•2d ago•4 comments

Tell HN: Bending Spoons laid off almost everybody at Vimeo yesterday

324•Daemon404•5h ago•276 comments

Show HN: Semantic search engine for Studio Ghibli movie

https://ghibli-search.anini.workers.dev/
7•aninibread•8h ago•2 comments

Nested code fences in Markdown

https://susam.net/nested-code-fences.html
167•todsacerdoti•9h ago•55 comments

Show HN: SpeechOS – Wispr Flow-inspired voice input for any web app

https://www.speechos.ai/
7•gangster_dave•6h ago•5 comments

Can you slim macOS down?

https://eclecticlight.co/2026/01/21/can-you-slim-macos-down/
138•ingve•14h ago•183 comments

Magnetic Remote Control of Biology

https://bsky.app/profile/andrewgyork.bsky.social/post/3mcbrdoftak2l
10•AndrewGYork•2h ago•4 comments

I finally got my sway layout to autostart the way I like it

https://hugues.betakappaphi.com/2026/01/19/sway-layout/
9•__hugues•13h ago•2 comments

Show HN: Company hiring trends and insights from job postings

https://jobswithgpt.com/company-profiles/
28•sp1982•4h ago•4 comments

Linux from Scratch

https://www.linuxfromscratch.org/lfs/view/stable/
303•Alupis•3h ago•77 comments

SmartOS

https://docs.smartos.org/
157•ofrzeta•6h ago•63 comments

PicoPCMCIA – a PCMCIA development board for retro-computing enthusiasts

https://www.yyzkevin.com/picopcmcia/
99•rbanffy•5h ago•25 comments