Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article. Splade is a cool way to improve the results of a TF/IDF index with minimally invasive changes, and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

It's probably easier to demonstrate if you drop down a level to Lucene; I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1
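
To make the point about weights concrete, here is a minimal sketch comparing weight-free matching with a weighted sparse dot product. The token weights, document names, and function names are invented for illustration, and the overlap count is only a stand-in for "matching without weights"; it is not the article's actual Elasticsearch scoring.

```python
# Hypothetical SPLADE-style expansions: token -> weight.
# All values here are made up for illustration.
query = {"search": 2.0, "semantic": 1.0}
doc_a = {"search": 0.5, "semantic": 0.5, "index": 1.0}
doc_b = {"search": 3.0, "engine": 1.0}


def score_unweighted(q, d):
    """Count shared tokens, ignoring the weights entirely."""
    return float(len(q.keys() & d.keys()))


def score_weighted(q, d):
    """Sparse dot product: sum of query weight * document weight over shared tokens."""
    return sum(w * d[t] for t, w in q.items() if t in d)


for name, doc in (("doc_a", doc_a), ("doc_b", doc_b)):
    print(name, score_unweighted(query, doc), score_weighted(query, doc))
```

With these made-up numbers, the overlap count prefers doc_a (two shared tokens against one), while the weighted score prefers doc_b, because its single shared token carries a large weight on both sides. That reversal is the information lost when the weights are discarded.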

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields, which allow you to specify the value of every token. Maybe I'll make an update post to try again.
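
A rough sketch of what the sparse vector approach might look like, building only the JSON bodies and not calling any client. It assumes a recent Elasticsearch version that offers the `sparse_vector` field type and a `sparse_vector` query accepting a `query_vector` map; check the documentation for your version before relying on it. The field name `splade`, the helper names, and the sample weights are hypothetical.

```python
import json

# Assumption: the target Elasticsearch version supports the `sparse_vector`
# field type and the `sparse_vector` query with `query_vector`.
# The field name "splade" is hypothetical.


def build_mapping(field="splade"):
    """Index mapping with one text field and one sparse vector field."""
    return {
        "mappings": {
            "properties": {
                "text": {"type": "text"},
                field: {"type": "sparse_vector"},
            }
        }
    }


def build_document(text, token_weights, field="splade"):
    """Document body that keeps each SPLADE token together with its weight."""
    kept = {tok: float(w) for tok, w in token_weights.items() if w > 0}
    return {"text": text, field: kept}


def build_query(token_weights, field="splade"):
    """Query body that scores documents against weighted query tokens."""
    return {
        "query": {
            "sparse_vector": {"field": field, "query_vector": dict(token_weights)}
        }
    }


# Made-up weights standing in for real SPLADE output.
doc = build_document("keyword and semantic search", {"search": 1.5, "retrieval": 0.5, "the": 0.0})
print(json.dumps(build_mapping(), indent=2))
print(json.dumps(doc, indent=2))
print(json.dumps(build_query({"search": 2.0}), indent=2))
```

The difference from the original post is that each token is stored with its weight instead of being flattened into a plain list of terms, so the index can use those weights when scoring.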
