frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

ASCII characters are not pixels: a deep dive into ASCII rendering

https://alexharri.com/blog/ascii-rendering
288•alexharri•4h ago•36 comments

The Dilbert Afterlife

https://www.astralcodexten.com/p/the-dilbert-afterlife
168•rendall•1d ago•90 comments

The 600-year-old origins of the word 'hello'

https://www.bbc.com/culture/article/20260113-hello-hiya-aloha-what-our-greetings-reveal
41•1659447091•3h ago•15 comments

ClickHouse acquires Langfuse

https://langfuse.com/blog/joining-clickhouse
127•tin7in•6h ago•45 comments

Map To Poster – Create Art of your favourite city

https://github.com/originalankur/maptoposter
96•originalankur•5h ago•34 comments

US electricity demand surged in 2025 – solar handled 61% of it

https://electrek.co/2026/01/16/us-electricity-demand-surged-in-2025-solar-handled-61-percent/
172•doener•5h ago•131 comments

The recurring dream of replacing developers

https://www.caimito.net/en/blog/2025/12/07/the-recurring-dream-of-replacing-developers.html
6•glimshe•1h ago•7 comments

East Germany balloon escape

https://en.wikipedia.org/wiki/East_Germany_balloon_escape
585•robertvc•22h ago•239 comments

Show HN: Streaming gigabyte medical images from S3 without downloading them

https://github.com/PABannier/WSIStreamer
79•el_pa_b•6h ago•17 comments

Architecture for Disposable Systems

https://tuananh.net/2026/01/15/architecture-for-disposable-systems/
32•tuananh•4h ago•18 comments

Sergei Fedorov's Escape from Soviet Union Helped Save Red Wings (2020)

https://www.freep.com/story/sports/nhl/red-wings/2026/01/12/sergei-fedorov-detroit-red-wings-russ...
14•rmason•4d ago•1 comments

Cloudflare acquires Astro

https://astro.build/blog/joining-cloudflare/
872•todotask2•1d ago•367 comments

Italy investigates Activision Blizzard for pushing in-game purchases

https://techcrunch.com/2026/01/16/italy-investigates-activision-blizzard-for-pushing-in-game-purc...
24•7777777phil•1h ago•0 comments

The 'untouchable hacker god' behind Finland's biggest ever crime

https://www.theguardian.com/technology/2026/jan/17/vastaamo-hack-finland-therapy-notes
77•c420•8h ago•71 comments

Lies, Damned Lies and Proofs: Formal Methods Are Not Slopless

https://www.lesswrong.com/posts/rhAPh3YzhPoBNpgHg/lies-damned-lies-and-proofs-formal-methods-are-...
65•OgsyedIE•3d ago•36 comments

Cursor's latest “browser experiment” implied success without evidence

https://embedding-shapes.github.io/cursor-implied-success-without-evidence/
625•embedding-shape•1d ago•270 comments

High-Level Is the Goal

https://bvisness.me/high-level/
191•tobr•2d ago•83 comments

PCs refuse to shut down after Microsoft patch

https://www.theregister.com/2026/01/16/patch_tuesday_secure_launch_bug_no_shutdown/
98•smurda•4h ago•120 comments

The Risks of AI in Schools Outweigh the Benefits, Report Says

https://www.npr.org/2026/01/14/nx-s1-5674741/ai-schools-education
46•backpackerBMW•2h ago•17 comments

6-Day and IP Address Certificates Are Generally Available

https://letsencrypt.org/2026/01/15/6day-and-ip-general-availability
437•jaas•23h ago•247 comments

FLUX.2 [Klein]: Towards Interactive Visual Intelligence

https://bfl.ai/blog/flux2-klein-towards-interactive-visual-intelligence
177•GaggiX•15h ago•50 comments

After 25 years, Wikipedia has proved that news doesn't need to look like news

https://www.niemanlab.org/2026/01/after-25-years-wikipedia-has-proved-that-news-doesnt-need-to-lo...
134•giuliomagnifico•5h ago•126 comments

Show HN: Microwave – Native iOS app for videos on ATproto

https://testflight.apple.com/join/cVxV1W3g
28•sinned•3d ago•8 comments

LLM Structured Outputs Handbook

https://nanonets.com/cookbooks/structured-llm-outputs
308•vitaelabitur•1d ago•51 comments

Drone Hacking Part 1: Dumping Firmware and Bruteforcing ECC

https://neodyme.io/en/blog/drone_hacking_part_1/
104•tripdout•13h ago•18 comments

AV1 Image File Format Specification Gets an Upgrade with AVIF v1.2.0

https://aomedia.org/blog%20posts/AV1-Image-File-Format-Specification-Gets-an-Upgrade-with-AVIF/
42•breve•4h ago•1 comments

Post-PARA: What survived 4 years of real use

https://cortwave.github.io/posts/post-para/
23•cortwave•5d ago•1 comments

Releasing rainbow tables to accelerate Net-NTLMv1 protocol deprecation

https://cloud.google.com/blog/topics/threat-intelligence/net-ntlmv1-deprecation-rainbow-tables
136•linolevan•17h ago•80 comments

Ask HN: Is it still worth pursuing a software startup?

106•newbebee•13h ago•106 comments

Dell UltraSharp 52 Thunderbolt Hub Monitor

https://www.dell.com/en-us/shop/dell-ultrasharp-52-thunderbolt-hub-monitor-u5226kw/apd/210-bthw/m...
253•cebert•22h ago•317 comments