frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.

GPTZero finds 100 new hallucinations in NeurIPS 2025 accepted papers

https://gptzero.me/news/neurips/
290•segmenta•2h ago•171 comments

Qwen3-TTS Family Is Now Open Sourced: Voice Design, Clone, and Generation

https://qwen.ai/blog?id=qwen3tts-0115
173•Palmik•3h ago•39 comments

It looks like the status/need-triage label was removed

https://github.com/google-gemini/gemini-cli/issues/16728
52•nickswalker•1h ago•13 comments

In Europe, Wind and Solar Overtake Fossil Fuels

https://e360.yale.edu/digest/europe-wind-solar-fossil-fuels
256•speckx•3h ago•228 comments

Show HN: isometric.nyc – giant isometric pixel art map of NYC

https://cannoneyed.com/isometric-nyc/
45•cannoneyed•47m ago•12 comments

Tree-sitter vs. Language Servers

https://lambdaland.org/posts/2026-01-21_tree-sitter_vs_lsp/
105•ashton314•2h ago•30 comments

AnswerThis (YC F25) Is Hiring

https://www.ycombinator.com/companies/answerthis/jobs/r5VHmSC-ai-agent-orchestration
1•ayush4921•39m ago

Miami, Your Waymo Ride Is Ready

https://waymo.com/blog/2026/01/miami-your-waymo-ride-is-ready
30•ChrisArchitect•1h ago•35 comments

Design Thinking Books You Must Read

https://www.designorate.com/design-thinking-books/
192•rrm1977•5h ago•92 comments

Launch HN: Constellation Space (YC W26) – AI for satellite mission assurance

https://constellation-io.com/
6•kmajid•36m ago•0 comments

ISO PDF spec is getting Brotli – ~20 % smaller documents with no quality loss

https://pdfa.org/want-to-make-your-pdfs-20-smaller-for-free/
92•whizzx•6h ago•42 comments

30 Years of ReactOS

https://reactos.org/blogs/30yrs-of-ros/
163•Mark_Jansen•9h ago•80 comments

Joe Armstrong and Jeremy Ruston – Intertwingling the Tiddlywiki with Erlang [video]

https://www.youtube.com/watch?v=Uv1UfLPK7_Q
25•kerim-ca•2d ago•2 comments

Show HN: Sweep, Open-weights 1.5B model for next-edit autocomplete

https://huggingface.co/sweepai/sweep-next-edit-1.5B
470•williamzeng0•18h ago•95 comments

Doctors in Brazil using tilapia fish skin to treat burn victims

https://www.pbs.org/newshour/health/brazilian-city-uses-tilapia-fish-skin-treat-burn-victims
224•kaycebasques•12h ago•71 comments

Your brain on ChatGPT: Accumulation of cognitive debt when using an AI assistant

https://www.media.mit.edu/publications/your-brain-on-chatgpt/
480•misswaterfairy•18h ago•349 comments

Show HN: Interactive physics simulations I built while teaching my daughter

https://www.projectlumen.app/
42•anticlickwise•3d ago•5 comments

We will ban you and ridicule you in public if you waste our time on crap reports

https://curl.se/.well-known/security.txt
760•latexr•6h ago•472 comments

Pragmatic Bitmap Filters in Microsoft SQL Server

https://www.vldb.org/cidrdb/2026/i-cant-believe-its-not-yannakakis-pragmatic-bitmap-filters-in-mi...
6•tanelpoder•5d ago•1 comments

In Praise of APL (1977)

https://www.jsoftware.com/papers/perlis77.htm
77•tosh•8h ago•43 comments

Downtown Denver's office vacancy rate grows to 38.2%

https://coloradosun.com/2026/01/22/denver-downtown-office-vacancy-rate-tenants-workplace/
18•mooreds•28m ago•13 comments

Threat actors expand abuse of Microsoft Visual Studio Code

https://www.jamf.com/blog/threat-actors-expand-abuse-of-visual-studio-code/
246•vinnyglennon•17h ago•250 comments

eBay explicitly bans AI "buy for me" agents in user agreement update

https://www.valueaddedresource.net/ebay-bans-ai-agents-updates-arbitration-user-agreement-feb-2026/
263•bdcravens•20h ago•278 comments

Douglas Adams on the English–American cultural divide over "heroes"

https://shreevatsa.net/post/douglas-adams-cultural-divide/
250•speckx•3h ago•253 comments

The Science of Life and Death in Mary Shelley's Frankenstein

https://publicdomainreview.org/essay/the-science-of-life-and-death-in-mary-shelleys-frankenstein/
16•Anon84•5d ago•1 comments

Claude's new constitution

https://www.anthropic.com/news/claude-new-constitution
535•meetpateltech•1d ago•626 comments

Meet the Alaska Student Arrested for Eating an AI Art Exhibit

https://www.thenation.com/article/society/alaska-student-arrested-eating-ai-art-exhibit/
79•petethomas•3h ago•44 comments

Waiting for dawn in search: Search index, Google rulings and impact on Kagi

https://blog.kagi.com/waiting-dawn-search
413•josephwegner•1d ago•230 comments

Gathering Linux Syscall Numbers in a C Table

https://t-cadet.github.io/programming-wisdom/#2026-01-17-gathering-linux-syscall-numbers
82•phi-system•5d ago•36 comments

Skip is now free and open source

https://skip.dev/blog/skip-is-free/
477•dayanruben•1d ago•216 comments