Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article. Splade is a cool way to improve the results of a TF/IDF index with minimally invasive changes, and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

It's probably easier to demonstrate if you drop down a level to Lucene; I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields, which allow you to specify the value of every token. Maybe I'll make an update post to try again.
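
For anyone skimming the thread, here is a rough sketch of the point about weights. It is plain Python, not Elasticsearch or Lucene code, and the token weights are hand-written hypothetical values rather than output from a real SPLADE model. It compares matching on tokens alone (weights discarded, as in the quoted passage) with a sparse dot product that uses the weights.

```python
from typing import Dict

# A sparse vector here is just a mapping of token -> weight.
SparseVec = Dict[str, float]


def overlap_score(query: SparseVec, doc: SparseVec) -> int:
    """Count shared tokens and ignore their weights (the simplified approach)."""
    return len(query.keys() & doc.keys())


def weighted_score(query: SparseVec, doc: SparseVec) -> float:
    """Sparse dot product: sum query weight * doc weight over shared tokens."""
    return sum(w * doc[tok] for tok, w in query.items() if tok in doc)


# Hypothetical weights, for illustration only.
query = {"search": 2.0, "semantic": 1.5, "retrieval": 0.5}
doc_a = {"search": 0.25, "semantic": 0.25, "cooking": 2.0}  # mentions the topic in passing
doc_b = {"search": 2.0, "semantic": 1.0, "index": 1.0}      # is mostly about the topic

for name, doc in (("doc_a", doc_a), ("doc_b", doc_b)):
    print(name, overlap_score(query, doc), weighted_score(query, doc))
```

Both documents share two tokens with the query, so the unweighted score cannot tell them apart. With the weights applied, doc_b scores well above doc_a, which is the ranking signal that gets lost when the weights are thrown away.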
