frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: I used Claude Code to discover connections between 100 books

https://trails.pieterma.es/
92•pmaze•5h ago•19 comments

Open Chaos: A self-evolving open-source project

https://www.openchaos.dev/
261•stefanvdw1•6h ago•47 comments

Worst of Breed Software

https://worstofbreed.net/
48•facundo_olano•1h ago•12 comments

Finding and Fixing Ghostty's Largest Memory Leak

https://mitchellh.com/writing/ghostty-memory-leak-fix
85•thorel•3h ago•22 comments

AI is a business model stress test

https://dri.es/ai-is-a-business-model-stress-test
106•amarsahinovic•5h ago•139 comments

A Eulogy for Dark Sky, a Data Visualization Masterpiece (2023)

https://nightingaledvs.com/dark-sky-weather-data-viz/
327•skadamat•9h ago•146 comments

Rats caught on camera hunting flying bats

https://scienceclock.com/rats-caught-on-camera-hunting-flying-bats-for-the-first-time/
51•akg130522•3h ago•6 comments

ASCII-Driven Development

https://medium.com/@calufa/ascii-driven-development-850f66661351
59•_hfqa•2d ago•35 comments

I replaced Windows with Linux and everything's going great

https://www.theverge.com/tech/858910/linux-diary-gaming-desktop
431•rorylawless•6h ago•361 comments

Side-by-side comparison of how AI models answer moral dilemmas

https://civai.org/p/ai-values
48•jesenator•2d ago•33 comments

New information extracted from Snowden PDFs through metadata version analysis

https://libroot.org/posts/going-through-snowden-documents-part-4/
251•libroot•10h ago•114 comments

ChatGPT Health is a marketplace, guess who is the product?

https://consciousdigital.org/chatgpt-health-is-a-marketplace-guess-who-is-the-product/
185•yoaviram•2d ago•196 comments

Bichon: A lightweight, high-performance Rust email archiver with WebUI

https://github.com/rustmailer/bichon
41•rendx•2h ago•16 comments

Extracting books from production language models (2026)

https://arxiv.org/abs/2601.02671
6•logicprog•1h ago•0 comments

Bindless Oriented Graphics Programming

https://alextardif.com/BindlessProgramming.html
24•ibobev•3d ago•2 comments

UpCodes (YC S17) is hiring PMs, SWEs to automate construction compliance

https://up.codes/careers?utm_source=HN
1•Old_Thrashbarg•5h ago

Org Mode Syntax Is One of the Most Reasonable Markup Languages to Use for Text

https://karl-voit.at/2017/09/23/orgmode-as-markup-only/
210•adityaathalye•13h ago•159 comments

The 8 ways that all the elements in the Universe are made

https://bigthink.com/starts-with-a-bang/8-ways-elements-made/
7•zdw•5d ago•0 comments

How your high school affects your chances of UC Admission

https://sfeducation.substack.com/p/how-your-high-school-affects-your
37•mutator•2d ago•80 comments

Distributed Denial of Secrets

https://ddosecrets.com/
41•sabakhoj•2d ago•9 comments

Drones that recharge directly on transmission lines

https://www.ycombinator.com/companies/voltair
134•alphabetatango•5h ago•101 comments

NASA announces unprecedented return of sick ISS astronaut and crew

https://www.livescience.com/space/space-exploration/nasa-cancels-spacewalk-and-considers-early-cr...
68•bookofjoe•8h ago•70 comments

UK Orders Ofcom to Explore Encryption Backdoors

https://reclaimthenet.org/uk-orders-ofcom-to-explore-encryption-backdoors
14•worldofmatthew•46m ago•1 comments

“Erdos problem #728 was solved more or less autonomously by AI”

https://mathstodon.xyz/@tao/115855840223258103
590•cod1r•23h ago•331 comments

Tesla's Germany Sales Down 72% from Their Peak

https://cleantechnica.com/2026/01/08/teslas-germany-sales-down-72-from-their-peak/
17•01-_-•48m ago•1 comments

UK government exempting itself from cyber law inspires little confidence

https://www.theregister.com/2026/01/10/csr_bill_analysis/
272•DyslexicAtheist•8h ago•55 comments

Httpz – Zero-Allocation HTTP/1.1 Parser for OxCaml

https://github.com/avsm/httpz
67•noelwelsh•3d ago•17 comments

GPU memory snapshots: sub-second startup (2025)

https://modal.com/blog/gpu-mem-snapshots
16•jxmorris12•2d ago•5 comments

Allow me to introduce, the Citroen C15

https://eupolicy.social/@jmaris/115860595238097654
651•colinprince•11h ago•441 comments

Sinclair C5

https://en.wikipedia.org/wiki/Sinclair_C5
23•jszymborski•1h ago•9 comments
Open in hackernews

Bridging the gap between keyword and semantic search with SPLADE (2024)

http://arcturus-labs.com/blog/2024/10/09/bridging-the-gap-between-keyword-and-semantic-search-with-splade/
23•softwaredoug•8mo ago

Comments

jbellis•8mo ago
I'm kind of disappointed in this article, Splade is a cool way to improve results of a TF/IDF index with minimally invasive changes and this obscures that more than it clarifies.

> Next, my SPLADE implementation in Elasticsearch is oversimplified. If you scroll back up to get_splade_embedding, we extract non-zero elements from vec_np (the SPLADE tokens) but discard their associated weights. This is a missed opportunity. The SPLADE papers use these weights for scoring matches.

Yes, exactly, that is the whole point of Splade.

Probably easier to demonstrate if you drop down a level to Lucene, I don't think you will be able to do it easily with Elastic.

Tangentially, I haven't looked closely at SPLATE which tries to marry Splade and ColBERT, but it's an interesting idea. https://arxiv.org/html/2404.13950v1

JnBrymn•8mo ago
You're absolutely right. This was a post I tossed together quickly just to see what could be done without thinking too much. In retrospect, I think this would be better implemented using Elasticsearch sparse vector fields which allow you to specify the value of every token. Maybe I'l make an update post to try again.