frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Two Bits Are Better Than One: making bloom filters 2x more accurate

https://floedb.ai/blog/two-bits-are-better-than-one-making-bloom-filters-2x-more-accurate
42•matheusalmeida•4d ago

Comments

pkoird•1h ago
Clever. My first impression was that surely this saturates the filter too fast as we're setting more bits at once but looks like the maths checks out. It's one of those non-intuitive things that I am glad I learned today.
lemagedurage•1h ago
True, I had the same feeling. The article does go off 256K elements in a bloom filter of 2M. After 1M elements, using 2 bits actually increases false positive rate, but at that point the false positive rate is higher than 50% already.
vlmutolo•59m ago
This article is a little confusing. I think this is a roundabout way to invent the blocked bloom filter with k=2 bits inserted per element.

It seems like the authors wanted to use a single hash for performance (?). Maybe they correctly determined that naive Bloom filters have poor cache locality and reinvented block bloom filters from there.

Overall, I think block bloom filters should be the default most people reach for. They completely solve the cache locality issues (single cache miss per element lookup), and they sacrifice only like 10–15% space increase to do it. I had a simple implementation running at something like 20ns per query with maybe k=9. It would be about 9x that for native Bloom filters.

There’s some discussion in the article about using a single hash to come up with various indexing locations, but it’s simpler to just think of block bloom filters as:

1. Hash-0 gets you the block index

2. Hash-1 through hash-k get you the bits inside the block

If your implementation slices up a single hash to divide it into multiple smaller hashes, that’s fine.

sakras•17m ago
Yeah I kind of think authors didn't conduct a thorough-enough literature review here. There are well-known relations between number of hash functions you use and the FPR, cache-blocking and register-blocking are classic techniques (Cache-, Hash-, and Space-Efficient Bloom Filters by Putze et. al), and there are even ways of generating patterns from only a single hash function that works well (shamelessly shilling my own blogpost on the topic: https://save-buffer.github.io/bloom_filter.html)

I also find the use of atomics to build the filter confusing here. If you're doing a join, you're presumably doing a batch of hashes, so it'd be much more efficient to partition your Bloom filter, lock the partitions, and do a bulk insertion.

h33t-l4x0r•23m ago
Hmm, Bloom filters seem important. I'm wondering why my CS education never even touched on them and it's tbh triggering my imposter syndrome.

How I use Claude Code: Separation of planning and execution

https://boristane.com/blog/how-i-use-claude-code/
273•vinhnx•5h ago•159 comments

Japanese Woodblock Print Search

https://ukiyo-e.org/
37•curmudgeon22•2h ago•8 comments

Show HN: Llama 3.1 70B on a single RTX 3090 via NVMe-to-GPU bypassing the CPU

https://github.com/xaskasdf/ntransformer
172•xaskasdf•8h ago•39 comments

A Botnet Accidentally Destroyed I2P

https://www.sambent.com/a-botnet-accidentally-destroyed-i2p-the-full-story/
60•Cider9986•4h ago•27 comments

Two Bits Are Better Than One: making bloom filters 2x more accurate

https://floedb.ai/blog/two-bits-are-better-than-one-making-bloom-filters-2x-more-accurate
43•matheusalmeida•4d ago•5 comments

“Playmakers,” reviewed: The race to give every child a toy

https://www.newyorker.com/magazine/2026/02/16/playmakers-the-jewish-entrepreneurs-who-created-the...
7•fortran77•1d ago•0 comments

How far back in time can you understand English?

https://www.deadlanguagesociety.com/p/how-far-back-in-time-understand-english
448•spzb•3d ago•248 comments

Evidence of the bouba-kiki effect in naïve baby chicks

https://www.science.org/doi/10.1126/science.adq7188
103•suddenlybananas•7h ago•29 comments

Scientists discover recent tectonic activity on the moon

https://phys.org/news/2026-02-scientists-tectonic-moon.html
33•bookmtn•4d ago•2 comments

Parse, Don't Validate and Type-Driven Design in Rust

https://www.harudagondi.space/blog/parse-dont-validate-and-type-driven-design-in-rust/
156•todsacerdoti•9h ago•39 comments

Gamedate – A site to revive dead multiplayer games

https://gamedate.org/
26•msuniverse2026•1d ago•3 comments

zclaw: personal AI assistant in under 888 KB, running on an ESP32

https://github.com/tnm/zclaw
136•tosh•17h ago•77 comments

How Taalas "prints" LLM onto a chip?

https://www.anuragk.com/blog/posts/Taalas.html
8•beAroundHere•10h ago•0 comments

Forward propagation of errors through time

https://nicolaszucchet.github.io/Forward-propagation-errors-through-time/
20•iNic•2d ago•0 comments

CXMT has been offering DDR4 chips at about half the prevailing market rate

https://www.koreaherald.com/article/10679206
181•phront•15h ago•163 comments

Claws are now a new layer on top of LLM agents

https://twitter.com/karpathy/status/2024987174077432126
242•Cyphase•1d ago•686 comments

Toyota Mirai hydrogen car depreciation: 65% value loss in a year

https://carbuzz.com/toyota-mirai-massive-depreciation-one-year/
126•iancmceachern•11h ago•289 comments

Carelessness versus Craftsmanship in Cryptography

https://blog.trailofbits.com/2026/02/18/carelessness-versus-craftsmanship-in-cryptography/
3•ingve•3d ago•0 comments

The Human Root of Trust – public domain framework for agent accountability

https://humanrootoftrust.org/
5•3du4rd0v3g4•15h ago•1 comments

EDuke32 – Duke Nukem 3D (Open-Source)

https://www.eduke32.com/
167•reconnecting•9h ago•61 comments

Canvas_ity: A tiny, single-header <canvas>-like 2D rasterizer for C++

https://github.com/a-e-k/canvas_ity
75•PaulHoule•10h ago•25 comments

I verified my LinkedIn identity. Here's what I handed over

https://thelocalstack.eu/posts/linkedin-identity-verification-privacy/
1213•ColinWright•22h ago•428 comments

Be wary of Bluesky

https://kevinak.se/blog/be-wary-of-bluesky
282•kevinak•1d ago•184 comments

Inputlag.science – Repository of knowledge about input lag in gaming

https://inputlag.science
76•akyuu•9h ago•13 comments

Finding forall-exists Hyperbugs using Symbolic Execution

https://dl.acm.org/doi/full/10.1145/3689761
27•todsacerdoti•5d ago•1 comments

What not to write on your security clearance form (1988)

https://milk.com/wall-o-shame/security_clearance.html
418•wizardforhire•12h ago•186 comments

Permacomputing

https://wiki.xxiivv.com/site/permacomputing.html
124•tosh•4d ago•30 comments

A16z partner says that the theory that we’ll vibe code everything is wrong

https://www.aol.com/articles/a16z-partner-says-theory-well-050150534.html
107•paulpauper•1d ago•168 comments

Uncovering insiders and alpha on Polymarket with AI

https://twitter.com/peterjliu/status/2024901585806225723
140•somerandomness•1d ago•131 comments

Keep Android Open

https://f-droid.org/2026/02/20/twif.html
2026•LorenDB•1d ago•694 comments