A distributed queue in a single JSON file on object storage

https://turbopuffer.com/blog/object-storage-queue

25•Sirupsen•3d ago

Comments

soletta•1h ago

The usual path an engineer takes is to take a complex and slow system and reengineer it into something simple, fast, and wrong. But as far as I can tell from the description in the blog though, it actually works at scale! This feels like a free lunch and I’m wondering what the tradeoff is.

jrjeksjd8d•1h ago

It seems like this is an approach that trades off scale and performance for operational simplicity. They say they only have 1GB of records and they can use a single committer to handle all requests. Failover happens by missing a compare-and-set so there's probably a second of latency to become leader?

This is not to say it's a bad system, but it's very precisely tailored for their needs. If you look at the original Kafka implementation, for instance, it was also very simple and targeted. As you bolt on more use cases and features you lose the simplicity to try and become all things to all people.

formerly_proven•1h ago

Write amplification >9000 mostly

jamescun•1h ago

This post touches on a realisation I made a while ago, just how far you can get with the guarantees and trade-offs of object storage.

What actually _needs_ to be in the database? I've never gone as far as building a job queue on top of object storage, but have been involved in building surprisingly consistent and reliable systems with object storage.

dewey•1h ago

Depending on who hosts your object storage this seems like it could get much more expensive than using a queue table in your database? But I'm also aware that this is a blog post of an object storage company.

Normal_gaussian•55m ago

The original graph appears to simply show the blocking issue of their previous synchronisation mechanism; 10 min to process an item down to 6 min. Any central system would seem to resolve this for them.

In any organisation its good to make choices for simplicity rather than small optimisations - you're optimising maintenance, incident resolution, and development.

Typically I have a small pg server for these things. It'll work out slightly more expensive than this setup for one action, yet it will cope with so much more - extending to all kinds of other queues and config management - with simple management, off the shelf diagnostics etc.

While the object store is neat, there is a confluence of factors which make it great and simple for this workload, that may not extend to others. 200ms latency is a lot for other workloads, 5GB/s doesn't leave a lot of headroom, etc. And I don't want to be asked to diagnose transient issues with this.

So I'm torn. It's simple to deploy and configure from a fresh deployment PoV. Yet it wouldn't be accepted into any deployment I have worked on.

pjc50•49m ago

Several things going on here:

- concurrency is very hard

- .. but object storage "solves" most of that for you, handing you a set of semantics which work reliably

- single file throughput sucks hilariously badly

- .. because 1Gb is ridiculously large for an atomic unit

- (this whole thing resembles a project I did a decade ago for transactional consistency on TFAT on Flash, except that somehow managed faster commit times despite running on a 400Mhz MIPS CPU. Edit: maybe I should try to remember how that worked and write it up for HN)

- therefore, all of the actual work is shifted to the broker. The broker is just periodically committing its state in case it crashes

- it's not clear whether the broker ACKs requests before they're in durable storage? Is it possible to lose requests in flight anyway?

- there's a great design for a message queue system between multiple nodes that aims for at least once delivery, and has existed for decades, while maintaining high throughput: SMTP. Actually, there's a whole bunch of message queue systems?

isoprophlex•37m ago

Is this reinventing a few redis features with an object storage for persistence?

dewey•30m ago

Assuming you already using object storage in your project, but don't use Redis yet it wouldn't be re-inventing but just avoiding an extra dependency that would only be used by a single feature.

jstrong•3m ago

that's A choice.

Diode – Build, program, and simulate hardware

Terence Tao, at 8 years old (1984) [pdf]

ΛProlog: Logic programming in higher-order logic

Show HN: enveil – hide your .env secrets from prAIng eyes

A distributed queue in a single JSON file on object storage

I Ported Coreboot to the ThinkPad X270

Firefox 148 Launches with AI Kill Switch Feature and More Enhancements

Show HN: X86CSS – An x86 CPU emulator written in CSS

Blood test boosts Alzheimer's diagnosis accuracy to 94.5%, clinical study shows

The Age Verification Trap: Verifying age undermines everyone's data protection

Show HN: Steerling-8B, a language model that can explain any token it generates

The Missing Semester of Your CS Education – Revised for 2026

Making Wolfram tech available as a foundation tool for LLM systems

Unsung heroes: Flickr's URLs scheme

UNIX99, a UNIX-like OS for the TI-99/4A (2025)

“Car Wash” test with 53 models

Intel XeSS 3: expanded support for Core Ultra/Core Ultra 2 and Arc A, B series

A simple web we own

Graph Topology and Battle Royale Mechanics

Show HN: PgDog – Scale Postgres without changing the app

Genetic underpinnings of chills from art and music

Ladybird adopts Rust, with help from AI

What it means that Ubuntu is using Rust

Show HN: Cellarium: A Playground for Cellular Automata

Writing code is cheap now

Typed Assembly Language (2000)

Hetzner Prices increase 30-40%

FreeBSD doesn't have Wi-Fi driver for my old MacBook, so AI built one for me

SIM (YC X25) Is Hiring the Best Engineers in San Francisco

The Righteous EV Owners Who Won't Let Their Broken Cars Die

A distributed queue in a single JSON file on object storage

Comments

Diode – Build, program, and simulate hardware

Terence Tao, at 8 years old (1984) [pdf]

ΛProlog: Logic programming in higher-order logic

Show HN: enveil – hide your .env secrets from prAIng eyes

A distributed queue in a single JSON file on object storage

I Ported Coreboot to the ThinkPad X270

Firefox 148 Launches with AI Kill Switch Feature and More Enhancements

Show HN: X86CSS – An x86 CPU emulator written in CSS

Blood test boosts Alzheimer's diagnosis accuracy to 94.5%, clinical study shows

The Age Verification Trap: Verifying age undermines everyone's data protection

Show HN: Steerling-8B, a language model that can explain any token it generates

The Missing Semester of Your CS Education – Revised for 2026

Making Wolfram tech available as a foundation tool for LLM systems

Unsung heroes: Flickr's URLs scheme

UNIX99, a UNIX-like OS for the TI-99/4A (2025)

“Car Wash” test with 53 models

Intel XeSS 3: expanded support for Core Ultra/Core Ultra 2 and Arc A, B series

A simple web we own

Graph Topology and Battle Royale Mechanics

Show HN: PgDog – Scale Postgres without changing the app

Genetic underpinnings of chills from art and music

Ladybird adopts Rust, with help from AI

What it means that Ubuntu is using Rust

Show HN: Cellarium: A Playground for Cellular Automata

Writing code is cheap now

Typed Assembly Language (2000)

Hetzner Prices increase 30-40%

FreeBSD doesn't have Wi-Fi driver for my old MacBook, so AI built one for me

SIM (YC X25) Is Hiring the Best Engineers in San Francisco

The Righteous EV Owners Who Won't Let Their Broken Cars Die