frontpage.
Substack makes money from hosting Nazi newsletters

https://www.theguardian.com/media/2026/feb/07/revealed-how-substack-makes-money-from-hosting-nazi...
1•mindracer•1m ago•0 comments

A New Crypto Winter Is Here and Even the Biggest Bulls Aren't Certain Why

https://www.wsj.com/finance/currencies/a-new-crypto-winter-is-here-and-even-the-biggest-bulls-are...
1•thm•1m ago•0 comments

Moltbook was peak AI theater

https://www.technologyreview.com/2026/02/06/1132448/moltbook-was-peak-ai-theater/
1•Brajeshwar•1m ago•0 comments

Why Claude Cowork is a math problem Indian IT can't solve

https://restofworld.org/2026/indian-it-ai-stock-crash-claude-cowork/
1•Brajeshwar•1m ago•0 comments

Show HN: Built a space travel calculator with vanilla JavaScript v2

https://www.cosmicodometer.space/
1•captainnemo729•2m ago•0 comments

Why a 175-Year-Old Glassmaker Is Suddenly an AI Superstar

https://www.wsj.com/tech/corning-fiber-optics-ai-e045ba3b
1•Brajeshwar•2m ago•0 comments

Micro-Front Ends in 2026: Architecture Win or Enterprise Tax?

https://iocombats.com/blogs/micro-frontends-in-2026
1•ghazikhan205•4m ago•0 comments

Japanese rice is the most expensive in the world

https://www.cnn.com/2026/02/07/travel/this-is-the-worlds-most-expensive-rice-but-what-does-it-tas...
1•mooreds•4m ago•0 comments

These White-Collar Workers Actually Made the Switch to a Trade

https://www.wsj.com/lifestyle/careers/white-collar-mid-career-trades-caca4b5f
1•impish9208•4m ago•1 comments

The Wonder Drug That's Plaguing Sports

https://www.nytimes.com/2026/02/02/us/ostarine-olympics-doping.html
1•mooreds•5m ago•0 comments

Show HN: Which chef knife steels are good? Data from 540 Reddit threads

https://new.knife.day/blog/reddit-steel-sentiment-analysis
1•p-s-v•5m ago•0 comments

Federated Credential Management (FedCM)

https://ciamweekly.substack.com/p/federated-credential-management-fedcm
1•mooreds•5m ago•0 comments

Token-to-Credit Conversion: Avoiding Floating-Point Errors in AI Billing Systems

https://app.writtte.com/read/kZ8Kj6R
1•lasgawe•6m ago•1 comments

The Story of Heroku (2022)

https://leerob.com/heroku
1•tosh•6m ago•0 comments

Obey the Testing Goat

https://www.obeythetestinggoat.com/
1•mkl95•6m ago•0 comments

Claude Opus 4.6 extends LLM Pareto frontier

https://michaelshi.me/pareto/
1•mikeshi42•7m ago•0 comments

Brute Force Colors (2022)

https://arnaud-carre.github.io/2022-12-30-amiga-ham/
1•erickhill•10m ago•0 comments

Google Translate apparently vulnerable to prompt injection

https://www.lesswrong.com/posts/tAh2keDNEEHMXvLvz/prompt-injection-in-google-translate-reveals-ba...
1•julkali•10m ago•0 comments

(Bsky thread) "This turns the maintainer into an unwitting vibe coder"

https://bsky.app/profile/fullmoon.id/post/3meadfaulhk2s
1•todsacerdoti•11m ago•0 comments

Software development is undergoing a Renaissance in front of our eyes

https://twitter.com/gdb/status/2019566641491963946
1•tosh•11m ago•0 comments

Can you beat ensloppification? I made a quiz for Wikipedia's Signs of AI Writing

https://tryward.app/aiquiz
1•bennydog224•13m ago•1 comments

Spec-Driven Design with Kiro: Lessons from Seddle

https://medium.com/@dustin_44710/spec-driven-design-with-kiro-lessons-from-seddle-9320ef18a61f
1•nslog•13m ago•0 comments

Agents need good developer experience too

https://modal.com/blog/agents-devex
1•birdculture•14m ago•0 comments

The Dark Factory

https://twitter.com/i/status/2020161285376082326
1•Ozzie_osman•14m ago•0 comments

Free data transfer out to internet when moving out of AWS (2024)

https://aws.amazon.com/blogs/aws/free-data-transfer-out-to-internet-when-moving-out-of-aws/
1•tosh•15m ago•0 comments

Interop 2025: A Year of Convergence

https://webkit.org/blog/17808/interop-2025-review/
1•alwillis•16m ago•0 comments

Prejudice Against Leprosy

https://text.npr.org/g-s1-108321
1•hi41•17m ago•0 comments

Slint: Cross Platform UI Library

https://slint.dev/
1•Palmik•21m ago•0 comments

AI and Education: Generative AI and the Future of Critical Thinking

https://www.youtube.com/watch?v=k7PvscqGD24
1•nyc111•21m ago•0 comments

Maple Mono: Smooth your coding flow

https://font.subf.dev/en/
1•signa11•22m ago•0 comments

Speeding up PostgreSQL dump/restore snapshots

https://xata.io/blog/behind-the-scenes-speeding-up-pgstream-snapshots-for-postgresql
156•tudorg•7mo ago

Comments

hadlock•7mo ago
One thing that's sorely needed in the official documentation is a "best practice" for backup/restore from "cold and dark" where you lose your main db in a fire and are now restoring from offsite backups for business continuity. Particularly in the 100-2TB range where probably most businesses lie, and backup/restore can take anywhere from 6 to 72 hours, often in less than ideal conditions. Like many things with SQL there's many ways to do it, but an official roadmap for order of operations would be very useful for backup/restore of roles/permissions, schema etc. You will figure it out eventually, but in my experience the dev and prod db size delta is so large many things that "just work" in the sub-1gb scale really trip you up over 200-500gb. Finding out you did one step out of order (manually, or badly written script) halfway through the restore process can mean hours and hours of rework. Heaven help you if you didn't start a screen session on your EC2 instance when you logged in.
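
One possible shape for that order of operations, as a sketch only: it assumes a custom-format dump plus a separate globals dump, and the file names, database name, and job count are illustrative.

    # 1. Cluster-wide objects first (roles, tablespaces); pg_dump doesn't include these
    #    (taken earlier with: pg_dumpall --globals-only > globals.sql)
    psql -d postgres -f globals.sql

    # 2. Schema, then data, then indexes/constraints, parallelised where possible
    pg_restore --section=pre-data  -d mydb mydb.dump
    pg_restore --section=data      -j 8 -d mydb mydb.dump
    pg_restore --section=post-data -j 8 -d mydb mydb.dump

    # 3. Only then re-enable whatever was turned off (triggers, subscriptions, monitoring)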
forinti•7mo ago
If you can have a secondary database (at another site or on the cloud) being updated with streaming replication, you can switch over very quickly and with little fuss.
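
A minimal sketch of that setup on a recent Postgres (12+), where pg_basebackup -R writes the recovery settings for you; the host, user, and paths are placeholders:

    # On the standby: clone the primary and write primary_conninfo/standby.signal
    pg_basebackup -h primary.example.com -U replicator \
        -D /var/lib/postgresql/data -R -X stream --checkpoint=fast

    # Start Postgres on the standby; it stays in recovery, streaming WAL from the primary.

    # If the primary is lost, promote the standby:
    psql -c "SELECT pg_promote();"   # or: pg_ctl promote -D /var/lib/postgresql/data
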
SoftTalker•7mo ago
Which is what you must do if minimizing downtime is critical.

And, of course, your disaster recovery plan is incomplete until you've tested it (at scale). You don't want to be looking up Postgres documentation when you need to restore from a cold backup; you want to be following the checklist you have in your recovery plan and have already verified.

zie•7mo ago
Sure, but there are lots of failure modes where the failure propagates through streaming replication and all instances end up trashed.
bityard•7mo ago
There needs to be a DBA version of the saying, "RAID is not a backup"
lmz•7mo ago
Just expand RAID to "Replicas At Independent Datacenters".
CoolCold•7mo ago
While I totally agree here on replication/RAID vs backups, I must say that having some weak (in terms of HW resources) replica somewhere in the closet is much, much better than a system with just a single master.
forinti•6mo ago
It's not, but they complement each other nicely.
nijave•7mo ago
Ideally you have an off-site replica you fail over to and don't need to restore.

pg_restore will handle roles, indexes, etc assuming you didn't switch the flags around to disable them

If you're on EC2, hopefully you're using disk snapshots and WAL archiving.
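
For reference, a plain custom-format dump restored without extra flags keeps ownership, privileges, and indexes; these are the sorts of flags that would strip parts of that out (database and file names here are made up):

    pg_dump -Fc -d proddb -f proddb.dump

    # Restores schema, data, indexes, ACLs and ownership as dumped:
    pg_restore -j 8 -d proddb proddb.dump

    # Flags that change that behaviour:
    #   --no-owner  --no-privileges  --schema-only  --data-only  --section=...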

pgwhalen•7mo ago
Of course that’s preferable, but OP is specifically asking about the cold restore case, which tends to pose different problems, and is just as important to maintain and test.
Arbortheus•7mo ago
Offsite replica is only applicable if the cause is a failure of the primary. What if I’m restoring a backup because someone accidentally dropped the wrong table?
ants_everywhere•7mo ago
I would hope dropping a table on a production database is something that is code reviewed
benreesman•7mo ago
Nah, on a long enough timeline everything will go wrong. Blaming the person who finally managed to drop the table is dumb: if you can't fix literally everything that could happen to it, it's not done.
anonymars•7mo ago
Isn't the entirety of disaster recovery about situations that aren't supposed to happen?

High availability is different from disaster recovery

nijave•7mo ago
You can PITR on a replica which would be much faster than restoring a full backup of a large DB
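
A sketch of what PITR looks like on Postgres 12+, assuming WAL archiving is already in place; the paths and target timestamp are illustrative:

    # postgresql.conf on the instance doing the recovery:
    #   restore_command      = 'cp /wal-archive/%f %p'
    #   recovery_target_time = '2025-06-01 14:30:00+00'

    # Then signal recovery mode and start; Postgres replays WAL up to the target
    # and pauses (the default recovery_target_action).
    touch /var/lib/postgresql/data/recovery.signal
    pg_ctl start -D /var/lib/postgresql/data
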
asah•7mo ago
DROP DATABASE :-)
whatevaa•7mo ago
Postgres is not great with off-site replicas unless write volume is low; the replication protocol is very chatty. One of the reasons Uber mentioned when moving to mysql in their engineering blog.
hnarn•7mo ago
> One of the reasons Uber mentioned when moving to mysql in their engineering blog

If I'm not mistaken, this was in 2016 (that's 10 years next year, time flies when you're having fun) -- which is practically an eternity in IT. I'm no DBA but I'm fairly sure many changes have been made to Postgres since then, including logical replication (which can be selective), parallel apply of large transactions in v16, and so on.

I'm not saying this means their points are invalid, I don't know Postgres well enough for that, but any point made almost 10 years ago against one of the most popular and most actively developed options in its field should probably be taken with a pinch of salt.
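
For example, the selective logical replication mentioned above looks roughly like this; the database, table, and host names are invented, and the publisher needs wal_level = logical:

    # On the publisher
    psql -d appdb -c "CREATE PUBLICATION orders_pub FOR TABLE orders, order_items;"

    # On the subscriber (the same table definitions must already exist there)
    psql -d appdb -c "CREATE SUBSCRIPTION orders_sub
        CONNECTION 'host=primary.example.com dbname=appdb user=replicator'
        PUBLICATION orders_pub;"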

ffsm8•7mo ago
> I'm not saying this means their points are invalid, I don't know Postgres well enough for that, but any point made almost 10 years ago against one of the most popular and most actively developed options in its field should probably be taken with a pinch of salt.

Heh, I remember the countless articles after that debacle back then pointing out all the reasons why their migration was entirely pointless; it could've been summed up as "devs not knowing the tools they're working with" before starting multi-million projects to fuel their CV-driven development.

So even if you aren't willing to take it with a pinch of salt, their rationale for the migration was fully debunked even back then.

fulafel•7mo ago
This is oft quoted, but if you read the posts, Uber discovered they didn't want SQL (or apparently transactions etc.), and implemented a NoSQL store that happened to use MySQL as a backend, and that was a much bigger change than moving off PG.
WJW•7mo ago
> in the 100-2TB range where probably most businesses lie

Assuming you mean that range to start at 100GB, I've worked with databases that size multiple times but as a freelancer it's definitely not been "most" businesses in that range.

8n4vidtmkvmk•7mo ago
What then? My 10 year old SaaS is only at about 200MB compressed.
zie•7mo ago
What we do is automated restores. We have an _hourly and a _daily restore that just happen via shell script.

We encourage staff to play with both, and they can play with impunity since it's a copy that will get replaced soon-ish.

This makes it important that both work reliably, which means we know when our backups stop working.

We haven't had a disaster recovery situation yet(hopefully never), but I feel fairly confident that getting the DB back shouldn't be a big deal.
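
A bare-bones sketch of that kind of automated restore job, assuming nightly custom-format dumps land in a known path; the names, paths, and job count are all illustrative:

    #!/bin/sh
    set -e
    DUMP=/backups/proddb-latest.dump
    TARGET=proddb_daily

    # Recreate the scratch copy from the most recent dump
    dropdb --if-exists "$TARGET"
    createdb "$TARGET"
    pg_restore -j 4 -d "$TARGET" "$DUMP"

    # If this exits non-zero, the backup (or the restore path) is broken -- alert on it.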

hadlock•7mo ago
Yes, but did you have to write your own, or did you pull it from an official repo? I'm all for customizing things, but we're a long, long way from pg 8.0; something besides the bare-bones official pg_dump and pg_restore binaries, with their very agnostic and vanilla man pages, would be tremendously useful.
zie•7mo ago
Agreed. We use barman[0] and some shell.

0: https://pgbarman.org
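
For anyone unfamiliar, the barman workflow is roughly the following; the server name and destination path are placeholders, and exact subcommand spellings vary a bit between barman versions:

    barman check pg-main            # verify connectivity and WAL archiving
    barman backup pg-main           # take a base backup
    barman list-backup pg-main      # list available backups
    barman recover pg-main latest /var/lib/postgresql/data    # restore "latest" to a target dir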

vira28•7mo ago
It's one of the areas where the Postgres docs are light.

I don't remember them having a similar doc for setting up HA either.

moribunda•7mo ago
While these optimizations are solid improvements, I was hoping to see more advanced techniques beyond the standard bulk insert and deferred constraint patterns. These are well-established PostgreSQL best practices - would love to see how pgstream handles more complex scenarios like parallel workers with partition-aware loading, or custom compression strategies for specific data types.
bitbasher•7mo ago
pg_bulkload[1] has saved me so much time cold restoring large (1+ TB) databases. It went from 24-72 hours to an hour or two.

I also recommend pg_repack[2] to squash tables on a live system and reclaim disk space. It has saved me so much space.

1: https://ossc-db.github.io/pg_bulkload/pg_bulkload.html

2: https://github.com/reorg/pg_repack
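
Rough usage for both, going by their docs; the database, table, file names, and control-file directives here are illustrative rather than copied from a real setup:

    # pg_bulkload: the load is described by a control file, e.g. events.ctl:
    #   INPUT  = /data/events.csv
    #   OUTPUT = public.events
    #   TYPE   = CSV
    #   WRITER = DIRECT
    pg_bulkload -d mydb events.ctl

    # pg_repack: rebuild a bloated table online and reclaim disk space
    pg_repack --table=public.events mydb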

itsthecourier•7mo ago
I'm just checking it now

do you export the data with this and then import it in the other db with it?

or do you work with existing postgres backups?

bitbasher•7mo ago
There’s a number of options. I mainly work with gzipped CSV dumps that I need to restore.
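
Something along these lines, for example (table and file names invented):

    zcat events.csv.gz | psql -d mydb \
        -c "\copy events FROM STDIN WITH (FORMAT csv, HEADER true)"
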
higeorge13•7mo ago
I checked pg_repack a while ago but some issues are a bit concerning to apply in production. Did you face any issues?
bitbasher•7mo ago
I have never had any issues with it. I've used it mainly on tables that grow constantly and need rolling up once in a while.
jpalawaga•7mo ago
Postgres backups are tricky for sure. Even if you have a DR plan you should assume your incremental backups are no good and you need to restore the whole thing from scratch. That’s your real DR SLA.

If things go truly south, just hope you have a read replica you can use as your new master. Most SLAs are not written with 72h+ of downtime. Have you tried the nuclear recovery plan, from scratch? Does it work?

hnarn•7mo ago
> Even if you have a DR plan you should assume your incremental backups are no good and you need to restore the whole thing from scratch.

"Restore from scratch" can mean a lot of different things, if you have a read replica that you can promote then in relative terms to 72h+ downtime, this should be fairly quick, no?

If you have block-level backups or snapshots, with ZFS for example as someone mentioned, it should also be relatively quick -- although I assume this would make any hypothetical read replica split-brain.

inslee1•7mo ago
Slightly related but how does WAL-G stack up as far as backup/restoration options go for Postgres? https://github.com/wal-g/wal-g
martinrame•7mo ago
What about ZFS snapshots and send/recv for backup and restore? For us this is the cleanest approach, since we use it not only for PostgreSQL but for all the data in our organization. Of course, the underlying filesystem must be ZFS.
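
The basic shape of that, assuming the whole data directory (including pg_wal) lives on one dataset so the snapshot is crash-consistent; dataset, snapshot, and host names are placeholders:

    # Atomic snapshot of the dataset holding PGDATA
    zfs snapshot tank/pgdata@nightly-20250601

    # Ship it to another machine
    zfs send tank/pgdata@nightly-20250601 | ssh backup-host zfs recv backup/pgdata

    # Restore = roll back (or clone) the snapshot and start Postgres,
    # which then performs normal crash recovery from its own WAL.
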
codeflo•7mo ago
Or btrfs. I also think filesystem snapshots are an underrated backup strategy, assuming your data fits on one disk (which should be the case for almost all applications outside of FAANG).
gmokki•7mo ago
Why would btrfs or btrfs snapshots require a single disk? My btrfs pool is a combination of different-sized disks bought over time (3T to 24T) and snapshots work just fine. I've configured it to use raid with 2 copies for data and 3 for metadata.
hnarn•7mo ago
I guess it all depends on your requirements, since this would still cause data loss for the delta time between failure and your last snapshot, but I'm a huge fan of ZFS, and it might be one reason to try out Postgres on FreeBSD, since the only Linux distro that ships ZFS painlessly out of the box is Ubuntu to my knowledge.

I'm also curious how Distributed Replicated Block Device (DRBD) would perform, it would cause obvious latency but perhaps it would be an easier and more efficient solution for a "hot spare" setup than using Postgres native functionality. To my understanding, DRBD can be configured to protect you from hardware IO errors by "detaching" from an erroring disk.

I also don't know if it's a valid point, but I've heard people say that you don't want a fancy CoW filesystem for databases, since much of the functionality offered covers things databases already solve themselves, so you might be sacrificing performance for safety from things that "should not happen"(tm) anyway, depending on how it's set up I guess.

zie•7mo ago
I agree with your overall point. That said: ZFS on Debian is pretty painless. If you have to build/link the kernel, apt will do it all for you, so you don't have to do anything.

ZFS on NixOS is usually quite easy as well, even on / : https://wiki.nixos.org/wiki/ZFS

tudorg•7mo ago
On the Xata platform we actually do CoW snapshots and branching at the block device level, which works great.

However we are developing pgstream in order to bring in data and sync it from other Postgres providers. pgstream can also do anonymisation and, in the future, subsetting. Basically this means that no matter which Postgres service you are using (RDS, CloudSQL, etc.), you can still use Xata for staging and dev branches.