frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Incremental Backups of Gmail Takeouts

https://baecher.dev/stdout/incremental-backups-of-gmail-takeouts/
36•pbhn•4d ago

Comments

pbhn•4d ago
Gmail takeouts come in an arbitrarily-ordered mbox file; I wanted something a bit more backup friendly so I created a small tool for that purpose and wrote about it.
venusenvy47•47m ago
I've been meaning to try this tool for backing up. I'm curious if anyone else has tried this.

https://github.com/rustmailer/bichon

SanjayMehta•2h ago
Serious question: have you ever needed an email from even 5 years ago?

I only save financial statements and contact information. Everything else gets deleted as soon as possible.

raybb•2h ago
I've enjoyed digging up an old flight itinerary to see how much I paid back in 2015 or just looking at the messages a company replied in support and realizing I'm not buying from them again because they didn't fix the problem.
mantra2•54m ago
Looking up how much things used to cost? I too like being depressed.
xnx•2h ago
> Everything else gets deleted as soon as possible.

What's the advantage to deleting? It's easy to ignore anything old and disk space is cheap. Do you delete old photos?

stephenhuey•1h ago
How strictly do you define need? I've been living as an adult long enough that there have been countless times I've searched for photos and emails from one or two decades ago. I distinctly remember the first time I met an Inbox Zero person. It was so important to her to militantly delete everything she had dealt with, and to me, the disadvantages from that practice far outweigh the advantages.
jawns•1h ago
Inbox Zero just means to deal with messages as they come in, then move them out of the inbox, generally to an archive section.

If she was hard-deleting everything, she wasn't just Inbox Zero, she was F---s Zero, too.

mantra2•55m ago
I have, but very rarely. I could count on one hand how often I’ve needed to dig back more than half a decade ago.

Back when I used Gmail I just kept everything personal and work related but when I moved away and started paying for email storage I took a different approach. It didn’t make sense for me to pay considerably more storage for something I almost never use.

I ended up backing up all of my emails outside of the last 5 years and stored them on an offline drive where I can reference them as eml files if I ever need it.

Going forward once a year I’ll export and purge the oldest year in my account.

omoikane•43m ago
I backed up lots of emails that I deemed precious, but I still search through email first, because sometimes it's just easier to search email than to search my backups.

Also, oftentimes I search email not so much for the content, but to find the timestamp associated with a particular event. I have had to search old email metadata a few times when I get an unexpected question related to time (for example, gmail will ask when you created the account as part of its account recovery process).

viraptor•32m ago
Yes, looking for old documents proving things for the government.
Brajeshwar•27m ago
All the time. I read an interesting thing about someone online, and that name strikes me as someone I have interacted with. I search my email archive, then reply to that thread or start a new one to catch up. All of them have been super happy, “wow! You replied to our email from 10 years ago!”

I do have “Clean Inbox”[1] because I don’t see or interact with them, but I keep them. The only emails I see are the actionable “Unread OR Flagged.”

1. https://brajeshwar.com/2024/email/

cosmic_cheese•16m ago
Maybe not “need” in the strictest sense, but there have been more times than I can count where digging up old mail has either made things much faster and easier or helped me answer a random question that popped into my head about something that happened ages ago.

Old SMS, iMessage, Telegram etc messages have been useful from time to time too for similar reasons.

Both can also serve as exceptional time capsules that provide windows into past “eras” of life. I occasionally kick myself for not having archived mail and messages from a couple of defunct email addresses and chat apps… without them there’s a hole spanning a few years where visibility is limited.

yooogurt•1h ago
> if you want to back this file up regularly with something like restic, then you will quickly end up in a world of pain: since new mails are not even appended to the end of the file, each cycle of takeout-then-backup essentially produces a new giant file.

As I'm sure the author is aware, Restic will do hash-based chunking so that similar files can be efficiently be backed up.

How similar are two successive Takeout mboxes?

If the order of messages within an mbox is stable, and new emails are inserted somewhere, the delta update might be tiny.

Even if the order of the mbox's messages are ~random, Restic's delta updates will forego large attachments.

It would be great to see empirical figures here: how large is the incremental backup after after a month's emails. How does that compare for each backup strategy?

The pro of sticking with restic is simplicity, and also avoiding the risk of your tool managing to screw up the data.

This risk isn't so bad if it's a mature tool that canonicalises mboxes (e.g. order them by time), but seems risky for something handrolled.

Brajeshwar•39m ago
For emails, here is my current simple backup setup. Of course, I’m also looking to do this without having to open Thunderbird, or I might have an old laptop running it. So, work-in-progress.

For the email accounts I want a backup, I set it to spew out POP3 without doing anything (don’t mark read or delete). I set up Thunderbird with that POP3. It has a backup copy of all the emails. I’ve had searchable emails since like 2004/2005, and I’ve occasionally replied to people and gotten back in touch with very old friends from the Internet.

I saw an open-source tool sometime back (I think, here on Hacker News) that backs up your IMAP mails with a nicely done interface. That would be nice to have.

Edit: Perhaps Bichon,[1] mentioned somewhere in the other comment threads[2] was the one.

1. https://github.com/rustmailer/bichon

2. https://news.ycombinator.com/item?id=46429250

tehlike•34m ago
Wouldn't it be nice if Google just dumped the takeout into a sqlite file?
pabs3•14m ago
Why doesn't Google use zip/tar of a Maildir instead? Much better format than mbox. Converting the mbox to Maildir using standard tools would work too.

Google is dead. Where do we go now?

https://www.circusscientist.com/2025/12/29/google-is-dead-where-do-we-go-now/
586•tomjuggler•7h ago•540 comments

Hacking Washing Machines [video]

https://media.ccc.de/v/39c3-hacking-washing-machines
41•clausecker•2h ago•10 comments

MongoDB Server Security Update, December 2025

https://www.mongodb.com/company/blog/news/mongodb-server-security-update-december-2025
47•plorkyeran•3h ago•15 comments

Show HN: Stop Claude Code from forgetting everything

https://github.com/mutable-state-inc/ensue-skill
102•austinbaggio•5h ago•143 comments

Show HN: A Claude Code plugin that catch destructive Git and filesystem commands

https://github.com/kenryu42/claude-code-safety-net
18•kenryu•4d ago•15 comments

Outside, Dungeon, Town: Integrating the Three Places in Videogames (2024)

https://keithburgun.net/outside-dungeon-town-integrating-the-three-places-in-videogames/
41•vector_spaces•3h ago•16 comments

Parsing Advances

https://matklad.github.io/2025/12/28/parsing-advances.html
55•birdculture•4h ago•4 comments

Geology of the Gulf of the Farallones National Marine Sanctuary

https://pubs.usgs.gov/fs/farallones/
38•greesil•5h ago•12 comments

When someone says they hate your product

https://www.getflack.com/p/responding-to-negative-feedback
116•jger15•8h ago•85 comments

Incremental Backups of Gmail Takeouts

https://baecher.dev/stdout/incremental-backups-of-gmail-takeouts/
36•pbhn•4d ago•17 comments

AI is forcing us to write good code

https://bits.logic.inc/p/ai-is-forcing-us-to-write-good-code
101•sgk284•9h ago•86 comments

Static Allocation with Zig

https://nickmonad.blog/2025/static-allocation-with-zig-kv/
171•todsacerdoti•12h ago•85 comments

100x (YC S22) Is Hiring a Front End Engineer

1•shardullavekar•3h ago

I migrated to an almost all-EU stack and saved 500€ per year

https://www.zeitgeistofbytes.com/p/bye-bye-big-tech-how-i-migrated-to
80•alexcos•4h ago•45 comments

Flame Graphs vs Tree Maps vs Sunburst (2017)

https://www.brendangregg.com/blog/2017-02-06/flamegraphs-vs-treemaps-vs-sunburst.html
111•gudzpoz•2d ago•29 comments

Vitest Browser Mode Guide

https://howtotestfrontend.com/resources/vitest-browser-mode-guide-and-setup-info
35•howToTestFE•5d ago•4 comments

Kidnapped by Deutsche Bahn

https://www.theocharis.dev/blog/kidnapped-by-deutsche-bahn/
971•JeremyTheo•15h ago•863 comments

The AI Noise

https://rishi.monster/posts/time-intelligence-economy-part-1-the-ai-noise/
8•wawhal•3d ago•2 comments

Show HN: A 45x45 Connections Puzzle To Commemorate 2025=45*45

https://thomaswc.com/2025.html
31•thomaswc•6d ago•6 comments

A production bug that made me care about undefined behavior

https://gaultier.github.io/blog/the_production_bug_that_made_me_care_about_undefined_behavior.html
120•birdculture•10h ago•72 comments

List of domains censored by German ISPs

https://cuiiliste.de/domains
320•elcapitan•9h ago•135 comments

Stanford Lecture: Dr. Don Knuth – Adventures with Knight's Tours [video]

https://www.youtube.com/watch?v=MKiRte-tnMY
38•vismit2000•5d ago•3 comments

Linux DAW: Help Linux musicians to quickly and easily find the tools they need

https://linuxdaw.org/
207•prmoustache•15h ago•98 comments

All Delisted Steam Games

https://delistedgames.com/all-delisted-steam-games/
214•Bondi_Blue•9h ago•95 comments

Karpathy on Programming: “I've never felt this much behind”

https://twitter.com/karpathy/status/2004607146781278521
361•rishabhaiover•3d ago•398 comments

Show HN: Superset – Terminal to run 10 parallel coding agents

https://superset.sh/
74•avipeltz•6d ago•72 comments

Libgodc: Write Go Programs for Sega Dreamcast

https://github.com/drpaneas/libgodc
208•drpaneas•14h ago•47 comments

Intelligence – A Mystery Investigation Game

https://intelligencegame.tech/
26•throw_a_grenade•4d ago•2 comments

What an unprocessed photo looks like

https://maurycyz.com/misc/raw_photo/
2327•zdw•1d ago•384 comments

Which Humans? (2023)

https://osf.io/preprints/psyarxiv/5b26t_v1
37•surprisetalk•8h ago•19 comments