Data recovery companies ought to have the integrity to just say no to a job, if they cannot do it risk free. Trying and failing with the risk of damaging the original data could be very costly to the customer, even if they don't charge money - the customer's lost data could be priceless.
The challenges will be different. Flash loses its charge in 30 years, most disks are encrypted, and on-site physical backups are mostly a thing of the past. The source might survive in a cloud repo, but it'll either be tied up for legal reasons or deleted when the customer stops paying the bill. But storage is cheap and getting cheaper!
Encryption from 30 years ago? Trivially breakable with quantum.
I wouldn't be so sure - quantum computers aren't nearly as effective for symmetric algorithms as they are for pre-quantum asymmetric algorithms.
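Roughly: Shor's algorithm breaks RSA/ECC outright, but the best generic quantum attack on a symmetric key is Grover's search, which only gives a quadratic speedup. A back-of-envelope sketch of what that means for effective key strength (my own illustration, assuming Grover is the only lever available):

    # Grover's search cuts a brute-force keyspace of 2^n down to roughly
    # 2^(n/2) operations, so effective strength is about half the key bits.
    def effective_bits_under_grover(key_bits: int) -> int:
        return key_bits // 2

    for key_bits in (128, 192, 256):
        print(f"AES-{key_bits}: ~2^{effective_bits_under_grover(key_bits)} quantum ops "
              f"vs ~2^{key_bits} classical guesses")

Which is why "double the key length" is the usual advice - AES-256 still leaves about 128 bits of work even against Grover.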
I can't think of any encryption scheme from 30 years ago that isn't just a joke today. The same will likely be true by 2050, quantum computing or not. I wonder how many people realise this?
Even if one disagrees with my certainty, I think people should still plan around the strong probability that it will be so. Encryption is really not about preventing data exposure, but about delaying it.
Any other view regarding encryption means disappointment.
You are underestimating just how large an exponential keyspace is.
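To put a number on it - a quick sketch with a deliberately generous assumed guess rate (10^15 keys per second, my number, purely for illustration):

    SECONDS_PER_YEAR = 365.25 * 24 * 3600
    guesses_per_second = 1e15  # assumed, absurdly generous, for illustration

    for key_bits in (56, 128, 256):
        expected_guesses = 2 ** (key_bits - 1)  # on average you search half the keyspace
        years = expected_guesses / guesses_per_second / SECONDS_PER_YEAR
        print(f"{key_bits}-bit key: ~{years:.3g} years on average")

A 56-bit DES key falls in seconds at that rate; 128 bits is already around 5 x 10^15 years. Breaks come from broken algorithms and leaked keys, not from counting through the keyspace.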
> plus general vulnerabilities and weaknesses discovered, will ensure that anything encrypted today is easily readable in the future.
You can't just assume that there are always going to be new vulnerabilities that break it. That ignores the fact that people have gotten better at designing secure cryptography over time.
An example being destroying sensitive backup media upon retirement, regardless of whether the data is encrypted.
AES is only 3 years shy of 30.
If you used MD5 as a keystream generator I believe that would still be secure and that's 33 years old.
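For concreteness, "MD5 as a keystream generator" would be something like hashing a secret key plus a counter and XORing the output with the data - that leans on MD5's preimage resistance (still unbroken), not the collision resistance that fell. A toy sketch, purely illustrative and not a recommendation for anything new:

    import hashlib

    def md5_keystream_xor(key: bytes, nonce: bytes, data: bytes) -> bytes:
        # Counter-mode keystream from MD5(key || nonce || counter).
        # XOR is its own inverse, so the same call encrypts and decrypts.
        blocks = (len(data) + 15) // 16  # MD5 digests are 16 bytes
        stream = b"".join(
            hashlib.md5(key + nonce + i.to_bytes(8, "big")).digest()
            for i in range(blocks)
        )
        return bytes(b ^ k for b, k in zip(data, stream))

    ct = md5_keystream_xor(b"secret key", b"nonce123", b"hello tape world")
    assert md5_keystream_xor(b"secret key", b"nonce123", ct) == b"hello tape world"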
3DES is still pretty secure, isn't it? That's 44 years old.
As for today's data, there's always risk into the future, but we've gotten better at making secure algorithms over time, and avoiding quantum attacks seems to mostly be a matter of doubling key length. I'd worry more about plain old leaks.
How do you tell which will fall, and which will still hold up in 30 years?
All this said, I just think proper mental framing helps: consider what the encrypted data would be worth in 30 years if the encryption is broken.
In many cases... who cares. In others, it could be unpleasant.
There are a lot of interactive systems that have attacks on their key exchange or authentication. And there are hashes that have collision attacks.
But compromises that let you figure out a key that's no longer in use have not been common for a while. And even MD5 can't be reversed - its preimage resistance still holds.
I agree with you about being wary, but I think encryption itself can be one of the stronger links in the chain, even going out 30 years.
The gold standard 30 years ago was PGP. RSA 1024 or 2048 for key exchange. IDEA symmetric cipher.
This combination is, as far as I am aware, still practically cryptographically secure. Though maybe not in another 10 or 20 years. (RSA 1024 is not that far from being factorable with classical machines.)
In summary, it estimates the cost at $3.5 billion using commodity hardware, and I'd expect a purpose-built system could bring that cost down by an order of magnitude.
The reality is, as soon as humanity figures out how to distinguish between two values (magnetic flux, voltage, pits/lands, etc) we use it to store more data, or move it faster.
The end.
On top of that, static wear leveling can move all your data around while the disk is idle, and TRIM will effectively zero your unused areas when you're not looking.
So, it's a very different landscape.
I enjoyed skimming through this: https://github.com/HighwayFrogs/frogger2-vss/blob/main/teamS...
Take this!
https://github.com/Kneesnap/onstream-data-recovery/blob/main...
My takeaway is that you should choose a passion project as your hobby, put in the time to learn, and do whatever is necessary to achieve your goal on your own or with similarly motivated people, rather than relying on anyone external you have to pay - things seem to go downhill fairly often, and quickly. Is any business a scam to some degree nowadays?
Tape storage can be an absolute nightmare. Most tools will do the writes, some will claim to verify with a read, but few actually test with a full restore. Just because the software says it can read the tape enough to show you the listings does not mean it can read the files themselves. This was alluded to in TFA (TFV?), but I've been there and done that trying to read from a bum tape / bad write. It gets worse if you write on one tape drive and read from another, which is also mentioned in TFV. Now I feel old just thinking about it all.
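"Actually test with a full restore" means restoring to scratch space and comparing the restored files against what you think you backed up. A minimal sketch of that comparison step - the paths are placeholders, and the restore itself still has to be done by your backup software first:

    import hashlib
    from pathlib import Path

    def sha256_of(path: Path) -> str:
        h = hashlib.sha256()
        with path.open("rb") as f:
            for chunk in iter(lambda: f.read(1 << 20), b""):
                h.update(chunk)
        return h.hexdigest()

    def verify_restore(original_root: Path, restored_root: Path) -> list[str]:
        # Walk the original tree and confirm every file came back intact.
        problems = []
        for src in original_root.rglob("*"):
            if not src.is_file():
                continue
            dst = restored_root / src.relative_to(original_root)
            if not dst.is_file():
                problems.append(f"missing: {dst}")
            elif sha256_of(src) != sha256_of(dst):
                problems.append(f"checksum mismatch: {dst}")
        return problems

    # e.g. verify_restore(Path("/data/project"), Path("/tmp/restore-test"))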
I've recovered data, a number of times and with no particular difficulty, from media that a recovery company had said was unrecoverable.
Pretty much all tape backup software writes headers as it is streaming the file to tape. Just more bytes in the buffer.
For normal restores it consults its local database, because that is way faster. If you don't have the local database, you do a "Catalog Tape" operation that scans the file headers on the tape to reconstruct the database. For whatever reason, ARCServe couldn't complete the catalog with that specific kind of tape. Whether that was a problem with the specific version he found or a general problem with support for those tape drives, I don't know.
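Conceptually, a "Catalog Tape" pass is just a linear scan for those in-stream headers to rebuild the index. A toy sketch of the idea - the header layout here is completely made up, not ARCServe's actual on-tape format:

    import struct
    from typing import BinaryIO

    BLOCK = 512
    MAGIC = b"FHDR"  # hypothetical marker; real backup formats differ

    def catalog_tape(tape: BinaryIO) -> list[dict]:
        # Walk the tape image block by block; whenever a block starts with the
        # (made-up) file-header magic, record name, size, and offset so a
        # restore can seek straight to that file later.
        entries, offset = [], 0
        while True:
            block = tape.read(BLOCK)
            if not block:
                break
            if block[:4] == MAGIC and len(block) >= 12:
                name_len, size = struct.unpack_from("<II", block, 4)
                name = block[12:12 + name_len].decode("ascii", "replace")
                entries.append({"name": name, "size": size, "offset": offset})
            offset += len(block)
        return entries

    # with open("tape.img", "rb") as f: print(catalog_tape(f))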
The long road to recover Frogger 2 source from tape drives - https://news.ycombinator.com/item?id=36061574 - May 2023 (213 comments)