I asked Gemini for a script to move files to Cloudflare R2. It deleted them

https://twitter.com/levelsio/status/1921974501257912563

6•bundie•1mo ago

Comments

qwertox•1mo ago

Rule #1: Always put deletions behind a flag which is disabled for the first couple of test runs.
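A minimal sketch of that rule, assuming a Python script. `move_files`, `upload`, and `delete_source` are hypothetical names for illustration, not anything from the original script. Deletion is off unless the caller opts in:

```python
from pathlib import Path
from typing import Callable, Iterable, List


def move_files(
    paths: Iterable[Path],
    upload: Callable[[Path], None],  # hypothetical uploader supplied by the caller
    delete_source: bool = False,     # deletion stays off unless explicitly enabled
) -> List[Path]:
    """Upload each file; remove the local copy only when delete_source is True."""
    deleted = []
    for path in paths:
        upload(path)
        if delete_source:
            path.unlink()
            deleted.append(path)
        else:
            # Test runs only report what would have been removed.
            print(f"[dry run] would delete {path}")
    return deleted
```

Run it with the default first, compare the printout against what actually landed in the bucket, and only then pass `delete_source=True`.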

turtleyacht•1mo ago

It was truncating filenames, so /pics/1003-46.png overwrote /pics/1003-45.png because both were renamed /pics/1003-.png, or something like that.

qwertox•1mo ago

Truncating file names for the target. Then it proceeded to delete the source file. "Successfully deleted local file: ..."

I mean, look at the printout. It shows that it created the remote file with the truncated filename, then deletes the local file with the correct filename.
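A cheap guard against this failure mode is to compute every destination key up front and refuse to continue if two sources map to the same key. The sketch below assumes Python. `find_key_collisions` is a hypothetical helper, and `buggy_key` is only a stand-in that mimics the truncation described above, not the code Gemini actually produced:

```python
from collections import defaultdict


def find_key_collisions(sources, make_key):
    """Return {destination_key: [sources]} for keys claimed by more than one source."""
    by_key = defaultdict(list)
    for src in sources:
        by_key[make_key(src)].append(src)
    return {key: srcs for key, srcs in by_key.items() if len(srcs) > 1}


def buggy_key(path):
    # Hypothetical stand-in for the bug: drops everything between the last
    # hyphen and the extension, so distinct files end up with the same key.
    stem, ext = path.rsplit(".", 1)
    head, _, _ = stem.rpartition("-")
    return f"{head}-.{ext}"


collisions = find_key_collisions(
    ["/pics/1003-45.png", "/pics/1003-46.png"], buggy_key
)
if collisions:
    print("refusing to upload, colliding keys:", collisions)
```

With a correct key function the result is an empty dict and the upload can go ahead.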

turtleyacht•1mo ago

Oh, I see. Having a flag to skip deletion during test runs is a good rule then.

rvz•1mo ago

Recently there was a story about an updater causing an $8,000 bill because there were no basic automated tests to catch the issue. [0]

The big lesson here is that you should actually test the code you write, and also write automated tests to check that any code generated by an LLM is correct in what it does.

It is also useless to ask another AI to check for mistakes created by another LLM. As you can see in the post, both of them failed to catch the issue.

This is why I don't take the hype around 'vibe-coding' seriously: not only is it not software engineering, it promotes low quality and carelessness over basic testing, and dismisses checking that the software / script works as expected.

Turning $70 problems found in development into $700,000+ costs in production.

There are no more excuses for not adding tests.

[0] https://news.ycombinator.com/item?id=43829006
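For a script like this one, such a test can be quite small. The following is a sketch under stated assumptions: `FakeBucket` is an in-memory stand-in and not a real R2 client, and `move_file` is a hypothetical version of the move step that verifies the stored object before deleting the local file.

```python
import tempfile
import unittest
from pathlib import Path


class FakeBucket:
    """In-memory stand-in for an object store (assumption, not a real R2 client)."""

    def __init__(self):
        self.objects = {}

    def put(self, key, data):
        self.objects[key] = data


def move_file(path, root, bucket):
    """Upload path under its path relative to root, verify it, then delete it."""
    key = path.relative_to(root).as_posix()
    data = path.read_bytes()
    bucket.put(key, data)
    # Delete the source only after the stored object matches what was read.
    if bucket.objects.get(key) != data:
        raise RuntimeError(f"upload verification failed for {key}")
    path.unlink()
    return key


class MoveFileTest(unittest.TestCase):
    def test_similar_names_keep_distinct_keys(self):
        with tempfile.TemporaryDirectory() as tmp:
            root = Path(tmp)
            (root / "pics").mkdir()
            for name in ("1003-45.png", "1003-46.png"):
                (root / "pics" / name).write_bytes(name.encode())

            bucket = FakeBucket()
            for path in sorted(root.rglob("*.png")):
                move_file(path, root, bucket)

            # Both files must survive remotely, each under its full name.
            self.assertEqual(
                sorted(bucket.objects), ["pics/1003-45.png", "pics/1003-46.png"]
            )
            self.assertEqual(bucket.objects["pics/1003-45.png"], b"1003-45.png")
```

Saved as a module, this runs with `python -m unittest`. A key function that truncated names would leave a single object in the fake bucket and fail the first assertion before any real file was touched.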

victorbjorklund•1mo ago

Who runs such an AI generated script without checking the code first?

qwertox•1mo ago

To be fair, the code Gemini outputs in AI Studio is so extremely verbose that it is almost impossible to read through it.

It turns 10 lines of code that are perfectly fine to reason about into 100 lines of unreadable code full of comments and exception handling.

weatherlite•1mo ago

Right, so let's just always run the code as is?

qwertox•1mo ago

No. Not at all. I've settled on discussing my code with Gemini. That way it works very well. I explicitly say "Comment on my code and discuss it" or "Let's discuss code for a script doing this and that. Generate me an outline and let's see where this leads. Don't put comments in the code, nor exception handling, we're just discussing it".

Or you create elaborate System Instructions, since it adheres to them pretty well.

But out-of-the-box, Gemini's coding abilities are unusable due to the verbosity.

I've even gone so far as to tell it that it must understand that I am just a human and have limited bandwidth in my brain, so it should write code which is easy to reason about, and that this is more important than having it handle every possible exception or adding multiline comments.

rsynnott•1mo ago

> To be fair, the code Gemini outputs in AI Studio is so extremely verbose that it is almost impossible to read through it.

In which case, it should simply be considered unusable. Like, the sensible response to "tool is so inadequate that there is no reasonable way to make sure its output is safe" is to _not use that tool_.

rsynnott•1mo ago

In which Roko's Basilisk fires a warning shot.

jethronethro•1mo ago

This is why you test code or a script before running it for real. Live and learn, I guess ...
