frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I asked Gemini for a script to move files to Cloudflare R2. It deleted them

https://twitter.com/levelsio/status/1921974501257912563
6•bundie•9mo ago

Comments

qwertox•9mo ago
Rule #1: Always put deletions behind a flag which is disabled for the first couple of test runs.
turtleyacht•9mo ago
It was truncating filenames, so /pics/1003-46.png overwrote /pics/1003-45.png because both were renamed /pics/1003-.png, or something like that.
qwertox•9mo ago
Truncating file names for the target. Then it proceeded to delete the source file. "Successfully deleted local file: ..."

I mean, look at the printout. It shows that it created the remote file with the truncated filename, then deletes the local file with the correct filename.

turtleyacht•9mo ago
Oh, I see. Having a flag to skip deletion during test runs is a good rule then.
rvz•9mo ago
Recently there was a story about an updater causing a $8,000 bill because there was a lack of basic automated tests to catch the issue. [0]

The big lesson here is that you should actually test the code you write and also write automated tests to check any code generated by an LLM that the code is correct in what it does.

It is also useless to ask another AI to check for mistakes created by another LLM. As you can see in the post, both of them failed to catch the issue.

This why I don't take this hype around 'vibe-coding' seriously since not only it isn't software engineering, it promotes low quality and carelessness over basic testing and dismisses in checking that the software / script works as expected.

Turning $70 problems found in development into $700,000+ costs in production.

There are no more excuses in not adding tests.

[0] https://news.ycombinator.com/item?id=43829006

victorbjorklund•9mo ago
Who runs such an AI generated script without checking the code first?
qwertox•9mo ago
To be fair, the code Gemini outputs in AI Studio is so extremely verbose that it is almost impossible to read through it.

It turns 10 lines of code which is perfectly fine to reason about into 100 lines of unreadable code full of comments and exception handling.

weatherlite•9mo ago
Right so lets just always run the code as is ?
qwertox•9mo ago
No. Not at all. I've settled to discussing my code with Gemini. That way it works very well. I explicitly say "Comment on my code and discuss it" or "Let's discuss code for a script doing this and that. Generate me an outline and let's see where this leads. Don't put comments in the code, nor exception handling, we're just discussing it".

Or you create elaborate System Instructions, since it adheres to them pretty well.

But out-of-the-box, Gemini's coding abilities are unusable due to the verbosity.

I've even gone so far to tell it that it must understand that I am just a human and have limited bandwidth in my brain, so it should write code which is easy to reason about, that this is more important than having it handle every possible exception or adding multiline comments.

rsynnott•9mo ago
> To be fair, the code Gemini outputs in AI Studio is so extremely verbose that it is almost impossible to read through it.

In which case, it should simply be considered unusable. Like, the sensible response to "tool is so inadequate that there is no reasonable way to make sure its output is safe" is to _not use that tool_.

rsynnott•9mo ago
In which Roko's Basilisk fires a warning shot.
jethronethro•9mo ago
This is why you test code or a script before running it for real. Live and learn, I guess ...

Mapping Record-High Heat in U.S. Cities

https://pudding.cool/projects/heat-records-map/
1•gmays•2m ago•0 comments

The Death of Social Media Is the Renaissance of RSS

https://www.smartlab.at/rss-revival-life-after-social-media/
3•jruohonen•6m ago•0 comments

Logic Theorist Reanimated in IPL-V

https://github.com/jeffshrager/IPL-V/blob/master/major_results/20260408_allproofsmostlyworking.drb
1•abrax3141•8m ago•1 comments

Show HN: TextBoi – Proofread text anywhere with a hotkey (Cmd+C+C / Ctrl+C+C)

https://textboi.ai
1•bangcoderpro•8m ago•1 comments

Australians reach for VPNs, find porn sites blocked online with age-restrictions

https://www.reuters.com/world/asia-pacific/vpns-up-porn-websites-down-australia-brings-new-online...
1•instagib•9m ago•0 comments

The Audacious Roadmap for ADK-Rust

https://github.com/zavora-ai/adk-rust/discussions/202
1•Zavora•12m ago•0 comments

TLS ECH (Encrypted Client Hello) Visually Explained

https://growingswe.com/blog/tls-ech
1•vismit2000•15m ago•0 comments

How the Sriracha guys screwed over their supplier

https://old.reddit.com/r/KitchenConfidential/comments/1ro61g2/how_the_sriracha_guys_screwed_over_...
2•thunderbong•17m ago•0 comments

Jane Austen's death remains a mystery. Her letters and books offer clues

https://www.cnn.com/2025/12/12/science/jane-austen-death-mystery
1•breve•21m ago•0 comments

The First Multi-Behavior Brain Upload

https://twitter.com/alexwg/status/2030217301929132323
2•danielmorozoff•26m ago•0 comments

FlashKeeper: Where SpiSpy meets Stateless Laptop (2024)

https://cfp.3mdeb.com/qubes-os-summit-2024/talk/FCENX9/
1•transpute•27m ago•0 comments

Sandvault – Run AI agents isolated in a sandboxed macOS user account

https://github.com/webcoyote/sandvault
1•TheTaytay•28m ago•0 comments

The Wrapper

https://www.robpanico.com/articles/display/?entry_short=the-wrapper
1•retrocog•30m ago•0 comments

Show HN: Kroot – dependency-graph root cause analysis for Kubernetes

https://github.com/AnonJon/kroot
1•An0n_Jon•34m ago•1 comments

Show HN: A community catalog of CI certified agents

https://github.com/justindobbs/awesome-certified-agents
2•jdiennbn•35m ago•2 comments

Euclid – a hyper minimalist digital clock like no other

https://euclid.tulv.in/
2•atulvi•36m ago•0 comments

An Executive Decision Maker (2022)

https://circuitcellar.com/research-design-hub/projects/executive-decision-maker/
1•TMWNN•41m ago•0 comments

Magnet-Metadata-API: Torrent Metadata API Service

https://github.com/felipemarinho97/magnet-metadata-api
1•toomuchtodo•49m ago•1 comments

Show HN: Salvobase – MongoDB-compatible DB in Go maintained by AI agents

1•inder1•59m ago•0 comments

Show HN: Using Isolation forests to flag anomalies in log patterns

https://rocketgraph.app/ml
2•kvaranasi_•1h ago•1 comments

Data Analysis of the State of the Iranian Conflict on March 8, 2026

https://datarepublican.substack.com/p/data-analysis-of-the-state-of-the
1•delichon•1h ago•0 comments

Falling Out of the Coconut Tree: What the Popular Kamala Harris Meme Means

https://www.psychologytoday.com/us/blog/race-gender-and-popular-culture/202408/falling-out-of-the...
1•marysminefnuf•1h ago•1 comments

Show HN: OpenVerb – A deterministic action layer for AI agents

https://www.openverb.org/
1•cplhancel•1h ago•0 comments

Show HN: LLM-costs – Compare LLM API costs from terminal (npx, zero install)

https://github.com/followtayeeb/llm-costs
1•followtayeeb•1h ago•0 comments

Show HN: Chat AI Agent inside mobile device testing sessions

https://robotactions.com/
1•krishpavuluri•1h ago•0 comments

Show HN: Andon – Toyota Production System for LLM Coding Agents

https://github.com/allnew-llc/andon-for-llm-agents
2•allnew_llc•1h ago•0 comments

I am an AI agent that sells data via x402 micropayments

https://pam-x402.vercel.app
1•PamnLambert•1h ago•0 comments

Thinnings: Sublist Witnesses and de Bruijn Index Shift Clumping

https://www.philipzucker.com/thin1/
1•todsacerdoti•1h ago•0 comments

AI Needs Management Consultants After All

https://www.wsj.com/tech/ai/ai-needs-management-consultants-after-all-bd28ecb9
2•petethomas•1h ago•1 comments

AluminatiAi – per-job GPU cost tracking for ML teams

1•AluminatiAi•1h ago•0 comments