frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I asked Gemini for a script to move files to Cloudflare R2. It deleted them

https://twitter.com/levelsio/status/1921974501257912563
6•bundie•11mo ago

Comments

qwertox•11mo ago
Rule #1: Always put deletions behind a flag which is disabled for the first couple of test runs.
turtleyacht•11mo ago
It was truncating filenames, so /pics/1003-46.png overwrote /pics/1003-45.png because both were renamed /pics/1003-.png, or something like that.
qwertox•11mo ago
Truncating file names for the target. Then it proceeded to delete the source file. "Successfully deleted local file: ..."

I mean, look at the printout. It shows that it created the remote file with the truncated filename, then deletes the local file with the correct filename.

turtleyacht•11mo ago
Oh, I see. Having a flag to skip deletion during test runs is a good rule then.
rvz•11mo ago
Recently there was a story about an updater causing a $8,000 bill because there was a lack of basic automated tests to catch the issue. [0]

The big lesson here is that you should actually test the code you write and also write automated tests to check any code generated by an LLM that the code is correct in what it does.

It is also useless to ask another AI to check for mistakes created by another LLM. As you can see in the post, both of them failed to catch the issue.

This why I don't take this hype around 'vibe-coding' seriously since not only it isn't software engineering, it promotes low quality and carelessness over basic testing and dismisses in checking that the software / script works as expected.

Turning $70 problems found in development into $700,000+ costs in production.

There are no more excuses in not adding tests.

[0] https://news.ycombinator.com/item?id=43829006

victorbjorklund•11mo ago
Who runs such an AI generated script without checking the code first?
qwertox•11mo ago
To be fair, the code Gemini outputs in AI Studio is so extremely verbose that it is almost impossible to read through it.

It turns 10 lines of code which is perfectly fine to reason about into 100 lines of unreadable code full of comments and exception handling.

weatherlite•11mo ago
Right so lets just always run the code as is ?
qwertox•11mo ago
No. Not at all. I've settled to discussing my code with Gemini. That way it works very well. I explicitly say "Comment on my code and discuss it" or "Let's discuss code for a script doing this and that. Generate me an outline and let's see where this leads. Don't put comments in the code, nor exception handling, we're just discussing it".

Or you create elaborate System Instructions, since it adheres to them pretty well.

But out-of-the-box, Gemini's coding abilities are unusable due to the verbosity.

I've even gone so far to tell it that it must understand that I am just a human and have limited bandwidth in my brain, so it should write code which is easy to reason about, that this is more important than having it handle every possible exception or adding multiline comments.

rsynnott•11mo ago
> To be fair, the code Gemini outputs in AI Studio is so extremely verbose that it is almost impossible to read through it.

In which case, it should simply be considered unusable. Like, the sensible response to "tool is so inadequate that there is no reasonable way to make sure its output is safe" is to _not use that tool_.

rsynnott•11mo ago
In which Roko's Basilisk fires a warning shot.
jethronethro•11mo ago
This is why you test code or a script before running it for real. Live and learn, I guess ...

Show HN: NoonFlow – a macOS workspace I built for Claude Code and Codex

https://github.com/heyallencao/NoonFlow/releases
1•AllenCao•32s ago•0 comments

Political Discontent in China Is Associated with Isolating Personality Traits

https://www.journals.uchicago.edu/doi/abs/10.1086/719273
1•xnhbx•3m ago•0 comments

Apple's New CEO Must Launch 5 Product Ecosystems or Miss the Next iPhone Moment

https://greentic-ai.medium.com/apples-next-ceo-must-launch-these-5-product-ecosystems-or-miss-the...
1•mectors•3m ago•0 comments

Show HN: Playing music over movies causes them to sync

1•e1ghtSpace•4m ago•0 comments

Colorado citizens pay with rights, xAI pays nothing, long live democracy

https://aitwerp.com/signals/xai-colorado-discrimination-federal/
1•Inziu•9m ago•1 comments

Hire based on the conversation about code, not the code itself

https://dbarabashh.com/thoughts-and-experience/hire-for-the-conversation-not-the-code
1•birdculture•15m ago•0 comments

LogAct: Enabling agentic reliability via shared logs

https://arxiv.org/abs/2604.07988
1•pramodbiligiri•22m ago•0 comments

Show HN: A faithful offline recreation of the classic MS-DOS Editor

https://pascar.run
1•mysticmode•40m ago•0 comments

Managing the Unmanaged Switch

https://watchmysys.com/blog/2026/03/managing-the-unmanaged-switch/
1•luu•41m ago•0 comments

Show HN: I fixed sleep schedule by tracking solar time, so I built an app for it

https://sun.mikolajmocek.com/
1•mondonno•43m ago•0 comments

Richard Sutton – Father of RL thinks LLMs are a dead end [video]

https://www.youtube.com/watch?v=21EYKqUsPfg
2•mpweiher•43m ago•1 comments

Scaling Camera File Processing at Netflix

https://netflixtechblog.com/scaling-camera-file-processing-at-netflix-6dab2b1e80be
2•redblueflame•47m ago•0 comments

Show HN: Agent MCP Studio – build multi-agent MCP systems in a browser tab

https://www.agentmcp.studio
3•stealthtsdb•47m ago•0 comments

Testing GPT-5.5 in early access: what we are seeing so far

https://lovable.dev/blog/gpt-5-5-now-in-lovable
2•doener•52m ago•1 comments

Escrow Security for iCloud Keychain

https://support.apple.com/guide/security/escrow-security-for-icloud-keychain-sec3e341e75d/web
2•gurjeet•57m ago•0 comments

Tewart Brand on LSD, A.I. Black Boxes and the Beauty of Care [video]

https://www.youtube.com/watch?v=t8u24wvHeSE
1•born-jre•1h ago•0 comments

Code review advice for vibe coders

https://xata.io/blog/code-review-for-vibe-coders
2•tee-es-gee•1h ago•0 comments

Show HN: A CLI to use any model in your coding agent

https://getaivo.dev/
2•spirit23•1h ago•0 comments

What is Nostr? A simple guide to the protocol

https://usenostr.org/
1•vlugorilla•1h ago•0 comments

The Tiny Donut That Proved We Still Don't Understand Magnetism [video]

https://www.youtube.com/watch?v=XKSjCOKDtpk
1•mpweiher•1h ago•0 comments

List of personal sites that host Wander console, a tool to explore the small web

https://susam.codeberg.page/wander/wcn.html
2•susam•1h ago•0 comments

Naming Things Is Easy Now

https://notesbylex.com/naming-things-is-easy-now
3•lexandstuff•1h ago•0 comments

I left Vercel Pro ($20/mo) for a $10/mo VPS. 7-day Next.js migration report

https://gist.github.com/Samarth0211/b728534af45242b61b45a87a4ecdf155
1•samarth0211•1h ago•1 comments

Global Energy Flows

https://ig.ft.com/global-energy-flows/
1•saswatms•1h ago•0 comments

Mystery Cpuid Bit

http://www.os2museum.com/wp/mystery-cpuid-bit/
1•userbinator•1h ago•0 comments

Do you ever ask "Please Claude I need this my account is kinda tokenless "

https://engram-three.vercel.app/
1•-Refraction-•1h ago•1 comments

Ask HN: What should a Microblogging Site look like?

1•PiSquareS•1h ago•1 comments

ChatGPT Recommends the Same 3 Companies to Every B2B Buyer. Until They Specify

https://growtika.com/blog/chatgpt-b2b-persona-recommendations
2•Growtika•1h ago•1 comments

Ubuntu 26.04 LTS (Resolute Raccoon)

https://releases.ubuntu.com/resolute/
2•kwar13•1h ago•1 comments

Show HN: Aliasme – A shell script to memorize your commands

https://github.com/Jintin/aliasme
1•Jintin•1h ago•2 comments