frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I asked Gemini for a script to move files to Cloudflare R2. It deleted them

https://twitter.com/levelsio/status/1921974501257912563
6•bundie•1y ago

Comments

qwertox•1y ago
Rule #1: Always put deletions behind a flag which is disabled for the first couple of test runs.
turtleyacht•1y ago
It was truncating filenames, so /pics/1003-46.png overwrote /pics/1003-45.png because both were renamed /pics/1003-.png, or something like that.
qwertox•1y ago
Truncating file names for the target. Then it proceeded to delete the source file. "Successfully deleted local file: ..."

I mean, look at the printout. It shows that it created the remote file with the truncated filename, then deletes the local file with the correct filename.

turtleyacht•1y ago
Oh, I see. Having a flag to skip deletion during test runs is a good rule then.
rvz•1y ago
Recently there was a story about an updater causing a $8,000 bill because there was a lack of basic automated tests to catch the issue. [0]

The big lesson here is that you should actually test the code you write and also write automated tests to check any code generated by an LLM that the code is correct in what it does.

It is also useless to ask another AI to check for mistakes created by another LLM. As you can see in the post, both of them failed to catch the issue.

This why I don't take this hype around 'vibe-coding' seriously since not only it isn't software engineering, it promotes low quality and carelessness over basic testing and dismisses in checking that the software / script works as expected.

Turning $70 problems found in development into $700,000+ costs in production.

There are no more excuses in not adding tests.

[0] https://news.ycombinator.com/item?id=43829006

victorbjorklund•1y ago
Who runs such an AI generated script without checking the code first?
qwertox•1y ago
To be fair, the code Gemini outputs in AI Studio is so extremely verbose that it is almost impossible to read through it.

It turns 10 lines of code which is perfectly fine to reason about into 100 lines of unreadable code full of comments and exception handling.

weatherlite•1y ago
Right so lets just always run the code as is ?
qwertox•1y ago
No. Not at all. I've settled to discussing my code with Gemini. That way it works very well. I explicitly say "Comment on my code and discuss it" or "Let's discuss code for a script doing this and that. Generate me an outline and let's see where this leads. Don't put comments in the code, nor exception handling, we're just discussing it".

Or you create elaborate System Instructions, since it adheres to them pretty well.

But out-of-the-box, Gemini's coding abilities are unusable due to the verbosity.

I've even gone so far to tell it that it must understand that I am just a human and have limited bandwidth in my brain, so it should write code which is easy to reason about, that this is more important than having it handle every possible exception or adding multiline comments.

rsynnott•1y ago
> To be fair, the code Gemini outputs in AI Studio is so extremely verbose that it is almost impossible to read through it.

In which case, it should simply be considered unusable. Like, the sensible response to "tool is so inadequate that there is no reasonable way to make sure its output is safe" is to _not use that tool_.

rsynnott•1y ago
In which Roko's Basilisk fires a warning shot.
jethronethro•1y ago
This is why you test code or a script before running it for real. Live and learn, I guess ...

HN isn't swamped yet, just AI-obsessed

https://www.mahl.me/blog/hacker-news-isnt-swamped-yet/
1•gorgmah•31s ago•0 comments

Scott Aaronson – The Truth About Quantum Computing [video]

https://www.youtube.com/watch?v=cq4atriB-Rc
1•nill0•2m ago•0 comments

Show HN: Runner – desktop app for running Claude Code, Codex as a crew

https://github.com/yicheng47/runner
2•yicheng47•4m ago•0 comments

Macro Wall Display

https://kensingtonhomes.uk
1•postmanag•5m ago•0 comments

I built Search Engine $100 business ideas filtered by budget,niche and AI Tools

https://onehundredbiz.com
1•DaveOne•5m ago•0 comments

Running Gitea Runner with Rootless Podman

https://www.nite07.com/en/posts/quadlet-gitea-runner-podman/
1•speckx•6m ago•0 comments

An AI system to help scientists write expert-level empirical software

https://arxiv.org/abs/2509.06503
1•cyco130•7m ago•0 comments

Plan to declare Dominion Voting Systems machines national security risks fails

https://twitter.com/ErinBanco/status/2057799461435220399
1•cf100clunk•9m ago•1 comments

Introducing the Godot Asset Store

https://godotengine.org/article/introducing-the-godot-asset-store/
2•makepanic•10m ago•0 comments

The Marquis, the Island, the Diary, and the Deal: The Casati Stampa Murders

https://www.utterlyinteresting.com/post/the-marquis-the-island-the-diary-and-the-deal-the-casati-...
1•amarcheschi•10m ago•0 comments

How to Call an API from an Email

https://redo.com/eng-blog/how-to-call-an-api-from-an-email/
1•tyurok•12m ago•0 comments

App Store stopped $2.2B in potentially fraudulent transactions in 2025

https://www.apple.com/newsroom/2026/05/the-app-store-stopped-over-2-point-2-billion-usd-in-fraudu...
1•CharlesW•13m ago•0 comments

A Marketplace of Fine Tuned SLMs for Agentic Tasks

https://marketplace.neurometric.ai/
2•robmay•14m ago•1 comments

Just Use Opus

https://ai.nevolin.be/just-use-opus
1•ilja-nevo•15m ago•0 comments

Bytecode VMs in surprising places (2024)

https://dubroy.com/blog/bytecode-vms-in-surprising-places/
1•azhenley•16m ago•0 comments

See the clouds streaming and vanishing around this planet – 690 light years away

https://www.nature.com/articles/d41586-026-01608-3
1•leephillips•16m ago•0 comments

Show HN: Glimpse, Markdown reader using Apple's on-device foundation model

https://apps.apple.com/us/app/glimpse-markdown-viewer/id6761304904?mt=12
2•duman•17m ago•0 comments

Self Hosting Passwords

https://chuck.is/passwords/
1•speckx•17m ago•1 comments

Ask HN: Are LLMs creating busy work?

4•m3h•20m ago•3 comments

Engineering Is Not Dead, Because Accountability Isn't

https://paolino.me/engineering-is-not-dead/
2•earcar•20m ago•2 comments

Cursor hits $3B in revenue and now has 3K+ customers paying at least $100K each

https://www.bloomberg.com/news/articles/2026-05-21/cursor-hits-3-billion-annual-sales-rate-ahead-...
1•thoughtpeddler•20m ago•0 comments

Apple Sports expands to more than 90 new countries and regions

https://www.apple.com/newsroom/2026/05/apple-sports-expands-to-more-than-90-new-countries-and-reg...
2•tosh•21m ago•0 comments

Neoclassical C++: segmented iterators revisited

https://boostedcpp.net/2026/05/18/neoclassical-c-segmented-iterators-revisited-1/
1•ibobev•21m ago•0 comments

Sleep helps brain clean Alzheimer's-linked toxins, study says

https://www.ft.com/content/d33c6e3f-3162-44d3-8de2-2ff0b555217e
1•bookofjoe•21m ago•1 comments

A world from a sheet of paper – Tadashi Tokieda [video]

https://www.youtube.com/watch?v=8p02DtmyQhU
2•nill0•22m ago•0 comments

Sora shutdown leaves Critterz at the Cannes market without its model

https://thenextweb.com/news/critterz-misses-cannes-openai-sora-shutdown
3•thm•23m ago•1 comments

Ceres, an open copilot for VS Code with budget and local LLMs support

https://marketplace.visualstudio.com/items?itemName=pa-andreas.skia-ai-sidebar
2•libandreas•23m ago•1 comments

Ohio data center tax break cost $1.4B more than expected in 2025

https://signalohio.org/ohio-data-center-tax-break-cost-1-4-billion-more-than-expected-in-2025/
1•mooreds•24m ago•0 comments

Deno 2.8

https://deno.com/blog/v2.8#task-runner
6•soheilpro•24m ago•0 comments

Building OpenWrt for the Seeed Studio WM6108 802.11ah HaLow Radio

https://www.beyondlogic.org/building-openwrt-for-the-seeed-studio-wm6108-802-11ah-halow-radio/
1•speckx•24m ago•0 comments