frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I asked Gemini for a script to move files to Cloudflare R2. It deleted them

https://twitter.com/levelsio/status/1921974501257912563
6•bundie•8mo ago

Comments

qwertox•8mo ago
Rule #1: Always put deletions behind a flag which is disabled for the first couple of test runs.
turtleyacht•8mo ago
It was truncating filenames, so /pics/1003-46.png overwrote /pics/1003-45.png because both were renamed /pics/1003-.png, or something like that.
qwertox•8mo ago
Truncating file names for the target. Then it proceeded to delete the source file. "Successfully deleted local file: ..."

I mean, look at the printout. It shows that it created the remote file with the truncated filename, then deletes the local file with the correct filename.

turtleyacht•8mo ago
Oh, I see. Having a flag to skip deletion during test runs is a good rule then.
rvz•8mo ago
Recently there was a story about an updater causing a $8,000 bill because there was a lack of basic automated tests to catch the issue. [0]

The big lesson here is that you should actually test the code you write and also write automated tests to check any code generated by an LLM that the code is correct in what it does.

It is also useless to ask another AI to check for mistakes created by another LLM. As you can see in the post, both of them failed to catch the issue.

This why I don't take this hype around 'vibe-coding' seriously since not only it isn't software engineering, it promotes low quality and carelessness over basic testing and dismisses in checking that the software / script works as expected.

Turning $70 problems found in development into $700,000+ costs in production.

There are no more excuses in not adding tests.

[0] https://news.ycombinator.com/item?id=43829006

victorbjorklund•8mo ago
Who runs such an AI generated script without checking the code first?
qwertox•8mo ago
To be fair, the code Gemini outputs in AI Studio is so extremely verbose that it is almost impossible to read through it.

It turns 10 lines of code which is perfectly fine to reason about into 100 lines of unreadable code full of comments and exception handling.

weatherlite•8mo ago
Right so lets just always run the code as is ?
qwertox•8mo ago
No. Not at all. I've settled to discussing my code with Gemini. That way it works very well. I explicitly say "Comment on my code and discuss it" or "Let's discuss code for a script doing this and that. Generate me an outline and let's see where this leads. Don't put comments in the code, nor exception handling, we're just discussing it".

Or you create elaborate System Instructions, since it adheres to them pretty well.

But out-of-the-box, Gemini's coding abilities are unusable due to the verbosity.

I've even gone so far to tell it that it must understand that I am just a human and have limited bandwidth in my brain, so it should write code which is easy to reason about, that this is more important than having it handle every possible exception or adding multiline comments.

rsynnott•8mo ago
> To be fair, the code Gemini outputs in AI Studio is so extremely verbose that it is almost impossible to read through it.

In which case, it should simply be considered unusable. Like, the sensible response to "tool is so inadequate that there is no reasonable way to make sure its output is safe" is to _not use that tool_.

rsynnott•8mo ago
In which Roko's Basilisk fires a warning shot.
jethronethro•8mo ago
This is why you test code or a script before running it for real. Live and learn, I guess ...

Show HN: I built a satellite forensic engine to detect fraud in Carbon Markets

1•kccanarch•2m ago•0 comments

China's Z.ai claims it trained a model using only Huawei hardware

https://www.theregister.com/2026/01/15/zhipu_glm_image_huawei_hardware/
1•50kIters•3m ago•0 comments

Show HN: Matriq – Search inside video files using natural language

https://www.matriq.video/
1•Daviduche03•4m ago•0 comments

MailPilot - get outside while your agents work and you email them

1•keepamovin•6m ago•0 comments

France fines telcos €42M for sub-par security prior to 24M customer breach

https://www.theregister.com/2026/01/14/france_fines_free_free_mobile/
1•pjmlp•10m ago•0 comments

Vibe-ported airfoil design code (XFOIL) from Fortran to JavaScript web app

https://www.vibefoil.com/
1•argon•11m ago•1 comments

Physical Unclonable Function

https://en.wikipedia.org/wiki/Physical_unclonable_function
1•cyanf•11m ago•0 comments

Ask HN: Why AI Code Editors Suck in Closing Tags?

1•cryptography•11m ago•0 comments

Show HN: Mounty – Because I was too lazy to edit fstab

https://github.com/xndbogdan/mounty
1•bobsterlobster•12m ago•0 comments

Show HN: ViralIQ – Get structured feedback on videos before publishing

https://viraliq.app
1•mathewmon•13m ago•0 comments

Video: Fixing North America's Big Elevator Problem

https://www.sightline.org/2026/01/11/video-fixing-north-americas-big-elevator-problem/
1•NN88•15m ago•0 comments

Ffe – Flat File Extractor

https://ff-extractor.sourceforge.net/ffe.html
1•igitur•16m ago•0 comments

Iran hits 144 hours without internet (7 days)

https://twitter.com/netblocks/status/2011480882939400490
1•ukblewis•16m ago•0 comments

The Return of NASA's SpaceX Crew-11

https://www.nasa.gov/
2•runningmike•18m ago•0 comments

Beszel: Simple, Lightweight Server Monitoring

https://beszel.dev/
1•thunderbong•20m ago•0 comments

Classic games of pro dota list

https://classicdota.com/
1•marysminefnuf•22m ago•0 comments

Codex Monitor: An app to minitor your (Codex) situation

https://github.com/Dimillian/CodexMonitor
1•dimillian•23m ago•0 comments

How Prompt Injections Gradually Evolved into a Multi-Step Malware

https://arxiv.org/abs/2601.09625
2•50kIters•25m ago•0 comments

Show HN: Sptfw – (unofficial) Spotify wrapped, how mediocre is your taste?

https://github.com/fwttnnn/sptfw
2•fwttnnn•34m ago•2 comments

Ask HN: What travel apps do you use while traveling?

2•Nora23•35m ago•1 comments

Satellites of Indian startups doomed in ISRO PSLV failure. Were they insured?

https://www.wionews.com/science/6-satellites-of-indian-startups-doomed-in-isro-pslv-failure-were-...
1•akbarnama•35m ago•0 comments

Open Source AI Impact: Japan's Draft "Principle-Code"

https://discuss.opensource.org/t/open-source-ai-impact-japan-s-draft-principle-code-comments-open...
1•totetsu•36m ago•0 comments

We Are Excited About Confessions

https://alignment.openai.com/confessions/
1•TMWNN•38m ago•0 comments

Mira Murati's startup, is losing two of its co-founders to OpenAI

https://techcrunch.com/2026/01/14/mira-muratis-startup-thinking-machines-lab-is-losing-two-of-its...
1•7777777phil•41m ago•0 comments

How the Washington State Legislature Works

https://www.brethorsting.com/blog/2026/01/how-the-washington-state-legislature-works/
2•aaronbrethorst•41m ago•0 comments

Tenor API is shutting down

2•ankushkun_•42m ago•3 comments

What If Your AI Never Forgot? The Claude 4 Memory Experiment

https://www.gptfrontier.com/what-if-your-ai-never-forgot-the-claude-4-memory-experiment/
1•ssengupta3•42m ago•0 comments

Dynamic semantic navigation for app launchers

1•powerwordtree•42m ago•0 comments

You Are Claude Code, Anthropic's Official CLI for Claude

https://fst.wtf/you-are-claude-code-anthropics-official-cli-for-claude
1•fullstacktard•44m ago•0 comments

When a Free Model Beat Claude-Sonnet-4.5

https://askcodi.substack.com/p/when-a-free-model-beat-claude-sonnet
1•himalayansailor•46m ago•0 comments