frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I asked Gemini for a script to move files to Cloudflare R2. It deleted them

https://twitter.com/levelsio/status/1921974501257912563
6•bundie•8mo ago

Comments

qwertox•8mo ago
Rule #1: Always put deletions behind a flag which is disabled for the first couple of test runs.
turtleyacht•8mo ago
It was truncating filenames, so /pics/1003-46.png overwrote /pics/1003-45.png because both were renamed /pics/1003-.png, or something like that.
qwertox•8mo ago
Truncating file names for the target. Then it proceeded to delete the source file. "Successfully deleted local file: ..."

I mean, look at the printout. It shows that it created the remote file with the truncated filename, then deletes the local file with the correct filename.

turtleyacht•8mo ago
Oh, I see. Having a flag to skip deletion during test runs is a good rule then.
rvz•8mo ago
Recently there was a story about an updater causing a $8,000 bill because there was a lack of basic automated tests to catch the issue. [0]

The big lesson here is that you should actually test the code you write and also write automated tests to check any code generated by an LLM that the code is correct in what it does.

It is also useless to ask another AI to check for mistakes created by another LLM. As you can see in the post, both of them failed to catch the issue.

This why I don't take this hype around 'vibe-coding' seriously since not only it isn't software engineering, it promotes low quality and carelessness over basic testing and dismisses in checking that the software / script works as expected.

Turning $70 problems found in development into $700,000+ costs in production.

There are no more excuses in not adding tests.

[0] https://news.ycombinator.com/item?id=43829006

victorbjorklund•8mo ago
Who runs such an AI generated script without checking the code first?
qwertox•8mo ago
To be fair, the code Gemini outputs in AI Studio is so extremely verbose that it is almost impossible to read through it.

It turns 10 lines of code which is perfectly fine to reason about into 100 lines of unreadable code full of comments and exception handling.

weatherlite•8mo ago
Right so lets just always run the code as is ?
qwertox•8mo ago
No. Not at all. I've settled to discussing my code with Gemini. That way it works very well. I explicitly say "Comment on my code and discuss it" or "Let's discuss code for a script doing this and that. Generate me an outline and let's see where this leads. Don't put comments in the code, nor exception handling, we're just discussing it".

Or you create elaborate System Instructions, since it adheres to them pretty well.

But out-of-the-box, Gemini's coding abilities are unusable due to the verbosity.

I've even gone so far to tell it that it must understand that I am just a human and have limited bandwidth in my brain, so it should write code which is easy to reason about, that this is more important than having it handle every possible exception or adding multiline comments.

rsynnott•8mo ago
> To be fair, the code Gemini outputs in AI Studio is so extremely verbose that it is almost impossible to read through it.

In which case, it should simply be considered unusable. Like, the sensible response to "tool is so inadequate that there is no reasonable way to make sure its output is safe" is to _not use that tool_.

rsynnott•8mo ago
In which Roko's Basilisk fires a warning shot.
jethronethro•8mo ago
This is why you test code or a script before running it for real. Live and learn, I guess ...

The Internet Archive Crawler

https://github.com/internetarchive/heritrix3
1•dvrp•10m ago•0 comments

US State Department Threatens UK over Probe into Elon Musk's X

https://www.politico.eu/article/us-state-department-threaten-uk-probe-elon-musk-x-grok/
3•saubeidl•12m ago•0 comments

Ask HN: How do you apply for jobs in the age of AI?

1•surrTurr•12m ago•0 comments

I've created a prototype for the front-end of a website inside an AI chatbot

1•5color•13m ago•0 comments

Claude Cowork Runs Linux VM via Apple Virtualization Framework

https://gist.github.com/simonw/35732f187edbe4fbd0bf976d013f22c8
2•jumploops•17m ago•0 comments

Show HN: Gilda runs multiple LLMs, compares them, and merges the result

https://gildaapp.com/
1•osgohe•22m ago•1 comments

When competition leads to human values by Beren Millidge [video]

https://www.youtube.com/watch?v=ua67aXBP76k
1•ljosifov•22m ago•0 comments

VoidLink: The Cloud-Native Malware Framework Weaponizing Linux Infrastructure

https://blog.checkpoint.com/research/voidlink-the-cloud-native-malware-framework-weaponizing-linu...
2•laktak•23m ago•0 comments

McKinsey challenges graduates to use AI chatbot in recruitment overhaul

https://www.ft.com/content/de7855f0-f586-4708-a8ed-f0458eb25586
1•jorisboris•28m ago•1 comments

PulseDaily – a local‑first, no‑login habit tracker with gentle reminders

https://pulsedaily.codezs.online
1•fcxl•29m ago•1 comments

PartyBench: AI throws a house party and is graded on its performance [SATIRE]

https://www.astralcodexten.com/p/sota-on-bay-area-house-party
1•ryan_j_naughton•30m ago•0 comments

Static sites from shell (part 1/2) – feeling the html.energy (2022)

https://www.evalapply.org/posts/shite-the-static-sites-from-shell-part-1/index.html
1•adityaathalye•34m ago•0 comments

Tesla will stop selling FSD after Feb 14

https://twitter.com/elonmusk/status/2011324998653513810
5•0xedb•35m ago•0 comments

In the DOM We Trust: The Hidden Dangers of Reading the DOM on the Web [pdf]

https://trouge.net/papers/in_the_dom_we_trust_ccs25.pdf
1•ArneVogel•37m ago•1 comments

Multi-Service Debug Sandbox

https://github.com/kp7008/multi-service-debug-sandbox
2•kp7008•41m ago•0 comments

Show HN: Go-CoreML – Go Bindings for Apple's CoreML with Neural Engine Support

https://github.com/gomlx/go-coreml
3•kingcauchy•43m ago•0 comments

HTTP RateLimit Headers

https://dotat.at/@/2026-01-13-http-ratelimit.html
2•ingve•44m ago•0 comments

Why Arab states are silent about Iran's unrest

https://www.economist.com/middle-east-and-africa/2026/01/13/why-arab-states-are-silent-about-iran...
5•ryan_j_naughton•46m ago•0 comments

Documented Alaska Airlines loyalty thefts shows architectural failure

https://www.noseyparker.org/p/alaska-airlines-where-the-top-customers
5•NoseyParker•52m ago•1 comments

Volvo Will Make You Safer with Only a Font

https://www.motortrend.com/news/volvo-safety-typeface-font-easy-read
3•qsi•52m ago•1 comments

You vs. a Billionaire: An Interactive Perspective on Wealth

https://www.budgetflow.cc/blog/you-compared-to-elon-musk
4•mkrd•54m ago•1 comments

We optimized Socket.IO for real-time SaaS analytics

https://saasscout.online/
2•zoey922•54m ago•0 comments

An Architecture for Verifiable Data Collection and Proof-of-Check Timestamping

https://www.researchgate.net/publication/399711443_A_Libre_Architecture_for_Verifiable_Data_Colle...
2•cedricbonhomme•56m ago•0 comments

NASA's SpaceX Crew-11 Go for Undocking on Wednesday

https://www.nasa.gov/blogs/commercialcrew/2026/01/13/nasas-spacex-crew-11-go-for-undocking-on-wed...
2•akg130522•57m ago•0 comments

Agent OS

https://buildermethods.com/agent-os
2•evo_9•57m ago•1 comments

Nvim-beads: Manage beads in Neovim

https://joeblu.com/blog/2026_01_introducing-nvim-beads-manage-beads-in-neovim/
2•joeblubaugh•58m ago•0 comments

Rapid Serial Visual Presentation (RSVP) reader for speed reading

https://github.com/thomaskolmans/rsvp-reading
3•yownie•58m ago•2 comments

Cells use 'Bioelectricity' to coordinate and make group decisions

https://www.quantamagazine.org/cells-use-bioelectricity-to-coordinate-and-make-group-decisions-20...
3•ashishgupta2209•58m ago•0 comments

New set of icons for Apple Creator Studio apps

https://www.reddit.com/r/MacOS/s/NC7iDJu9MS
1•cromka•1h ago•3 comments

The Joy of Not Learning: How AI Saves My Hobby Projects

https://harichetlur.com/blog/the-joy-of-not-learning-how-ai-saves-my-hobby-projects/
4•harichetlur•1h ago•4 comments