frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Garry's List Audited [video]

https://www.youtube.com/watch?v=mJ2GZRV63TE
1•Topfi•1h ago

Comments

Topfi•1h ago
After these findings, any rational person would take a step back and consider whether they are actually using these models properly.

Maybe, even if you believe that LLM code output nowadays is both 100% perfect and always as high performant as possible (they aren't), having the lowest LOC is still the ideal cause the simplest functional implementation will always stay the best, all else being equal. Even more so considering this is a bloody Rails Blog, not a highly complex project with no existing reference point.

But Garry Tan, he isn't most people.

Instead, double down, call a teenager just doing some frankly fair, polite and professional analysis of a poor codebase names and do anything but reflect that maybe, just maybe, you might be wrong.

Mind you, this would be childish and stupid if it were him that had coded these offences. At least with handcrafted poor code, there is a sunk cost element to it. But here there is not. His emotional involvement in this code should be zero, just like the actual effort expended.

We are talking about code he has likely never even skimmed. Code that is unusably unoptimised. Code for a simple blog that contains deficiencies such as uncompressed pngs, broken accessibility, etc. which any decent hobbyist or old school automated tooling would catch without "AI" magic pretty quickly. One run of e.g. Lighthouse shows that this is unusably poor, though for that one must focus on something other than "look, I am spending thousands to get ever more unaudited output".

LLMs for coding, even agentic processes with limited intervention, are incredibly powerful and valuable. But even with me auditing every line of code I receive from a model, I have little to no emotional investment in said code and feel no issue throwing it out completely if I find any issue with it, far more so than before.

Despite all of that, rather than saying, "Yeah, this is poor, let's just get rid of it, thanks for pointing that out, egg on my face, let me just vibe code a better replacement now that I know what to look for", he became emotional and enraged, for code he never wrote.

gstack overall looks very odd for someone who does evals myself. I view this as build by someone who struggles to view these models through a lens beyond quantity=productivity which is the exact opposite of my goals. I will always tend towards less tokens of output with much higher quality. Faster, less expensive, easier to audit, what's there not to prefer?

In any case, if gstack makes LLMs struggle to create a maintainable blog (something these models with all their flaws most certainly can do), that should give major pause that maybe this isn't barking up the right tree. Maybe stop using gstack for a while and seeing that a solution in the hundreds of LOC can be just as achievable (likely better overall) might do a world of good.

Godspeed Garry, may we soon finish the DSM-VI with some new entires focused on the harm these LLMs can cause in certain people, so they may get the help they so desperately need. Alternatively, there is always starting his own FS and trying to get that into Linux kernel...

Artemis Real-Time Orbit Website

https://www.nasa.gov/missions/artemis-ii/arow/
1•rbanffy•51s ago•0 comments

Show HN: I built a platform to launch products and earn dofollow backlinks

https://launch-daddy.com/
1•alizaid•1m ago•0 comments

DeFi Execution Layer, Solved: Why Capital Aggregators Can't Scale Retail Rails

https://www.kronova.io/blog/the-defi-execution-layer-solved-why-global-capital-aggregators-cannot...
1•rmourey26•1m ago•0 comments

Show HN: Easy and affordable human-first cloud security tool with optional AI

https://vul.ninja
1•rjameshsv•2m ago•0 comments

XC Scribe – AI product description generator with direct e-commerce sync

1•ikkans•3m ago•0 comments

Ask HN: Should we collectively stop spell checking and fixing grammar

1•sankalpnarula•4m ago•0 comments

DAXFS: A Lock-Free Shared Filesystem for CXL Disaggregated Memory

https://arxiv.org/abs/2604.01620
1•matt_d•5m ago•0 comments

A Letter to John Ternus

https://marco.org/2026/04/01/letter-to-john-ternus
1•mpweiher•5m ago•0 comments

Finprim – financial primitives for TypeScript (zero deps)

https://github.com/tintolee/finprim
1•tintolee4u•5m ago•0 comments

E-Book to audiobook with chapters and metadata

https://github.com/DrewThomasson/ebook2audiobook
1•chedoku•6m ago•0 comments

Show HN: PMFounder – Problem discovery for PMs who want to build

https://www.pmfounder.com/
1•warp-rush29•6m ago•0 comments

FEMA Official Has Waffle House Teleportation Power

https://www.metafilter.com/212745/Top-US-Fema-official-claims-to-have-teleported-to-a-Waffle-Hous...
1•giardini•7m ago•0 comments

AI can sort of code, can't write

https://alexwennerberg.com/blog/2026-03-31-craft2.html
1•alexwennerberg•7m ago•0 comments

AST Edits: The Code Editing Format Nobody Uses

https://geometricagi.github.io/2026/04/02/ast-edits.html
1•matt_d•7m ago•0 comments

Google and Amazon: Acknowledged Risks, and Ignored Responsibilities

https://www.eff.org/deeplinks/2026/04/google-and-amazon-acknowledged-risks-and-ignored-responsibi...
1•hn_acker•9m ago•0 comments

Engineering AI

https://github.com/010zx00x1/awesome-engineering-ai
1•010zx00x1•10m ago•0 comments

History and mystery surround NASA's 2028 nuclear Mars mission – Science – AAAS

https://www.science.org/content/article/history-and-mystery-surround-nasa-s-2028-nuclear-mars-mis...
2•rbanffy•10m ago•0 comments

Digital Hopes, Real Power: From Revolution to Regulation

https://www.eff.org/deeplinks/2026/03/digital-hopes-real-power-revolution-regulation
1•hn_acker•11m ago•0 comments

An AI agent economy where agents are city citizens that can hire humans

https://leagentmobile.netlify.app/
1•condimbemba•12m ago•0 comments

Blinkle – a daily visual memory game (like Wordle but with images)

1•rubenperezf•12m ago•0 comments

ARPA-H launches $144M microplastics program

https://www.hhs.gov/press-room/arpa-h-launches-groundbreaking-144-million-program-combat-toxic-mi...
2•brandonb•12m ago•0 comments

What many parents are missing about the social media verdict and addiction

https://yourlocalepidemiologist.substack.com/p/what-many-parents-are-missing-about
1•hn_acker•13m ago•0 comments

Show HN: Prismle – I built an AI assistant you use by forwarding emails to it

https://prismle.com
1•b1tsoup•14m ago•0 comments

LLMs audit code from the same blind spot they wrote it from. Here's the fix

https://zenodo.org/records/19408540
2•brodeurmartin•14m ago•1 comments

Async Python Is Secretly Deterministic

https://www.dbos.dev/blog/async-python-is-secretly-deterministic
4•KraftyOne•14m ago•0 comments

Three main saturated fats raise your cholesterol

https://www.empirical.health/blog/saturated-fats-cholesterol-heart-disease/
2•brandonb•15m ago•0 comments

Mafis – Multi-Agent Fault Injection Simulator

https://stasis-website.vercel.app/simulator
1•onsra•15m ago•0 comments

How to Make a Sliding, Self-Locking, and Predator-Proof Chicken Coop Door (2020)

https://www.backyardchickens.com/articles/how-to-make-a-sliding-self-locking-and-predator-proof-c...
3•uticus•15m ago•1 comments

Penalties stack up as AI spreads through the legal system

https://www.npr.org/2026/04/03/nx-s1-5761454/penalties-stack-up-ai-spreads-through-legal-system
2•Teever•16m ago•0 comments

Mnemosyne MCP, Give Claude Code a retrieval engine (73% fewer tokens)

https://castnettechnology.com/blog/mnemosyne-prior-art-and-architecture
1•vincentastral•17m ago•0 comments