frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Field Notes from Shipping Real Code with Claude

https://diwank.space/field-notes-from-shipping-real-code-with-claude
81•diwank•8h ago

Comments

kasey_junk•4h ago
One of the exciting things to me about the ai agents is how they push and allow you to build processes that we’ve always known were important but were frequently not prioritized in the face of shipping the system.

You can use how uncomfortable you are with the ai doing something as a signal that you need to invest in systematic verification of that something. As a for instance in the link, the team could build a system for verifying and validating their data migrations. That would move a whole class of changes into the ai relm.

This is usually much easier to quantify and explain externally than nebulous talk about tech debt in that system.

diwank•31m ago
For sure. Another interesting trick I found to be surprisingly effective is to ask Claude Code to “Look around the codebase, and if something is confusing, or weird/counterintuitive — drop a AIDEV-QUESTION: … comment so I can document that bit of code and/or improve it”. We found some really gnarly things that had been forgotten in the codebase.
bitwize•2h ago
Coming up next on Hackernews: Field Notes from Eating Real Pizza Made with Claude:

"The glue adds a flavor and texture profile that traditionalists may not be used to on their cheese pizza. But I've had Michelin-star quality pizzas without glue in them that weren't half as delicious as this one was with. AI-mediated glue-za is the future of pizza, no doubt about it."

sdorf•2h ago
The whole point seems to be how to get the most out of today's tooling without "glue getting in your pizza". It's a little flag-wavy (probably because of the author's company) but overall seemed like a pretty candid peek into how it's being used. Did you have a specific critique?
diwank•2h ago
Feedback appreciated! Will tone it down; did not intend it to be too much about our company, just that it is the codebase I mostly hack on. :)
diwank•2h ago
Author here: To be honest, I know there are like a bajillion Claude code posts out there these days.

But, there are a few nuggets we figured are worth sharing, like Anchor Comments [1], which have really made a difference:

——

  # CLAUDE.md

  ### Anchor comments

  Add specially formatted comments throughout the codebase, where appropriate, for yourself as inline knowledge that can be easily `grep`ped for.

  - Use `AIDEV-NOTE:`, `AIDEV-TODO:`, or `AIDEV-QUESTION:` as prefix as appropriate.

  - *Important:* Before scanning files, always first try to grep for existing `AIDEV-…`.

  - Update relevant anchors, after finishing any task.

  - Make sure to add relevant anchor comments, whenever a file or piece of code is:

  * too complex, or  
  * very important, or  
  * could have a bug 
——

[1]: https://diwank.space/field-notes-from-shipping-real-code-wit...

meeech•2h ago
Honest question: approx what percent of the post was human vs machine written?
diwank•1h ago
I’d say around ~40% me, the ideating, editing, citations, and images are all mine; rest Opus 4 :)

I typically try to also include the original Claude chat’s link in the post but it seems like Claude doesn’t allow sharing chats with deep research used in them.

Update: here’s an older chatgpt conversation while preparing this: https://chatgpt.com/share/6844eaae-07d0-8001-a7f7-e532d63bf8...

GiorgioG•1h ago
Again, This is not something to be proud of.
diwank•46m ago
I don’t understand why not. I’m not a natural prose writer, but (I felt that) these ideas were worth putting out there.

I posted on HN largely to get feedback.

meeech•20m ago
thanks. to be clear, I'm not asking the q to be particularly negative about it. Its more just curiosity, mixed with trade in effort. If you wrote it 100%, I'm more inclined to read the whole thing. vs say now just feeding it back to the GPM to extract the condensed nuggets.
meeech•1h ago
Q: How do you ensure tests are only written by humans? Basically just the honor system?
diwank•1h ago
You can:

1. Add instructions in CLAUDE.md to not touch tests.

2. Disallow the Edit tool for test directories in the project’s .claude/settings.json file

meeech•26m ago
Disallow edit in test dirs is a good tip. thanks.

I meant though in the wider context of the team - everyone uses it but not everyone will work the same, use the same underlying prompts as they work. So how do you ensure everyone keeps to that agreement?

peter422•34m ago
Just to provide a contrast to some of the negative comments…

As a very experienced engineer who uses LLMs sporadically* and not in any systematic way, I really appreciated seeing how you use them in production in a real project. I don’t know why people are being negative, you just mentioned your project in details where it was appropriate to talk about the structure of it. Doesn’t strike me as gratuitous self promotion at all.

Your post is giving me a motivation to empower the LLMs a little bit more in my workflows.

*: They absolutely don’t get the keys to my projects but I have had great success with having them complete specific tasks.

diwank•27m ago
Really appreciate the kind words! I did not intend the post to be too much about our company, just that it is the codebase I mostly hack on. :)
djrockstar1•1h ago
Pretty disingenuous to emphasize "building a culture of transparency" while simultaneously not disclosing how heavily AI was [very evidently] used in writing this post.
diwank•1h ago
I’d say around ~40% me, the ideating, editing, citations, and images are all mine; rest Opus 4 :)

I typically try to also include the original Claude chat’s link in the post but it seems like Claude doesn’t allow sharing chats with deep research used in them.

See this series of posts for example, I have included the link right at the beginning: https://diwank.space/juleps-vision-levels-of-intelligence-pt...

I completely get the critique and I already talked about it earlier: https://news.ycombinator.com/item?id=44213823

Update: here’s an older chatgpt conversation while preparing this: https://chatgpt.com/share/6844eaae-07d0-8001-a7f7-e532d63bf8...

GiorgioG•1h ago
> I’d say around ~40% me, the ideating, editing, citations, and images are all mine; rest Opus 4 :)

That's not something to be proud of.

diwank•47m ago
Why not? This way at least the idea gets out there. Otherwise I’d have never come around to writing this.
GiorgioG•8m ago
And we’d have been all better off not having read it.
GiorgioG•1h ago
This is the 2nd time today something has shown up on the front-page like this, come on people cut the shit, from the HN guidelines:

"Please don't use HN primarily for promotion. It's ok to post your own stuff part of the time, but the primary use of the site should be for curiosity."

Artoooooor•57m ago
I finally decided few days ago to try this Claude Code thing in my personal project. It's depressingly efficient. And damn expensive - I used over 10 dollars in one day. But I'm afraid it is inevitable - I will have to pay tax to AI overlords just to be able to keep my job.
Syzygies•44m ago
I was looking at $2,000 a year and climbing, before Anthropic announce $100 and $200 Max subscriptions that bundled Claude Console and Claude Code. There are limits per five hour windows, but one can toggle back to metered API with the login/ command, or just walk the dog. $100 a month has done me fine.
diwank•36m ago
Same. I ran out on the $200 one too yesterday. It’s skyrocketed after Opus 4. Nothing else comes close
wonger_•51m ago
Some thoughts:

- Is there a more elegant way to organize the prompts/specifications for LLMs in a codebase? I feel like CLAUDE.md, SPEC.mds, and AIDEV comments would get messy quickly.

- What is the definition of "vibe-coding" these days? I thought it refers to the original Karpathy quote, like cowboy mode, where you accept all diffs and hardly look at code. But now it seems that "vibe-coding" is catch-all clickbait for any LLM workflow. (Tbf, this title "shipping real code with Claude" is fine)

- Do you obfuscate any code before sending it to someone's LLM?

diwank•42m ago
> - Is there a more elegant way to organize the prompts/specifications for LLMs in a codebase? I feel like CLAUDE.md, SPEC.mds, and AIDEV comments would get messy quickly.

Yeah, the comments do start to pile up. I’m working on a vscode extension that automatically turns them into tiny visual indicators in the gutter instead.

> - What is the definition of "vibe-coding" these days? I thought it refers to the original Karpathy quote, like cowboy mode, where you accept all diffs and hardly look at code. But now it seems that "vibe-coding" is catch-all clickbait for any LLM workflow. (Tbf, this title "shipping real code with Claude" is fine)

Depends on who you ask ig. For me, hasn’t been a panacea, and I’ve often run into issues (3.7 sonnet and codex have had ~60% success for me but Opus 4 is actually v good)

> - Do you obfuscate any code before sending it to someone's LLM?

In this case, all of it was open source to begin with but good point to think about.

lispisok•43m ago
I think most of this is good stuff but I disagree with not letting Claude touch tests or migrations at all. Handing writing tests from scratch is the part I hate the most. Having an LLM do a first pass on tests which I add to and adjust as I see fit has been a big boon on the testing front. It seems the difference between me and the author is I believe whether code was generated by an LLM or not the human still takes ownership and responsibility. Not letting Claude touch tests and migrations is saying you rightfully dont trust Claude but are giving ownership to Claude for Claude generated code. That or he doesn't trust his employees to not blindly accept AI slop, the strict rules around tests and migrations is to prevent the AI slop from breaking everything or causing data loss.
diwank•39m ago
True but, in my experience, a few major pitfalls that happened:

1. We ran into really bad minefields when we tried to come back to manually edit the generated tests later on. Claude tended to mock everything because it didn’t have context about how we run services, build environments, etc.

2. And this was the worst, all of the devs on the team including me got realllyy lazy with testing. Bugs in production significantly increased.

Joining Apple Computer (2018)

https://www.folklore.org/Joining_Apple_Computer.html
170•tosh•6h ago•38 comments

Convert photos to Atkinson dithering

https://gazs.github.io/canvas-atkinson-dither/
257•nvahalik•6h ago•30 comments

Bill Atkinson has died

https://daringfireball.net/linked/2025/06/07/bill-atkinson-rip
1047•romanhn•10h ago•198 comments

Self-Host and Tech Independence: The Joy of Building Your Own

https://www.ssp.sh/blog/self-host-self-independence/
144•articsputnik•9h ago•53 comments

My experiment living in a tent in Hong Kong's jungle

https://corentin.trebaol.com/Blog/8.+The+Homelessness+Experiment
189•5mv2•10h ago•65 comments

BorgBackup 2 has no server-side append-only anymore

https://github.com/borgbackup/borg/pull/8798
123•jaegerma•8h ago•71 comments

The time bomb in the tax code that's fueling mass tech layoffs

https://qz.com/tech-layoffs-tax-code-trump-section-174-microsoft-meta-1851783502
1338•booleanbetrayal•3d ago•840 comments

What was Radiant AI, anyway?

https://blog.paavo.me/radiant-ai/
148•paavohtl•13h ago•93 comments

Should I Use a Carousel? (2013)

https://shouldiuseacarousel.com/
40•coffeecoders•1h ago•14 comments

Updates to Advanced Voice Mode for paid users

https://help.openai.com/en/articles/6825453-chatgpt-release-notes
43•mfiguiere•6h ago•31 comments

Discovering a JDK Race Condition, and Debugging It in 30 Minutes with Fray

https://aoli.al/blogs/jdk-bug/
65•aoli-al•8h ago•8 comments

Low-Level Optimization with Zig

https://alloc.dev/2025/06/07/zig_optimization
240•Retro_Dev•19h ago•125 comments

Washington Post's Privacy Tip: Stop Using Chrome, Delete Meta Apps (and Yandex)

https://tech.slashdot.org/story/25/06/07/035249/washington-posts-privacy-tip-stop-using-chrome-delete-metas-apps-and-yandex
283•miles•10h ago•149 comments

Why Understanding Software Cycle Time Is Messy, Not Magic

https://arxiv.org/abs/2503.05040
15•SiempreViernes•6h ago•2 comments

You need much less memory than time

https://blog.computationalcomplexity.org/2025/02/you-need-much-less-memory-than-time.html
63•jonbaer•5h ago•7 comments

An innovative superfamily of fonts for code (2023)

https://monaspace.githubnext.com/
31•laex•2h ago•5 comments

Why We're Moving on from Nix

https://blog.railway.com/p/introducing-railpack
202•mooreds•15h ago•96 comments

Researchers develop ‘transparent paper’ as alternative to plastics

https://japannews.yomiuri.co.jp/science-nature/technology/20250605-259501/
382•anigbrowl•1d ago•233 comments

A tool for burning visible pictures on a compact disc surface

https://github.com/arduinocelentano/cdimage
138•carlesfe•18h ago•47 comments

How we decreased GitLab repo backup times from 48 hours to 41 minutes

https://about.gitlab.com/blog/2025/06/05/how-we-decreased-gitlab-repo-backup-times-from-48-hours-to-41-minutes/
521•immortaljoe•1d ago•218 comments

Getting Past Procrastination

https://spectrum.ieee.org/getting-past-procastination
302•WaitWaitWha•23h ago•138 comments

Musk-Trump dispute includes threats to SpaceX contracts

https://spacenews.com/musk-trump-dispute-includes-threats-to-spacex-contracts/
186•rbanffy•13h ago•358 comments

The FAIR Package Manager: Decentralized WordPress infrastructure

https://joost.blog/path-forward-for-wordpress/
187•twapi•22h ago•43 comments

Math Symbol Frequencies

https://leancrew.com/all-this/2025/06/math-symbol-frequencies/
24•tosh•6h ago•10 comments

OneText (YC W23) Is Hiring a DevOps/DBA Lead Engineer

https://jobs.ashbyhq.com/one-text/b95952a2-9bc2-4c3a-9da1-3dcc157b4a27
1•bluepnume•10h ago

I read all of Cloudflare's Claude-generated commits

https://www.maxemitchell.com/writings/i-read-all-of-cloudflares-claude-generated-commits/
220•maxemitchell•1d ago•209 comments

Field Notes from Shipping Real Code with Claude

https://diwank.space/field-notes-from-shipping-real-code-with-claude
81•diwank•8h ago•29 comments

A year of funded FreeBSD development

https://www.daemonology.net/blog/2025-06-06-A-year-of-funded-FreeBSD.html
343•cperciva•1d ago•112 comments

Hate Radio (2011)

https://rwandanstories.org/genocide/hate_radio.html
125•thomassmith65•12h ago•105 comments

Why are smokestacks so tall?

https://practical.engineering/blog/2025/6/3/why-are-smokestacks-so-tall
177•azeemba•1d ago•43 comments