AI Coding: A Sober Review

https://www.ubicloud.com/blog/ai-coding-a-sober-review

16•furkansahin•1h ago

Comments

CuriouslyC•1h ago

A vibe article on vibe coding.

softwaredoug•1h ago

This space is filled with personal anecdotes and studies from providers. It's hard to get objective perspectives from independent labs.

troupo•1h ago

It's hard to go beyond anecdotes because it's impossible to measure outcomes objectively.

shikharbhardwaj•1h ago

Hi! Author of the blog post here.

I completely agree, getting an objective measure for the developer experience from these various tools is not easy. On one hand, you have a series of benchmarks from LLM providers. While reflecting some degree of fitness to specific tasks, they often fail to translate to real-world usage. On the other hand, you have the tool providers with different features and product claims, and user anecdotes for very different use-cases.

The attempt with this post was to summarize my experience across some of these tools and highlight some specific features which worked better for me vs others. Given how quickly things are changing in this space, the primary conclusion is that using a tool day-to-day, discovering its strengths and deficiencies and working to eliminate the ones with high hit-rate is best at this point.

ozgune•1h ago

(Disclaimer: Ozgun from Ubicloud)

I agree with you. I feel the challenge is that using AI coding tools is still an art, and not a science. That's why we see many qualitative studies that sometimes conflict with each other.

In this case, we found the following interesting. That's why we nudged Shikhar to blog about his experience and put a disclaimer at the top.

* Our codebase is in Ruby and follows a design pattern uncommon industry * We don't have a horse in this game * I haven't seen an evaluation that evaluates coding tools in (a) coding, (b) testing, and (c) debugging dimension

ExxKA•1h ago

I am none the wiser. How do I get my 5 minutes back?

GardenLetter27•1h ago

This reads like an advert for Continue.dev

willahmad•1h ago

Here's my experience with these tools:

Good: I can prototype things very quickly thanks to these tools

Bad: After couple of vibe coding iterations, I don't have a mental model of the project.

Good: When I open my past projects where I have very good mental models, I can come up with a nice prompt and build anything quickly again.

Bad: After couple of iterations I become lazy, and eventually my mental models break.

There's definitely a use for these tools. But be careful, job of engineers are not only coding but also training their memory to build solutions and bridge real world problem with software solution. If you lose this skill of thinking, you will be obsolete quickly

accrual•55m ago

This matches my experience as well. When I'm working on a codebase that I started and know well, it feels like magic to chat with an AI and watch patches appear on the screen to accept/deny. I only accept about 50% of the AI patches before tweaks because it's my project and I care about keeping on the track I laid out.

When I'm vibe coding something from scratch I don't have the mental model, I don't always review everything closely, and eventually it becomes an "AI project" that I'm just making requests against to hopefully achieve my goal.

softwaredoug•33m ago

And when you lose your mental model it’s harder to prompt the LLM for good code.

Ask HN: Is Claude Code less useful in recent weeks for you?

Online therapy for expats and digital nomads

Show HN: Xiaoniao – Paste-as-Translation (Go and AI)

Not Buying American Anymore

Startup Working to Bring Back Dodo Bird Raises $120M

Microbial iron oxide respiration coupled to sulfide oxidation

Giant Subterranean Neutrino Detector Is Taking on the Mysteries of Physics

A Fusion-Reactor-Inspired Thruster Could Deorbit Space Junk

Generating Blue Noise Sample Points with Mitchell's Best Candidate Algorithm

What It's Like to Work Inside a Broken CDC

Frivolous, unethical and unjustifiable- SF agency misspent $4.6M audit finds

Tell HN: Discord is apparently rolling out age verification for EU/EEA residents

Did NASA's Perseverance rover find evidence of ancient life on Mars?

Screen readers do not need to be saved by AI

Moving off of TypeScript, 2.5M lines of code

A Guide to Midnight Commander (2012)

Show HN: I did a 4 hour conversational audiobook on the history of data centers

Gmail Mail Delivery Subsystem Being Used for Spam Delivery Bypassing Filters

What can we learn from Spotify layoffs? (2024)

Fiverr cuts 30% of staff in pivot to 'AI-first'

All the Sad Young Terminally Online Men

Distributed Training of LLM's: A Survey

AI companion futures osmarks' website

Ads are coming to a Samsung smart fridge near you

Knitted Anatomy

The Rye Resurgence Project: An Origin Story

Show HN: A GPT Realtime Web Game Where You Convince Aliens Not to Invade

AlphaEarth Provides New Ways to See, and Understand, Earth

RTCW: One source port for all Return to Castle Wolfenstein games

Single device amplifies signals while shielding qubits from unwanted noise