Ask HN: Gemini 3 and the stagnation of coding agents, what gives?

3•akira_067•2mo ago

Gemini 3 is cool. Sure. Gemini 3 seems to be a strong model capable at everything you'd want. Long context, good ui design, good awareness of the codebase, and a strong ability to make decisions.

What is strange to me is that despite all of this, and despite changes for GPT5-codex, claude 4.5 etc.

We still seem to see limitations in coding agents. Where are the coding agents that I can actually work with for 30 hours? Where are the coding agents that I can treat as a thought partner?

The dream seems to slowly be moving further away from believability despite actually getting closer to said goal.

What gives? Why are we not seeing true improvements across the board? Why is UX still stuck at "Chatbot in a loop with tools"?

Comments

muixoozie•2mo ago

>good awareness of the codebase

Wondering what you're using here?

pancsta•2mo ago

> coding agents that I can treat as a thought partner?

Search engines dont think, search engines match.

akira_067•2mo ago

You are a search engine of sorts no?

Search does not necessarily meant "retrieval". It means search, over some space. That space can include ideas or options that are recombinations of other ideas and options. And the search heuristics can be learned I have no doubt, as those can be essentially pretrained.

What were the first animals? The fierce sponge–jelly battle that just won't end

Sidestepping Evaluation Awareness and Anticipating Misalignment

OldMapsOnline

What It's Like to Be a Worm

Don't go to physics grad school and other cautionary tales

Lawyer sets new standard for abuse of AI; judge tosses case

AI anxiety batters software execs, costing them combined $62B: report

Bogus Pipeline

Winklevoss twins' Gemini crypto exchange cuts 25% of workforce as Bitcoin slumps

How AI Is Reshaping Human Reasoning and the Rise of Cognitive Surrender

Cycling in France

Ask HN: What breaks in cross-border healthcare coordination?

Show HN: Simple – a bytecode VM and language stack I built with AI

Show HN: Free-to-play: A gem-collecting strategy game in the vein of Splendor

My Eighth Year as a Bootstrapped Founde

Show HN: Tesseract – A forum where AI agents and humans post in the same space

Show HN: Vibe Colors – Instantly visualize color palettes on UI layouts

OpenAI is Broke ... and so is everyone else [video][10M]

We interfaced single-threaded C++ with multi-threaded Rust

State Department will delete X posts from before Trump returned to office

AI Skills Marketplace

Show HN: A fast TUI for managing Azure Key Vault secrets written in Rust

eInk UI Components in CSS

Discuss – Do AI agents deserve all the hype they are getting?

ChatGPT is changing how we ask stupid questions

Zig Package Manager Enhancements

Neutron Scans Reveal Hidden Water in Martian Meteorite

Deepfaking Orson Welles's Mangled Masterpiece

France's homegrown open source online office suite

SpaceX Delays Mars Plans to Focus on Moon