Ask HN: Are Agents Just Hype?

4•spacemnstr42069•9h ago
I have spent the last few days trying (and failing) to find real cases where AI agents actually scale in production. Outside of coding agents and dev-productivity tools, I am struggling to see anything that looks like a genuinely scalable agent system.

Most of what people are calling “agents” today are basically deterministic workflows with one or two LLM calls glued together. That is not an agent. That is, at best, an API pipeline.

So I am genuinely curious: are there any real examples of agents handling large, messy, multi-step workflows at scale? Not demos, not toy projects, not VC decks.

Comments

AIinyourAI•8h ago
There are probably examples out there, but they are heavily paywalled, since their authors probably still make more money selling the guides than actually using the agents.
spacemnstr42069•4h ago
You mean the agents themselves have no value? Making money by selling guides, lol!
scosman•8h ago
No, they work fine. Not a panacea, but they work when the use case permits.

Usually it's layers of tools clustered under sub-agents, with fairly detailed orchestration prompts at the higher levels. Orchestration via agent prompts can be better than hard-coded workflows when the workflow requires qualitative assessments.
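A minimal sketch of that layering, assuming a hypothetical `SubAgent` class, made-up tool names, and an `assess` function that stands in for the orchestration prompt (a real system would make a model call there instead of a keyword check):

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Tool = Callable[[str], str]


@dataclass
class SubAgent:
    """Hypothetical sub-agent: a name plus the tools clustered under it."""
    name: str
    tools: Dict[str, Tool]

    def run(self, request: str) -> List[str]:
        # A real sub-agent would let a model choose tools; here every tool runs in order.
        return [f"{self.name}.{tool_name}: {tool(request)}"
                for tool_name, tool in self.tools.items()]


def assess(request: str) -> str:
    """Stand-in for the orchestrator's qualitative judgment (an LLM call in practice)."""
    text = request.lower()
    return "billing" if ("refund" in text or "invoice" in text) else "support"


def orchestrate(request: str, agents: Dict[str, SubAgent],
                assess_fn: Callable[[str], str] = assess) -> List[str]:
    # The top level only decides who handles the request, then delegates.
    return agents[assess_fn(request)].run(request)


# Illustrative agents and tools; none of these names come from a real product.
AGENTS = {
    "billing": SubAgent("billing", {
        "lookup_invoice": lambda r: "invoice located",
        "issue_refund": lambda r: "refund queued",
    }),
    "support": SubAgent("support", {
        "search_docs": lambda r: "2 articles matched",
    }),
}

if __name__ == "__main__":
    for line in orchestrate("Please refund order 17", AGENTS):
        print(line)
```

The point of the split is that the routing decision sits in one replaceable function, so the keyword check can be swapped for a prompt without touching the sub-agents.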

spacemnstr42069•4h ago
Any examples of such agents that you have seen?
muzani•3h ago
Cursor has been doing this in production since the start of 2025. You give it instructions or, heck, a screenshot of a bug. It searches for relevant code based on the problem. It then searches the area around that code for tests and expected behaviors. If you've written a comment pointing to Jira on a certain bug that this code is responsible for fixing (instead of writing a test), it can check that ticket for expected behavior. It may write tests to fill in these gaps or it may just write the code. Then it runs the tests if possible. If a test fails, it compares the failure against the new code. It then writes new code and reruns the tests.
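The write, test, and retry part of that loop can be sketched roughly as follows. This is not Cursor's implementation: `fake_model` is a hypothetical stand-in for an LLM call, and `run_tests` is a toy test suite invented for illustration.

```python
from typing import Callable, List, Optional, Tuple


def fake_model(task: str, feedback: Optional[str], attempt: int) -> str:
    """Hypothetical model call: returns candidate source code.

    A real agent would send the task and the last test failure to an LLM.
    """
    candidates = [
        "def add(a, b):\n    return a - b\n",  # first attempt is wrong on purpose
        "def add(a, b):\n    return a + b\n",  # second attempt fixes it
    ]
    return candidates[min(attempt, len(candidates) - 1)]


def run_tests(source: str) -> Optional[str]:
    """Run the candidate against a tiny suite; return an error message, or None on success."""
    namespace: dict = {}
    try:
        exec(source, namespace)
        assert namespace["add"](2, 3) == 5, "add(2, 3) should be 5"
        assert namespace["add"](-1, 1) == 0, "add(-1, 1) should be 0"
    except Exception as exc:
        return f"{type(exc).__name__}: {exc}"
    return None


def agent_loop(task: str,
               model: Callable[[str, Optional[str], int], str],
               max_attempts: int = 3) -> Tuple[Optional[str], List[str]]:
    """Propose code, run tests, feed the failure back, and retry up to max_attempts."""
    feedback: Optional[str] = None
    failures: List[str] = []
    for attempt in range(max_attempts):
        source = model(task, feedback, attempt)
        feedback = run_tests(source)
        if feedback is None:
            return source, failures  # tests pass, stop here
        failures.append(feedback)
    return None, failures  # gave up after max_attempts


if __name__ == "__main__":
    code, history = agent_loop("write add(a, b)", fake_model)
    print("failures seen:", history)
    print("accepted code:\n" + (code or "<none>"))
```

The attempt cap matters in practice: without it, a model that never converges keeps burning calls.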

Tool-use is common in most of the major AI models now and it's really the differentiator between how they perform when writing code. Few write correct code the first time. What makes them different is the ability to read and modify complex code across multiple files, without being told which files.

I think by next year, we could see this extend across the UI domain - it writes code, runs it, views the UI, critiques the results, then tweaks things like font and whitespace. I did a prototype mid-year which would even show it to a user, and it would talk them through what they liked or didn't like. But you can even chain it between multiple LLMs (designer, programmer, customer roles) and it would fit your definition.
