frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Ask HN: Anyone Using a Mac Studio for Local AI/LLM?

43•UmYeahNo•1d ago•27 comments

Ask HN: Non AI-obsessed tech forums

17•nanocat•5h ago•11 comments

Ask HN: Ideas for small ways to make the world a better place

8•jlmcgraw•7h ago•16 comments

Ask HN: 10 months since the Llama-4 release: what happened to Meta AI?

42•Invictus0•23h ago•11 comments

AI Regex Scientist: A self-improving regex solver

5•PranoyP•9h ago•1 comments

Ask HN: Who wants to be hired? (February 2026)

139•whoishiring•4d ago•510 comments

Ask HN: Who is hiring? (February 2026)

312•whoishiring•4d ago•511 comments

Ask HN: Any International Job Boards for International Workers?

2•15charslong•4h ago•1 comments

Ask HN: Why LLM providers sell access instead of consulting services?

4•pera•15h ago•13 comments

Tell HN: Another round of Zendesk email spam

104•Philpax•2d ago•54 comments

Ask HN: Is Connecting via SSH Risky?

19•atrevbot•1d ago•37 comments

Ask HN: What is the most complicated Algorithm you came up with yourself?

3•meffmadd•17h ago•7 comments

Ask HN: Has your whole engineering team gone big into AI coding? How's it going?

17•jchung•1d ago•12 comments

Ask HN: How does ChatGPT decide which websites to recommend?

5•nworley•1d ago•11 comments

Ask HN: Is it just me or are most businesses insane?

7•justenough•1d ago•5 comments

Ask HN: Mem0 stores memories, but doesn't learn user patterns

9•fliellerjulian•2d ago•6 comments

Ask HN: Anyone Seeing YT ads related to chats on ChatGPT?

2•guhsnamih•1d ago•4 comments

Ask HN: Does global decoupling from the USA signal comeback of the desktop app?

5•wewewedxfgdf•1d ago•2 comments

Ask HN: Is there anyone here who still uses slide rules?

123•blenderob•3d ago•122 comments

Kernighan on Programming

170•chrisjj•4d ago•61 comments

We built a serverless GPU inference platform with predictable latency

5•QubridAI•1d ago•1 comments

Ask HN: How Did You Validate?

4•haute_cuisine•1d ago•4 comments

Ask HN: Cheap laptop for Linux without GUI (for writing)

15•locusofself•3d ago•16 comments

Ask HN: Have you been fired because of AI?

17•s-stude•3d ago•15 comments

Test management tools for automation heavy teams

2•Divyakurian•1d ago•2 comments

Ask HN: Does a good "read it later" app exist?

7•buchanae•3d ago•18 comments

Ask HN: OpenClaw users, what is your token spend?

14•8cvor6j844qw_d6•4d ago•6 comments

Ask HN: Anyone have a "sovereign" solution for phone calls?

11•kldg•3d ago•1 comments

Ask HN: Has anybody moved their local community off of Facebook groups?

23•madsohm•4d ago•17 comments

How do you deal with SEO nowadays?

5•jackota•1d ago•8 comments
Open in hackernews

Ask HN: Are Agents Just Hype?

10•spacemnstr42069•2mo ago
I have spent the last few days trying (and failing) to find real cases where AI agents actually scale in production. Outside of coding agents and dev-productivity tools, I am struggling to see anything that looks like a genuinely scalable agent system.

Most of what people are calling “agents” today are basically deterministic workflows with one or two LLM calls glued together. That is not an agent. That is a at best API pipeline.

So I am genuinely curious: are there any real examples of agents handling large, messy, multi-step workflows at scale? Not demos, not toy projects, not VC decks.

Comments

AIinyourAI•2mo ago
There probably are examples out there but paywalled heavily as they probably still make more money from selling the guides than actually using the agents.
spacemnstr42069•2mo ago
You mean agent is not value? Selling guides making money, lol!
scosman•2mo ago
No they work fine. Not a panacea, but when the use case permits.

Usually layers of tools clustered under sub-agents, and fairly detailed orchestration prompts at higher levels. Orchestration via agent-prompts can be better than hard-coded workflows when they require qualitative assessments.

spacemnstr42069•2mo ago
any example of such agent you see?
muzani•2mo ago
Cursor has been doing this in production since the start of 2025. You give it instructions or heck, a screenshot of a bug. It searches for relevant code based on the problem. It searches in an area around the code, for tests, behaviors. If you've written a comment pointing to Jira on a certain bug that this code is responsible for fixing (instead of writing a test), it can check that ticket for expected behavior. It may write tests to fill in these gaps or it may just write the code. Then it runs the tests if possible. If a test fails, it compares to the new code. It writes new code and reruns tests.

Tool-use is common in most of the major AI models now and it's really the differentiator between how they perform when writing code. Few write correct code the first time. What makes them different is the ability to read and modify complex code across multiple files, without being told which files.

I think by next year, we could see this extend across the UI domain - it writes code, runs it, views the UI, critiques the results, then tweaks things like font and whitespace. I did a prototype mid-year which would even show it to a user, and it would talk them through what they liked or didn't like. But you can even chain it between multiple LLMs (designer, programmer, customer roles) and it would fit your definition.

spacemnstr42069•2mo ago
Agreed, but as mentioned in the original post, I am looking for use cases out of coding agents. Coding agents are scaling, yes! What are other use cases?
iamflimflam1•2mo ago
A lot of things are happening behind closed doors.

The real value is in horribly manual internal processes where the solutions are agents driving very specific tools that drive weird and wacky systems.

Generic out of the box agents that will solve your particular problem are not a thing yet.

In regard to it’s just an “API pipeline” - the power of agents should be - which set of API calls do I string together to solve a user’s request.