Are any of you using LLMs to create full features in big enterprise apps?

4•not_that_d•5h ago
Let me be clear first: I don't dislike LLMs. I query them, and I trigger agents to do things where I roughly know what the end goal is, or to analyze small parts of an application.

That said, every time I give it something a little more complex than a single-file script, it fails me horribly. Either the code is really bad, or the approach is as bad as that of someone who doesn't really know what to do, or it plainly starts doing things that I explicitly said not to do in the initial prompt.

I have sometimes asked my LLM-fan coworkers to come and help when that happens, and they are not able to "fix it" either, but somehow I am the one doing it wrong due to a "wrong prompt" or "lack of correct context".

I have created a lot of "Agents.md" files and dropped files into the context window... Nothing.

When I need to do greenfield stuff or PoCs, it delivers fast, but applying it to work inside an existing big application fails.

The only place where I feel as "productive" as I hear other people are is when I work in languages or technologies I don't know at all, but then again, I don't know whether the functional code I get at the end is broken in ways I am not aware of.

Are any of you really using LLMs to create full features in big enterprise apps?

Comments

linesofcode•4h ago
The quality of an LLM’s output depends greatly on how many guardrails you have set up to keep it on track and on the heuristics that point it in the right direction (type checking plus running tests after every change, for example).
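The guardrail idea above (type checking plus tests after every change) could be sketched as a small script that an agent, or a human, runs after each edit. This is a minimal sketch and not the commenter's actual setup: `run_checks`, `CheckResult`, and `DEFAULT_CHECKS` are hypothetical names, and the default commands assume mypy and pytest are installed in the project.

```python
import subprocess
import sys
from dataclasses import dataclass


@dataclass
class CheckResult:
    name: str
    passed: bool
    output: str


def run_checks(checks):
    """Run each (name, command) pair in order and stop at the first failure."""
    results = []
    for name, command in checks:
        proc = subprocess.run(command, capture_output=True, text=True)
        passed = proc.returncode == 0
        results.append(CheckResult(name, passed, proc.stdout + proc.stderr))
        if not passed:
            break  # hand this failure back to the agent before going further
    return results


# Hypothetical defaults: assumes mypy and pytest are available in this environment.
DEFAULT_CHECKS = [
    ("types", [sys.executable, "-m", "mypy", "."]),
    ("tests", [sys.executable, "-m", "pytest", "-q"]),
]

if __name__ == "__main__":
    for result in run_checks(DEFAULT_CHECKS):
        print(f"{result.name}: {'ok' if result.passed else 'FAILED'}")
        if not result.passed:
            print(result.output)
```

Stopping at the first failing check keeps the feedback short, so the captured output can be pasted straight back into the prompt.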

What is the health of your enterprise codebase? If it’s anything like the ones I’ve experienced, it’s a legacy mess, and then it’s absolutely understandable that an LLM’s output is subpar when taking on larger tasks.

It also depends on the model and plan you’re on. There is a significant increase in quality when comparing Cursor’s default model on a free plan vs. Opus 4.5 on a maximum Claude plan.

I think a good exercise is to prohibit yourself from writing any code manually and force yourself to go LLM-only. It might sound silly, but it will develop that skill set.

Try Claude Code in thinking mode with some superpowers - https://github.com/obra/superpowers

I routinely make an implementation plan with Claude and then step away for 15 mins while it spins - the results aren’t perfect but fixing that remaining 10% is better than writing 100% of it myself.

not_that_d•3h ago
The code is quite easy to follow, to be honest. We have documented a lot of stuff and segmented functionality into libraries that follow an app/feature/models pattern. Almost every service we have has unit tests explicitly describing what the public API does, or is supposed to do, in several scenarios; we never test implementation details.

Giving it to new people of course raises questions, but most of them (juniors) could just follow the code given an entry point for the task, all the way from BE to FE.

I use the premium models available in GitHub Copilot.

> I routinely make an implementation plan with Claude and then step away for 15 mins while it spins - the results aren’t perfect but fixing that remaining 10% is better than writing 100% of it myself.

I have to be honest, I only did this twice, and the amount of code that needed to be fixed, plus the mental overload of finding open bugs, was much more than just guiding the LLM at every step. But this was a couple of months ago.

not_that_d•3h ago
Besides my other response, it could also be that I am not smart enough for it.
