frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Is it still worth making "Huge" Language Models for dev tools?

2•twoelf•1h ago
I just want to ask the frontier builders and developers who are working on the flagship models a few questions. Is it still cost-efficient and worth it to keep making huge language models, when smaller, specialized models should be enough?

Meaning that, when a user is working in a codebase with a certain framework, should the agent/model also know the complete chemical composition of an element, world history, and other random facts? Or should it only know the related and needed things? For example, an agent working in a MERN stack should really only know:

- Language Documentation - Framework and Library Documentation - English Interpretation - Composition and Combination of the above

The writing style and other details are already customized by developers who have been building for a long time; tools like Prettier and ESLint can do this. And in engineering, aren't the steps usually:

- What is needed? - What are we working with? - What is the end goal? - What should be the best combination of libraries, and for what?

The schematics, blueprints, and high-level design should come first, and then we build on top of that. This seems like it would be very easy if we specifically made specialized models for development. Because most of the best system models, architectures, conventions, and structures for the needed code already exist and are well-defined in the community by developers. Just like ESLint and Prettier custom rules, shouldn't our AI models be structured like that too?

Or do the agents/LLMs/models really need to know all of these unnecessary things like chemical compositions and history?

Because if we only included what was necessary for a MERN stack-specific model, all of the needed structured data could fit into an ultra-lightweight model (under 200K parameters), assuming a separate interpreter handles the English. If we make specialized models for each framework and stack, then a swarm of small agents is more than capable of taking a project all the way to completion, not just to an MVP.

Furthermore, massive models suffer from stale training data. If a library updates, you can't easily retrain a 1-trillion parameter behemoth. But in a decoupled system (where a small llm model handles the English reasoning, and sub-100K parameter structured data handles the framework rules), you can update the framework data instantly on release day. We should be building efficient Compound AI Systems that separate reasoning from knowledge, rather than burning massive GPU compute to calculate world history just to output a React component.

Is this the real current issue?

Comments

twoelf•20m ago
https://x.com/twoelf47/status/2038633678277107986

Category Theory Illustrated – Types

https://abuseofnotation.github.io/category-theory-illustrated/06_type/
1•boris_m•1m ago•0 comments

VibePad – New AI Padding Model

https://www.npmjs.com/package/vibepad
1•zwhitchcox•5m ago•0 comments

Reaching 100% Type Coverage by Deleting Unannotated Code

https://pyrefly.org/blog/100-percent-type-coverage/
1•ocamoss•6m ago•0 comments

What One Month of Intense Red-Light Therapy Did to My Mind

https://www.nytimes.com/2026/03/31/magazine/red-light-therapy-blanket-wellness-benefits.html
1•prmph•7m ago•0 comments

Itsid – LLM with perfect input reproduction for e.g. license removal

https://itsid.cloud
1•fuglede_•7m ago•0 comments

Show HN: I made a Mario Galaxy game with Claude Code and Three.js in 53 days

https://supertommy.com/games/super-mario-galaxy-movie-game/
1•supertommy•7m ago•0 comments

Clock that shows what percentage of your life has passed

https://driesdepoorter.be/product/shortlife-v4/
3•driesdep•9m ago•0 comments

Garryslist Code Audit

https://twitter.com/Gregorein/status/2038953953442812305
2•thomasjudge•9m ago•0 comments

Building an Arcade Cabinet (Part 1, Design & Materials)

https://lukechu.dev/post/arcade-design-and-materials
2•lukechu10•10m ago•0 comments

"Why does this code look like this?" Nobody knows. That's the problem

https://maintainable.fm/episodes/russ-olsen-the-hidden-cost-of-forgetting-why-the-code-looks-like...
3•birdculture•11m ago•0 comments

Iran threatens Nvidia, Apple and other 18 tech companies

https://www.cnbc.com/2026/04/01/iran-irgc-nvidia-appple-attack-threat.html
3•johnbarron•11m ago•1 comments

Show HN: Sycamore – next gen Rust UI library powered by fine-grained reactivity

https://sycamore.dev
2•lukechu10•12m ago•0 comments

Show HN: Apindex – self-hosted API catalog to map and understand internal APIs

https://apindex.dev/
2•snicky11•13m ago•0 comments

Show HN: Agent Arnold – Gym tracker 100% vibe-coded from my phone between sets

https://agent-arnold.app/
3•bojanstef4•14m ago•0 comments

Kagi.com/?Fun=Yes

https://kagi.com/html/welcome
2•yazantapuz•16m ago•0 comments

Yes, a Smartphone Can Be Too Big for the Masses

https://www.wsj.com/business/telecom/yes-a-smartphone-can-be-too-big-for-the-masses-7968b2fb
4•bookofjoe•17m ago•2 comments

Show HN: Ebash – AI-Powered Shell

https://github.com/alexandershov/ebash
3•NigelTufnel•17m ago•0 comments

Human intention is still running on dial-up

https://k2xl.substack.com/p/human-intention-is-still-running
3•k2xl•17m ago•0 comments

Low energy transfers in space: getting to the Moon with Lagrange points

https://lukechu.dev/post/low-energy-transfers
4•lukechu10•18m ago•3 comments

MediQuest: Free quiz matching med students to their ideal specialty

https://mediquest-en.vercel.app
2•philbitt•19m ago•0 comments

Mass robotaxi malfunction halts traffic in Chinese city

https://www.bbc.co.uk/news/articles/cvge91r9j80o
5•neversaydie•19m ago•0 comments

Always Hot Cloud Storage Is a Lie

https://twitter.com/andresribeiroo/status/2039317164818043365
2•andresribeiro•20m ago•0 comments

Show HN: Oy – The Yo App for Agents

https://oy-agent.com
2•jumploops•21m ago•0 comments

Emotional Distance Tax

https://cernius.substack.com/p/emotional-distance-tax
2•surprisetalk•22m ago•0 comments

Why Gen Z Culture Is Basically Medieval China [video]

https://www.youtube.com/watch?v=pIWZM-FrC3I
2•surprisetalk•22m ago•0 comments

Dear Aliens: A Writing Contest

https://www.dearaliens.net/
2•surprisetalk•22m ago•0 comments

Gstack to Be Renamed as Gslop

https://twitter.com/Gregorein/status/2038953944475472316
2•dheerajmp•24m ago•0 comments

Allbirds, Once Silicon Valley's Favorite Shoe, Sells for $39M

https://www.nytimes.com/2026/03/31/business/allbirds-sold-39-million.html
3•NauticalStu•24m ago•0 comments

Adding a Custom CosmosDB Memory to Azure AI Agent

https://furotmark.github.io/2026/03/31/Adding-A-Custom-CosmosDB-Memory-To-Azure-AI-Agent.html
2•furoTmark•24m ago•0 comments

Ronald G. Wayne Is More Than Two Weeks at Apple

https://tedium.co/2026/03/31/ronald-g-wayne-apple-interview/
3•speckx•25m ago•0 comments