frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Why don't programming language foundations offer "smol" models?

1•xrd•2h ago
I'm using claude code A LOT. And, I'm using gemini cli A LOT. Definitely getting a ton of value as a developer from those tools. Not sure I can go back to the old way of developing.

And, I'm getting worried that someday Anthropic will say "Hey, yeah, about that Max plan which is $100/mo. Sorry, we decided we need to charge you $5000/mo. Oh, and LOL, btws, that's if you commit to an annual plan."

Or, a Google rep will email me saying "Sundar (it wasn't me!) says you were too critical of Google on HN that one time four years ago (Sundar verified it isn't a gemini hallucination, but I can't really question it). So, your gemini cli is cut off immediately."

Then, I'll be stuck and no more software engineering work because my brain rotted away.

For this reason, I want to run LLMs locally, using llama.cpp/ollama and use tools like Aider. But, running a "big" model with my hardware is tough. The quality of output and all the things that make claude and gemini so powerful are not there the combination of local LLMs and tool like aider, at least when things run locally. Perhaps I'm doing it wrong?

I wonder why I can't find a model that only does Python and is good only at that, and run that locally. When I need to do zig, I can switch to a zig model, and unload the python one from memory. If it only does a single language, and it does not need to know about US presidential elections, couldn't it be very small and something I could run on my MacOS M1 laptop with 16GB of ram?

I feel like models get big when they get generalized. I am never working on a codebase that has Rails and FastAPI and Elixir and React and Svelte and Go and Rust and COBOL. I might work on a repo with typescript and python, but never more than one, and I'm usually focused on either the frontend or backend.

If this is the solution, are language foundations building their own models? Is this already happening on huggingface or somewhere else?

This seems like an approach where a language foundation could train and certify their own model and it would be safe and "open source" and "open weights."

Is there a big stupid assumption I'm making here that makes this idea impossible?

Comments

ben_w•1h ago
> I wonder why I can't find a model that only does Python and is good only at that, and run that locally. When I need to do zig, I can switch to a zig model, and unload the python one from memory. If it only does a single language, and it does not need to know about US presidential elections, couldn't it be very small and something I could run on my MacOS M1 laptop with 16GB of ram?

I also wonder this.

My suspicion — based on what I experienced with local image generating models, but otherwise poorly educated — is that they need all of the other stuff besides programming languages just to understand what your plain English prompt means in the first place, and they need to be quite bulky models to have any kind of coherency over token horizons longer than one single function.

Of interest: Apple does ship a coding LLM in Xcode that's (IIRC) 2 GB and it really just does feel like fancy Swift-only autocomplete.

Show HN: Insinuate – tense description party game

https://wilf.live/insinuate/
1•wolfred•2m ago•0 comments

Differentiation and how it could be the reason why reality exists

https://carlo-htgdc.medium.com/the-principle-of-differentiation-and-how-its-the-reason-why-realit...
1•rlili•5m ago•0 comments

Luanti Non-Profit: We've Joined Open Collective Europe

https://blog.luanti.org/2025/11/05/non-profit/
1•bovermyer•8m ago•0 comments

Elon Musk Wins $1T Tesla Payday

https://www.nytimes.com/2025/11/06/business/elon-musk-tesla-pay-vote.html
1•amelius•8m ago•0 comments

Hurrah to the Fallen Comrades

https://blog.osm-ai.net/2025/11/05/hurrah-to-the-fallen-comrades.html
1•osm3000•10m ago•1 comments

DNA and jolts of electricity get people to make optimal antibodies

https://arstechnica.com/science/2025/10/dna-and-jolts-of-electricity-get-people-to-make-optimal-a...
1•PaulHoule•13m ago•0 comments

Game Design Is Simple

https://www.raphkoster.com/2025/11/03/game-design-is-simple-actually/
2•vrnvu•13m ago•0 comments

Copilot leaked information and misrouted to another users

https://docs.cloud.google.com/support/bulletins
2•benjiro•15m ago•1 comments

AI Agent Guides from Google, Anthropic, Microsoft, etc. Released This Week

https://sarthakai.substack.com/p/6-ai-agent-guides-from-google-anthropic
1•sarthakrastogi•15m ago•0 comments

Europeans recognize Zohran Mamdani's policies as 'normal'

https://www.theguardian.com/us-news/2025/nov/06/europe-zohran-mamdani-policies-normal
4•mykowebhn•15m ago•1 comments

US judge approves DOJ decision to drop Boeing criminal case

https://journalrecord.com/2025/11/06/boeing-737-max-criminal-case-dismissed/
1•layer8•15m ago•0 comments

Show HN: Tool2agent – a protocol for LLM tool feedback workflows

https://github.com/tool2agent/tool2agent
1•klntsky•16m ago•0 comments

Azure Cosmos DB and DocumentDB Agenda for Microsoft Ignite 2025

https://devblogs.microsoft.com/cosmosdb/azure-cosmosdb-documentdb-agenda-microsoft-ignite-2025/
1•jaydestro•17m ago•1 comments

YOLO Mode Is How You Build Fast. Auditable Control Is How You Ship Faster

https://securetrajectories.substack.com/p/auditable-control-coding-agents
1•mooreds•17m ago•0 comments

Show HN: I made a Halloween roguelike where you battle to merge a 1000-line PR

https://haystackeditor.com/game
1•akshaysg•17m ago•0 comments

Detection of Covid via sound of cough by machine-learning with 98.5% accuracy

https://www.nature.com/articles/s41598-025-22874-7/figures/1
1•ck2•19m ago•0 comments

How Much Does This Meeting Cost?

https://ramezanpour.net/post/2025/11/06/how-much-does-this-meeting-cost
1•ramezanpour•20m ago•2 comments

Developing a 80000x40000 linear scanning medium format camera [video]

https://www.youtube.com/watch?v=KSvjJGbFCws
1•ivanjermakov•20m ago•0 comments

Apple's Watch Will Lose Wi-Fi Sync with iPhone in Europe

https://twitter.com/Apple_Geek_Actu/status/1985784632949031402
1•redbell•20m ago•0 comments

Medical miracles in Lourdes, France recognized by the Catholic Church 2018-2025

https://www.saintbeluga.org/our-lady-of-lourdes-immaculate-conception
1•michelangelodev•21m ago•0 comments

First Look at Local Housing Markets in October

https://calculatedrisk.substack.com/p/1st-look-at-local-housing-markets-137
1•mooreds•21m ago•0 comments

Nubank announces a new hybrid model for 2026

https://international.nubank.com.br/company/nubank-announces-a-new-hybrid-model-for-2026/
1•1u15•24m ago•0 comments

Show HN: Flynn's Arcade (Pico8 on Mobile)

2•jharohit•25m ago•0 comments

Our 10 Rules of using Coding Agents

https://blog.cloud66.com/our-10-rules-of-using-coding-agents
1•ksajadi•26m ago•0 comments

When did people favor composition over inheritance?

https://www.sicpers.info/2025/11/when-did-people-favor-composition-over-inheritance/
2•ingve•28m ago•0 comments

Does the AI boom threaten air quality?

https://www.marketplace.org/story/2025/11/06/denver-neighborhood-concerned-about-ai-data-center-p...
3•mooreds•28m ago•0 comments

Writing software is an act of learning. Don’t automate it.

https://martinfowler.com/articles/llm-learning-loop.html
4•johnwheeler•31m ago•0 comments

Tesla shareholders approve Musk's $1T pay plan with 75%+ voting in favor

https://www.cnbc.com/2025/11/06/tesla-shareholders-musk-pay.html
11•koolba•32m ago•6 comments

The Terrifying Physics of Shaking Hands with an Alien [video]

https://www.youtube.com/watch?v=R-6bvBtZ8r8
1•gmays•32m ago•0 comments

Merry Sky Weather Forecast

https://merrysky.net/
2•thinkingemote•33m ago•0 comments