Modular LLM framework inspired by Linux – aiming for a one-GPU future

2•openkame•5mo ago
I want to share a concept I've been thinking about, which I call *AI-Kernel*.

The idea is to manage large language models (LLMs) like we manage the Linux kernel:

- A *stable, long-term maintained base model* (the "kernel")
- Modular fine-tuned components (LoRA) as "patches/extensions"
- A public registry of LoRA modules, with ratings and metadata
- Flexible loaders (Ollama, llama.cpp, vLLM) to run the kernel + LoRAs
- A unified frontend (React/JS or CLI) to interact with the system
- Fully local or cloud, depending on user choice
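To make the registry idea a bit more concrete, here is a minimal in-memory sketch. Everything in it is an assumption for illustration: the `LoRAEntry` fields, the `Registry` class, the 1 to 5 star scale, and the module names are hypothetical, and a real registry would need storage, signing, and versioning rules that this post does not define.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class LoRAEntry:
    """Hypothetical metadata record for one published LoRA module."""
    name: str
    base_model: str  # the "kernel" this module was trained against
    version: str
    ratings: List[int] = field(default_factory=list)

    def average_rating(self) -> float:
        # Unrated modules sort last by treating them as 0.0.
        return sum(self.ratings) / len(self.ratings) if self.ratings else 0.0


class Registry:
    """Hypothetical in-memory registry keyed by module name."""

    def __init__(self) -> None:
        self._entries: Dict[str, LoRAEntry] = {}

    def publish(self, entry: LoRAEntry) -> None:
        self._entries[entry.name] = entry

    def rate(self, name: str, stars: int) -> None:
        # Assumed rating scale: 1 to 5 stars.
        if not 1 <= stars <= 5:
            raise ValueError("stars must be between 1 and 5")
        self._entries[name].ratings.append(stars)

    def compatible_with(self, base_model: str) -> List[LoRAEntry]:
        # Only modules built for the same kernel are returned, best rated first.
        matches = [e for e in self._entries.values() if e.base_model == base_model]
        return sorted(matches, key=lambda e: e.average_rating(), reverse=True)


registry = Registry()
registry.publish(LoRAEntry("legal-en", "kernel-v1", "0.1.0"))
registry.publish(LoRAEntry("sql-helper", "kernel-v1", "0.2.0"))
registry.publish(LoRAEntry("medical-fr", "kernel-v2", "0.1.0"))
registry.rate("legal-en", 3)
registry.rate("sql-helper", 5)
registry.rate("sql-helper", 4)

best_first = [e.name for e in registry.compatible_with("kernel-v1")]
print(best_first)
```

Filtering by `base_model` is the important design choice here: a LoRA is only meaningful for the base model it was trained on, so the registry has to record which kernel each module targets.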

---

### Why?

LLMs are growing in size, cost, and opacity. Instead of bigger and bigger models, what if we focused on *efficiency, modularity, and sustainability*?

This proposal suggests a benchmark for AI sustainability:

> If GPT-5 runs on 10,000 GPUs in 2025,
> then GPT-4 should run (with all features intact) on a *single GPU in 2026* – even if slower.
> In 2027, GPT-5 should become the single-GPU target.

Always *one generation behind, but fully local and sovereign*.

---

### How it works

            [ AI-Kernel (base LLM) ]
                       |
            +----------+----------+
            |          |          |
       [ LoRA A ]  [ LoRA B ]  [ LoRA C ]   ← Modular specialization
                       |
       [ Loader (Ollama / llama.cpp / vLLM) ]
                       |
         [ Frontend UI (web / desktop) ]
                       |
                     User

LoRAs are small, stackable, and don't alter the base model. Like VS Code extensions, they can be published, rated, shared, and combined.
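As a rough illustration of "stackable" and "don't alter the base model", the sketch below applies the standard LoRA update, where each adapter contributes a low-rank product `B @ A` that is added to a frozen weight matrix. It is a toy in plain Python with tiny matrices; the `LoRA` class, the `effective_weight` helper, and the adapter names are hypothetical, and real loaders apply this per layer on tensors rather than nested lists.

```python
from dataclasses import dataclass
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain-Python matrix product, enough for a toy example."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


@dataclass(frozen=True)
class LoRA:
    """Hypothetical adapter: a low-rank update B @ A with a scaling factor."""
    name: str
    B: Matrix  # shape: d_out x r
    A: Matrix  # shape: r x d_in
    scale: float = 1.0


def effective_weight(base: Matrix, adapters: List[LoRA]) -> Matrix:
    """Return base + sum(scale * B @ A) without modifying `base`."""
    out = [row[:] for row in base]  # copy, so the "kernel" weights stay frozen
    for adapter in adapters:
        delta = matmul(adapter.B, adapter.A)
        for i, row in enumerate(delta):
            for j, value in enumerate(row):
                out[i][j] += adapter.scale * value
    return out


base = [[1.0, 0.0], [0.0, 1.0]]  # stand-in for one frozen weight matrix
lora_a = LoRA("lora-a", B=[[1.0], [0.0]], A=[[0.0, 2.0]])
lora_b = LoRA("lora-b", B=[[0.0], [1.0]], A=[[3.0, 0.0]], scale=0.5)

merged = effective_weight(base, [lora_a, lora_b])
print(merged)  # [[1.0, 2.0], [1.5, 1.0]]
print(base)    # [[1.0, 0.0], [0.0, 1.0]], unchanged
```

Because each adapter is only an additive delta, adapters can be combined or dropped at load time. Whether independently trained adapters combine *well* is a separate question that this sketch does not address.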

---

### Transparency

I’m *a self-taught developer*, not an AI researcher. This is not a working product or codebase — just a structured idea for discussion.

Maybe others already thought of it. Maybe I’ve missed limits or blockers. But I wanted to write it down clearly and let more qualified people refine or challenge it.

This draft was co-written with GPT, in full transparency. The vision is mine; the wording was assisted.

---

### What this is NOT

- Not a fork or fight against existing projects
- Not an implementation with code (yet)
- Not claiming novelty or exclusive ownership

It’s simply a *direction to consider*: A modular, open, kernel-like model for AI that is sustainable and private.

---

### Call to action

If this resonates with you:

- Improve it
- Challenge it
- Build loaders, registries, or LoRA modules
- Or just ignore it if you think it’s irrelevant

We don’t need dozens of forks of LLMs. We need *one clean foundation, and thousands of flexible adaptations*.

Let’s build it, together.

Comments

incomingpain•5mo ago
Good luck with model compatibility, and with the inevitable fracturing of the project when Llama, Qwen, and GPT just can't all be supported.

You probably need >1000 people to maintain this project.

What's the value add over what ollama already does?

We're also moving toward containerization because "hacking" is a thing.
