frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Modular LLM framework inspired by Linux – aiming for a one-GPU future

2•openkame•5mo ago
I want to share a concept I've been thinking about, which I call *AI-Kernel*.

The idea is to manage large language models (LLMs) like we manage the Linux kernel: - A *stable, long-term maintained base model* (the "kernel") - Modular fine-tuned components (LoRA) as "patches/extensions" - A public registry of LoRA modules, with ratings and metadata - Flexible loaders (Ollama, llama.cpp, vLLM) to run the kernel + LoRAs - A unified frontend (React/JS or CLI) to interact with the system - Fully local or cloud, depending on user choice

---

### Why?

LLMs are growing in size, cost, and opacity. Instead of bigger and bigger models, what if we focused on *efficiency, modularity, and sustainability*?

This proposal suggests a benchmark for AI sustainability:

> If GPT-5 runs on 10,000 GPUs in 2025, > then GPT-4 should run (with all features intact) on a *single GPU in 2026* – even if slower. > In 2027, GPT-5 should become the single-GPU target.

Always *one generation behind, but fully local and sovereign*.

---

### How it works

[ AI-Kernel (base LLM) ] |

+----------+----------+ \| | | \[ LoRA A ] \[ LoRA B ] \[ LoRA C ] ← Modular specialization | \[ Loader (Ollama / llama.cpp / vLLM) ] | \[ Frontend UI (web / desktop) ] | User

LoRAs are small, stackable, and don't alter the base model. Like VS Code extensions, they can be published, rated, shared, and combined.

---

### Transparency

I’m *a self-taught developer*, not an AI researcher. This is not a working product or codebase — just a structured idea for discussion.

Maybe others already thought of it. Maybe I’ve missed limits or blockers. But I wanted to write it down clearly and let more qualified people refine or challenge it.

This draft was co-written with GPT, in full transparency. The vision is mine; the wording was assisted.

---

### What this is NOT

- Not a fork or fight against existing projects - Not an implementation with code (yet) - Not claiming novelty or exclusive ownership

It’s simply a *direction to consider*: A modular, open, kernel-like model for AI that is sustainable and private.

---

### Call to action

If this resonates with you: - Improve it - Challenge it - Build loaders, registries, or LoRA modules - Or just ignore it if you think it’s irrelevant

We don’t need dozens of forks of LLMs. We need *one clean foundation, and thousands of flexible adaptations*.

Let’s build it — together. ```

Comments

incomingpain•5mo ago
Good luck with model compatibility, the inevitable fracturing of the project when llama, qwen, and gpt are just not possible.

You probably need >1000 people to maintain this project.

What's the value add over what ollama already does?

We're also going containerization because "hacking" is a thing.

Los Alamos Primer

https://blog.szczepan.org/blog/los-alamos-primer/
1•alkyon•21s ago•0 comments

NewASM Virtual Machine

https://github.com/bracesoftware/newasm
1•DEntisT_•2m ago•0 comments

Terminal-Bench 2.0 Leaderboard

https://www.tbench.ai/leaderboard/terminal-bench/2.0
1•tosh•2m ago•0 comments

I vibe coded a BBS bank with a real working ledger

https://mini-ledger.exe.xyz/
1•simonvc•3m ago•1 comments

The Path to Mojo 1.0

https://www.modular.com/blog/the-path-to-mojo-1-0
1•tosh•6m ago•0 comments

Show HN: I'm 75, building an OSS Virtual Protest Protocol for digital activism

https://github.com/voice-of-japan/Virtual-Protest-Protocol/blob/main/README.md
4•sakanakana00•9m ago•0 comments

Show HN: I built Divvy to split restaurant bills from a photo

https://divvyai.app/
3•pieterdy•11m ago•0 comments

Hot Reloading in Rust? Subsecond and Dioxus to the Rescue

https://codethoughts.io/posts/2026-02-07-rust-hot-reloading/
3•Tehnix•12m ago•1 comments

Skim – vibe review your PRs

https://github.com/Haizzz/skim
2•haizzz•13m ago•1 comments

Show HN: Open-source AI assistant for interview reasoning

https://github.com/evinjohnn/natively-cluely-ai-assistant
4•Nive11•13m ago•5 comments

Tech Edge: A Living Playbook for America's Technology Long Game

https://csis-website-prod.s3.amazonaws.com/s3fs-public/2026-01/260120_EST_Tech_Edge_0.pdf?Version...
2•hunglee2•17m ago•0 comments

Golden Cross vs. Death Cross: Crypto Trading Guide

https://chartscout.io/golden-cross-vs-death-cross-crypto-trading-guide
2•chartscout•20m ago•0 comments

Hoot: Scheme on WebAssembly

https://www.spritely.institute/hoot/
3•AlexeyBrin•23m ago•0 comments

What the longevity experts don't tell you

https://machielreyneke.com/blog/longevity-lessons/
2•machielrey•24m ago•1 comments

Monzo wrongly denied refunds to fraud and scam victims

https://www.theguardian.com/money/2026/feb/07/monzo-natwest-hsbc-refunds-fraud-scam-fos-ombudsman
3•tablets•29m ago•1 comments

They were drawn to Korea with dreams of K-pop stardom – but then let down

https://www.bbc.com/news/articles/cvgnq9rwyqno
2•breve•31m ago•0 comments

Show HN: AI-Powered Merchant Intelligence

https://nodee.co
1•jjkirsch•33m ago•0 comments

Bash parallel tasks and error handling

https://github.com/themattrix/bash-concurrent
2•pastage•33m ago•0 comments

Let's compile Quake like it's 1997

https://fabiensanglard.net/compile_like_1997/index.html
2•billiob•34m ago•0 comments

Reverse Engineering Medium.com's Editor: How Copy, Paste, and Images Work

https://app.writtte.com/read/gP0H6W5
2•birdculture•39m ago•0 comments

Go 1.22, SQLite, and Next.js: The "Boring" Back End

https://mohammedeabdelaziz.github.io/articles/go-next-pt-2
1•mohammede•45m ago•0 comments

Laibach the Whistleblowers [video]

https://www.youtube.com/watch?v=c6Mx2mxpaCY
1•KnuthIsGod•47m ago•1 comments

Slop News - The Front Page right now but it's only Slop

https://slop-news.pages.dev/slop-news
1•keepamovin•51m ago•1 comments

Economists vs. Technologists on AI

https://ideasindevelopment.substack.com/p/economists-vs-technologists-on-ai
1•econlmics•53m ago•0 comments

Life at the Edge

https://asadk.com/p/edge
4•tosh•59m ago•0 comments

RISC-V Vector Primer

https://github.com/simplex-micro/riscv-vector-primer/blob/main/index.md
4•oxxoxoxooo•1h ago•1 comments

Show HN: Invoxo – Invoicing with automatic EU VAT for cross-border services

2•InvoxoEU•1h ago•0 comments

A Tale of Two Standards, POSIX and Win32 (2005)

https://www.samba.org/samba/news/articles/low_point/tale_two_stds_os2.html
4•goranmoomin•1h ago•0 comments

Ask HN: Is the Downfall of SaaS Started?

4•throwaw12•1h ago•0 comments

Flirt: The Native Backend

https://blog.buenzli.dev/flirt-native-backend/
3•senekor•1h ago•0 comments