frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Modular LLM framework inspired by Linux – aiming for a one-GPU future

2•openkame•2h ago
I want to share a concept I've been thinking about, which I call *AI-Kernel*.

The idea is to manage large language models (LLMs) like we manage the Linux kernel: - A *stable, long-term maintained base model* (the "kernel") - Modular fine-tuned components (LoRA) as "patches/extensions" - A public registry of LoRA modules, with ratings and metadata - Flexible loaders (Ollama, llama.cpp, vLLM) to run the kernel + LoRAs - A unified frontend (React/JS or CLI) to interact with the system - Fully local or cloud, depending on user choice

---

### Why?

LLMs are growing in size, cost, and opacity. Instead of bigger and bigger models, what if we focused on *efficiency, modularity, and sustainability*?

This proposal suggests a benchmark for AI sustainability:

> If GPT-5 runs on 10,000 GPUs in 2025, > then GPT-4 should run (with all features intact) on a *single GPU in 2026* – even if slower. > In 2027, GPT-5 should become the single-GPU target.

Always *one generation behind, but fully local and sovereign*.

---

### How it works

[ AI-Kernel (base LLM) ] |

+----------+----------+ \| | | \[ LoRA A ] \[ LoRA B ] \[ LoRA C ] ← Modular specialization | \[ Loader (Ollama / llama.cpp / vLLM) ] | \[ Frontend UI (web / desktop) ] | User

LoRAs are small, stackable, and don't alter the base model. Like VS Code extensions, they can be published, rated, shared, and combined.

---

### Transparency

I’m *a self-taught developer*, not an AI researcher. This is not a working product or codebase — just a structured idea for discussion.

Maybe others already thought of it. Maybe I’ve missed limits or blockers. But I wanted to write it down clearly and let more qualified people refine or challenge it.

This draft was co-written with GPT, in full transparency. The vision is mine; the wording was assisted.

---

### What this is NOT

- Not a fork or fight against existing projects - Not an implementation with code (yet) - Not claiming novelty or exclusive ownership

It’s simply a *direction to consider*: A modular, open, kernel-like model for AI that is sustainable and private.

---

### Call to action

If this resonates with you: - Improve it - Challenge it - Build loaders, registries, or LoRA modules - Or just ignore it if you think it’s irrelevant

We don’t need dozens of forks of LLMs. We need *one clean foundation, and thousands of flexible adaptations*.

Let’s build it — together. ```

Spin loss into energy: New principle could enable ultra-low power devices

https://phys.org/news/2025-08-loss-energy-principle-enable-ultra.html
1•westurner•31s ago•0 comments

Loomer/mtg/maga backlash to trump

https://thehill.com/homenews/administration/5470438-donald-trump-maga-backlash-chinese-students/
1•DaveZale•2m ago•0 comments

Crystal 1.17.0 Is Released

https://crystal-lang.org/2025/07/16/1.17.0-released/
1•ksec•3m ago•0 comments

Maestro 2.0

https://maestro.dev/blog/introducing-maestro-2-0-0
1•lysecret•7m ago•0 comments

OpenAI Makes a Play for Healthcare

https://gizmodo.com/openai-makes-a-play-for-healthcare-2000648210
2•rntn•10m ago•0 comments

Proposal to Ban Ghost Jobs: The Truth in Job Advertising and Accountability Act

https://www.cnbc.com/2025/08/25/tech-worker-was-frustrated-with-ghost-jobs-now-hes-trying-to-pass...
4•Teever•11m ago•0 comments

Free website to play when bored

https://sites.google.com/view/drive-u-7-home/home
1•edrftgyhuj•11m ago•0 comments

India's Most Shocking Rebirth Court-Proven Titu Singh Case [video]

https://www.youtube.com/watch?v=1TtYoCGTTpQ
1•TriNetra•11m ago•0 comments

New vision-based system teaches machines to understand their bodies

https://www.eecs.mit.edu/robot-know-thyself-new-vision-based-system-teaches-machines-to-understan...
2•stmw•11m ago•0 comments

Meta Talks World-Lock Rendering for AR/Mr at Hot Chips 2025 – ServeTheHome

https://www.servethehome.com/meta-talks-world-lock-rendering-for-ar-mr-at-hot-chips-2025/
1•rbanffy•12m ago•0 comments

Modern Dentistry Is a Microplastic Minefield

https://www.theatlantic.com/health/archive/2025/08/modern-dentistry-microplastic/683996/
1•chapulin•12m ago•0 comments

National Weather Service application includes pledge to support Exec Orders

https://bsky.app/profile/johnmoralestv.bsky.social/post/3lxcugc6m422q
1•jaredwiener•14m ago•0 comments

Substack now requires In-App Purchases (IAP) on Apple devices

https://twitter.com/denk_tweets/status/1960354365484486904
1•ericzawo•14m ago•0 comments

Show HN: Old-School TUI File Viewer for Modern Terminals

https://www.youtube.com/watch?v=-VlH742uRys
1•velorek•15m ago•0 comments

Programming After AI: Why System Boundary Taste Matters

https://interjectedfuture.com/programming-after-ai-why-system-boundary-taste-matters/
1•iamwil•15m ago•0 comments

Calvinball Makes the Supreme Court [pdf]

https://www.supremecourt.gov/opinions/24pdf/25a103_kh7p.pdf
1•Bogdanp•16m ago•0 comments

Crowdsourcing Hedge Fund Gets $500M JPMorgan Commitment

https://www.bloomberg.com/news/articles/2025-08-26/crowdsourcing-hedge-fund-gets-500-million-jpmo...
1•ItsBeenAwhile•16m ago•0 comments

AI-generated scientific hypotheses lag human ones when put to the test

https://www.science.org/content/article/ai-generated-scientific-hypotheses-lag-human-ones-when-pu...
2•rbanffy•16m ago•0 comments

I'm Worried About Junior Developers

https://envylabs.com/insights/junior-developer-job-market-looking-forward
1•wonger_•17m ago•0 comments

Rv, a new kind of Ruby management tool

https://andre.arko.net/2025/08/25/rv-a-new-kind-of-ruby-management-tool/
3•ciconia•17m ago•1 comments

SMS URLs

https://sethmlarson.dev/sms-urls
1•SethMLarson•17m ago•0 comments

SigNoz (YC W21, Open Source Datadog) Is Hiring DevRel Engineers in the US

https://jobs.ashbyhq.com/SigNoz/8447522c-1163-48d0-8f55-fac25f64a0f3
1•pranay01•17m ago•0 comments

AI Barbie Dolls Could Change Playtime Forever

https://spectrum.ieee.org/ai-barbie-dolls
1•rbanffy•18m ago•0 comments

How plants and fungi trade resources without a brain

https://www.npr.org/sections/planet-money/2025/08/26/g-s1-85185/plants-fungi-resources-trade-coop...
2•marojejian•22m ago•0 comments

Pouch: A non-IP protocol for communication between devices and cloud services

https://github.com/golioth/pouch
3•hasheddan•23m ago•0 comments

Show HN: Framework to Create Linters for Python, YAML, TOML, JSON

https://github.com/open-nudge/lintkit
4•szymonmaszke•23m ago•1 comments

The Canary in the Classroom

https://hollisrobbinsanecdotal.substack.com/p/the-canary-in-the-classroom
1•HR01•23m ago•0 comments

LLM Context Management: How to Improve Performance and Lower Costs

https://eval.16x.engineer/blog/llm-context-management-guide
2•paradite•26m ago•0 comments

Heritability Puzzlers

https://dynomight.net/heritable/
3•norswap•27m ago•0 comments

Show HN: Agent51 – npx agent51 get top 5 post titles on Hacker News

https://github.com/aaurelions/agent51
1•aaurelions•29m ago•0 comments