frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Slicing an 80B MoE LLM into 40B domain specialists

https://github.com/JThomas-CoE/College-of-Experts-AI/tree/main/CoE-Demo-v1.5
3•JThomas-CoE•1h ago

Comments

JThomas-CoE•1h ago
I wanted to share a proof-of-concept from the College of Experts project on the separability of machine intelligence.

We hypothesized that domain knowledge in monolithic Mixture-of-Experts models is not holographically entangled across all routing layers, but physically separable. Using histographic activation profiling across 10 coding languages, we surgically extracted the 256 experts responsible for Python from Qwen3-Coder-Next-80B and separately extracted the 256 experts responsible for Web/Frontend logic. We used a bias activation function across the 48 layers which modified the expert ranking and selected experts up to the expert budget of 256 per layer.

The resulting 40B Python Specialist retains a 93% score on HumanEval (compared to the 80B model's 94%), despite losing half its parameters. Conversely, the 40B Web Specialist retains near-perfect UI generation capabilities while completely losing the ability to emit raw Python logic. Note that this was achieved strictly via weight-slicing the unmodified .gguf file, with zero post-surgery fine-tuning.

The repo linked above contains Demo v1.5, which uses a fast ONNX supervisor (DML/CUDA) to hot-swap these massive 40B lobes via Ollama, allowing 80B-class MoE routing on consumer hardware (29GB VRAM footprint).

Below are the relevant links:

Whitepaper (PDF): [https://github.com/JThomas-CoE/College-of-Experts-AI/blob/ma...] The Extracted Models: [https://huggingface.co/JThomas-CoE/CoE-WEB2-40b-A3b-GGUF] We are currently preparing to decompose the new Qwen3.5-35B model into a full 10-domain suite. I would love to hear feedback on the layer-slicing methodology or the architectural implications of hosting the routing supervisor outside of the LLM inference engine.

Practical techniques for issue resolution with agentic AI

https://blog.scottlogic.com/2026/03/05/analysis-implementation-reflection-practical-techniques.html
1•oriondean•37s ago•0 comments

I just released PluriSnake, a new kind of snake puzzle game. [macOS/iOS/iPadOS]

https://apps.apple.com/us/app/plurisnake/id6756577045
1•amichail•55s ago•1 comments

Halfway on the path to community support for free-threaded Python

https://labs.quansight.org/blog/free-threaded-python-halfway
1•lumpa•1m ago•0 comments

Britain is ejecting hereditary nobles from Parliament after 700 years

https://apnews.com/article/uk-house-of-lords-hereditary-peers-expelled-535df8781dd01e8970acda1dca...
2•divbzero•2m ago•0 comments

Meta patented an AI that lets you keep posting from beyond the grave

https://www.businessinsider.com/meta-granted-patent-for-ai-llm-bot-dead-paused-accounts-2026-2
2•JumpCrisscross•2m ago•0 comments

Show HN: HDC-based function caller ranks #2 on BFCL V4 – $2.08 vs. Opus at $87

https://github.com/glyphh-ai/model-bfcl
1•timmetime•2m ago•0 comments

SuperPowers: Agentic skills framework that works

https://github.com/obra/superpowers
1•danebalia•2m ago•1 comments

Show HN: An offline-first expense tracker on Cloudflare D1 and SQLite WASM

https://github.com/momentmaker/pancakemaker
1•momentmaker•3m ago•0 comments

Show HN: DiscoVox – Free audiobooks with synchronized text for language learning

https://discovox.org/en/library
1•floo•5m ago•0 comments

A CLI wrapper for making Kubernetes commands much easier

https://github.com/alaminopu/kctl
1•alaminopu•8m ago•0 comments

Cryogenic transmission electron microscopy reveals nanostructure of PEDOT:PSS

https://www.nature.com/articles/s41467-026-68890-7
1•PaulHoule•9m ago•0 comments

The AI Is the Computer

https://www.perplexity.ai/hub/blog/the-ai-is-the-computer
1•jonbaer•10m ago•0 comments

Changing the Economics of Quality with Claude Code-Generated User Stories

https://www.brethorsting.com/blog/2026/03/changing-the-economics-of-quality-with-claude-code-gene...
1•aaronbrethorst•10m ago•0 comments

FIDES: End-to-end Compartments for Mixed-language Systems [pdf]

https://kcsrk.info/papers/fides_asiaccs_2026.pdf
1•matt_d•10m ago•0 comments

Many SWE-bench-Passing PRs would not be merged

https://metr.org/notes/2026-03-10-many-swe-bench-passing-prs-would-not-be-merged-into-main/
1•mustaphah•11m ago•0 comments

Ask HN: What are the best product landing pages you've stumbled upon?

1•chistev•12m ago•0 comments

AITutor – vimtutor, but for AI-assisted coding

1•thehecticbyte•13m ago•0 comments

The Anthropic Institute

https://www.anthropic.com/news/the-anthropic-institute
2•mmaia•15m ago•0 comments

Kavka's Toxin Puzzle

https://en.wikipedia.org/wiki/Kavka%27s_toxin_puzzle
1•rzk•17m ago•0 comments

Harry Potter by Balenciaga (2026) [video]

https://www.youtube.com/watch?v=gtnt84CDP-s
1•GeoAtreides•18m ago•0 comments

I'm struggling and I don't have anyone else to share this with except you

4•owlcompliance•18m ago•2 comments

Queen's Wish: A Portmortem of Mixed Success

https://bottomfeeder.substack.com/p/queens-wish-a-portmortem-of-mixed
1•Tomte•18m ago•1 comments

The Agency: Meticulously crafted AI agent personalities

https://github.com/msitarzewski/agency-agents
1•danebalia•20m ago•1 comments

Practical Type Inference: High‑Throughput Recovery of Real‑World Types

https://arxiv.org/abs/2603.08225
1•matt_d•20m ago•0 comments

New 'negative light' technology hides data transfers in plain sight

https://www.unsw.edu.au/newsroom/news/2026/03/New-negative-light-technology-hides-data-transfers-...
4•wjSgoWPm5bWAhXB•21m ago•0 comments

'a window into the past': The homes revealing how Tudor people lived

https://www.bbc.com/culture/article/20260309-the-homes-revealing-how-tudor-people-really-lived
1•makaimc•22m ago•0 comments

Rabbit r1 with whatever model you want

https://github.com/ShayneP/rabbit-r1-livekit-skill
2•ShayneP•24m ago•1 comments

Request Copilot code review from GitHub CLI

https://github.blog/changelog/2026-03-11-request-copilot-code-review-from-github-cli/
2•danebalia•25m ago•1 comments

Microsoft brings new "Xbox mode" to Windows 11 PCs next month

https://www.windowscentral.com/microsoft/windows-11/windows-11-xbox-mode-announcement-gdc-2026-pr...
1•nikodunk•25m ago•0 comments

WordPress/PHP-AI-client: provider agnostic PHP client SDK to communicate with AI

https://github.com/WordPress/php-ai-client
1•ulrischa•25m ago•0 comments