Show HN: Model-literals, model-aliases, and preference-aligned routing for LLMs

https://docs.archgw.com/guides/llm_router.html
1•honorable_coder•1h ago
Today we’re shipping a major update to ArchGW (an edge and service proxy for agents [1]): a unified router that supports three strategies for directing traffic to LLMs — from explicit model names, to semantic aliases, to dynamic preference-aligned routing. Here’s how each works on its own, and how they come together.

Preference-aligned routing decouples task detection (e.g., code generation, image editing, Q&A) from LLM assignment. This approach captures the preferences developers establish when testing and evaluating LLMs on their domain-specific workflows and tasks. So, rather than relying on an automatic router trained to beat abstract benchmarks like MMLU or MT-Bench, developers can dynamically route requests to the most suitable model based on their internal evaluations, and easily swap out the underlying model for specific actions and workflows. This is powered by our 1.5B Arch-Router LLM [2]. We also recently published our research on this [3].
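
As a rough illustration of that decoupling (not ArchGW's actual API or config), the sketch below keeps task detection and model assignment in two separate places. The route labels, the `keyword_detector` stand-in, and the `ROUTE_TO_MODEL` table are all hypothetical; in practice the detection step would be done by a router model such as Arch-Router rather than keyword matching.

```python
from typing import Callable, Dict

# Hypothetical route policies: a label plus a plain-language description
# that a router model could match a conversation against.
ROUTE_DESCRIPTIONS: Dict[str, str] = {
    "code_generation": "writing or fixing source code",
    "image_editing": "modifying or retouching images",
    "qa": "answering general questions",
}

# Developer preferences: which model handles each route (illustrative names).
ROUTE_TO_MODEL: Dict[str, str] = {
    "code_generation": "anthropic/claude-3-5-sonnet-20241022",
    "image_editing": "openai/gpt-4o",
    "qa": "openai/gpt-4o-mini",
}

DEFAULT_MODEL = "openai/gpt-4o-mini"


def keyword_detector(prompt: str) -> str:
    """Toy stand-in for the router model: returns a route label."""
    text = prompt.lower()
    if "function" in text or "bug" in text:
        return "code_generation"
    if "image" in text or "photo" in text:
        return "image_editing"
    return "qa"


def route(prompt: str, detect: Callable[[str], str] = keyword_detector) -> str:
    """Step 1 detects the task; step 2 looks up the preferred model."""
    label = detect(prompt)
    return ROUTE_TO_MODEL.get(label, DEFAULT_MODEL)
```

Because the label-to-model table is separate from detection, changing which model handles `code_generation` is a one-line edit to `ROUTE_TO_MODEL` and leaves the detector untouched.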

Model-aliases provide semantic, version-controlled names for models. Instead of using provider-specific model names like gpt-4o-mini or claude-3-5-sonnet-20241022 in your client, you can create meaningful aliases like "fast-model" or "arch.summarize.v1". This lets you test new models and swap them out safely in config, without a code-wide search/replace every time you want to use a new model for a specific workflow or task.
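
A minimal sketch of the alias idea, assuming aliases live in a simple name-to-model mapping (the `ALIASES` table and `resolve_alias` helper are hypothetical, not ArchGW's config format). Client code only ever refers to the alias; the mapping decides which concrete model that means.

```python
from typing import Dict, Optional

# Hypothetical alias table; in a real setup this would live in
# version-controlled configuration rather than in code.
ALIASES: Dict[str, str] = {
    "fast-model": "openai/gpt-4o-mini",
    "arch.summarize.v1": "anthropic/claude-3-5-sonnet-20241022",
}


def resolve_alias(name: str, aliases: Optional[Dict[str, str]] = None) -> str:
    """Return the concrete model for an alias, or the name itself if it is not an alias."""
    table = ALIASES if aliases is None else aliases
    return table.get(name, name)
```

Pointing "fast-model" at a different model is then a change to the table only; every caller that asks for "fast-model" picks it up without edits.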

Model-literals (nothing new) let you specify exact provider/model combinations (e.g., openai/gpt-4o, anthropic/claude-3-5-sonnet-20241022), giving you full control and transparency over which model handles each request.
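
For illustration only, a model literal of the form provider/model can be split into its two parts as below. The `ModelLiteral` type and `parse_model_literal` helper are hypothetical names, and the validation rule (exactly one non-empty provider and one non-empty model, split at the first slash) is an assumption rather than ArchGW's documented behavior.

```python
from typing import NamedTuple


class ModelLiteral(NamedTuple):
    provider: str
    model: str


def parse_model_literal(value: str) -> ModelLiteral:
    """Split 'provider/model' at the first slash; reject anything else."""
    provider, sep, model = value.partition("/")
    if not sep or not provider or not model:
        raise ValueError(f"expected 'provider/model', got {value!r}")
    return ModelLiteral(provider, model)
```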

P.S. We routinely get asked why we didn't build semantic/embedding models for routing use cases or use some form of clustering technique. Clustering/embedding routers miss context, negation, and short elliptical queries. An autoregressive approach conditions on the full context, letting the model reason about the task and generate an explicit label that can be matched to an agent, task, or LLM. In practice, this generalizes better to unseen or low-frequency intents and stays robust as conversations drift, without brittle thresholds or post-hoc cluster tuning.

[1] https://github.com/katanemo/archgw
[2] https://huggingface.co/katanemo/Arch-Router-1.5B
[3] https://arxiv.org/abs/2506.16655

As the Far Right Rises, Don't Be Ezra Klein

https://www.currentaffairs.org/news/as-the-far-right-rises-dont-be-ezra-klein
1•optimalsolver•2m ago•0 comments

LLM Routing Strategies

https://martianlantern.github.io//2025/09/llm-routing/
1•martianlantern•2m ago•0 comments

Show HN: Parsing Crusader Kings III data files to generate mods

https://wheybags.com/blog/ck3mods.html
1•wheybags•4m ago•0 comments

iOS and iPadOS 26: The MacStories Review

https://www.macstories.net/stories/ios-and-ipados-26-the-macstories-review/
1•tosh•4m ago•0 comments

NCine Dev Update 22

https://encelo.github.io/2025-09-21-ncine-dev-update-22/
1•encelo•6m ago•1 comments

Dutch disease: How an economic boom turns to doom

https://en.wikipedia.org/wiki/Dutch_disease
1•o999•6m ago•0 comments

Wordoid is shutting down – favorite name generators?

https://wordoid.com/
1•voina•13m ago•0 comments

UK formally recognises Palestinian state

https://www.bbc.com/news/articles/ce800enrglzo
16•novateg•16m ago•6 comments

Trump pushes Justice Department to prosecute his political opponents

https://www.bbc.com/news/articles/c1wgg4vgeedo
7•duxup•17m ago•1 comments

Notes on QuestDB's Design: Architecture and Internals

https://ankitsultana.com/2025/09/12/questdb-braindump.html
1•ankitsultana•17m ago•0 comments

A board member's perspective of the RubyGems controversy

https://apiguy.substack.com/p/a-board-members-perspective-of-the
4•Qwuke•19m ago•0 comments

Science of the Gaps

https://lukesmith.xyz/articles/science-of-the-gaps/
2•snjy7•20m ago•0 comments

Psychedelics Blew His Mind. He Wants Other Philosophers to Open Theirs

https://www.nytimes.com/2025/09/21/books/review/justin-smith-ruiu-on-drugs-philosophy.html
1•cactusplant7374•20m ago•1 comments

Notorious software bug was killing people 40 years ago

https://www.tomshardware.com/software/notorious-software-bug-was-killing-people-40-years-ago-at-l...
4•gpi•20m ago•0 comments

U.S. State Department Introduces $1 Fee for Diversity Visa Lottery Registration

https://www.boundless.com/blog/us-state-department-introduces-1-dollar-fee-for-dv-lottery-registr...
1•hippich•21m ago•0 comments

Montblanc is getting into the digital notepad game

https://www.theverge.com/news/782288/montblanc-is-getting-into-the-digital-notepad-game
3•voisin•24m ago•0 comments

China Road Trip Exposes List of Uninvestable Assets in the West

https://www.bloomberg.com/news/articles/2025-09-21/china-road-trip-exposes-list-of-uninvestable-a...
3•JumpCrisscross•25m ago•1 comments

Show HN: AgentSafe – per-task micro-VM sandbox for AI agents (Go)

https://github.com/Sarthak30/agentsafe
1•sdeshwal•26m ago•0 comments

In Historic Shift, U.K., Australia and Canada Recognize a Palestinian State

https://www.wsj.com/world/middle-east/in-historic-shift-u-k-australia-and-canada-recognize-a-pale...
4•JumpCrisscross•27m ago•0 comments

Emacs GTD flow evolved (2023)

https://daryl.wakatara.com/emacs-gtd-flow-evolved/
1•fjk•28m ago•0 comments

How Political Narratives are distributed on Twitter/X

https://www.mpg.de/25378603/0911-matn-influencers-multipliers-and-the-structure-of-polarization-1...
2•Improvement•33m ago•0 comments

Are smartphones eroding the experience of watching football?

https://www.theguardian.com/football/2025/sep/08/smartphones-eroding-experience-watching-football...
1•PaulHoule•37m ago•0 comments

XAI's Colossus 2 – First Gigawatt Datacenter in the World

https://semianalysis.com/2025/09/16/xais-colossus-2-first-gigawatt-datacenter/
1•rbanffy•38m ago•0 comments

Black Hole Merger: Clearest Evidence yet Einstein, Hawking, Kerr Were Right

https://www.universetoday.com/articles/black-hole-merger-provides-clearest-evidence-yet-that-eins...
2•rbanffy•38m ago•0 comments

Show HN: TypeScript API for India's National Stock Exchange

https://github.com/bshada/nseapi
1•_bshada•38m ago•0 comments

Who is responsible when a chatbot speaks?

https://theconversation.com/suicide-by-chatbot-puts-big-tech-in-the-product-liability-hot-seat-26...
3•rntn•40m ago•1 comments

Storing Unwise Amounts of Data in JavaScript Bigints

https://jonathan-frere.com/posts/bigints-are-cool/
3•Bogdanp•41m ago•0 comments

Show HN: Terminal Calculator in Pure C – Matlab in Your CLI

1•den_dev•45m ago•0 comments

God-washing: Fund fined for misleading clients (2024)

https://www.morningstar.com/news/marketwatch/20240921266/god-washing-fund-fined-for-misleading-cl...
2•rolph•48m ago•0 comments

WebKit Features in Safari 26.0

https://webkit.org/blog/17333/webkit-features-in-safari-26-0/
1•ksec•51m ago•0 comments