Lessons from building an intelligent LLM router

https://github.com/Egham-7/adaptive
1•botirk•1h ago

Comments

botirk•1h ago
We have been experimenting with routing inference across LLMs, and the path has been full of wrong turns.

Our first attempt was to just use a large LLM itself to decide routing. It was too costly and the decisions were unreliable.

Next we tried training a small fine-tuned LLM as a router. It was cheaper, but the outputs were poor and not trustworthy.

Then we wrote heuristics to map prompt types to model IDs. That worked for a while, but it was brittle. Every API change or workload shift broke it.

Eventually we shifted to thinking in terms of model criteria instead of hardcoded model IDs. We benchmarked models across task types, domains, and complexity levels, and made routing decisions based on those profiles.
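To make the idea of routing on criteria rather than model IDs concrete, here is a minimal sketch. Everything in it is an assumption for illustration: the `ModelProfile` fields, the placeholder model names, the benchmark scores, the costs, and the rule that the required quality grows linearly with complexity. It is not the project's actual implementation.

```python
from dataclasses import dataclass


@dataclass
class ModelProfile:
    name: str                   # placeholder label, not a real model ID
    cost_per_1k_tokens: float   # illustrative cost, made up
    scores: dict[str, float]    # task type -> benchmark score in [0, 1], made up


def select_model(profiles: list[ModelProfile], task_type: str, complexity: float) -> ModelProfile:
    """Pick the cheapest model whose benchmark score meets the required quality."""
    # Hypothetical rule: harder prompts demand a higher benchmark score.
    required = 0.5 + 0.4 * complexity
    capable = [p for p in profiles if p.scores.get(task_type, 0.0) >= required]
    if capable:
        return min(capable, key=lambda p: p.cost_per_1k_tokens)
    # No model clears the bar: fall back to the best scorer for this task type.
    return max(profiles, key=lambda p: p.scores.get(task_type, 0.0))


PROFILES = [
    ModelProfile("small-model", 0.1, {"code_generation": 0.70, "summarization": 0.80}),
    ModelProfile("premium-model", 1.0, {"code_generation": 0.92, "summarization": 0.90}),
]

easy = select_model(PROFILES, "code_generation", complexity=0.2)
hard = select_model(PROFILES, "code_generation", complexity=0.9)
print(easy.name, hard.name)
```

Because the router only looks at profiles, swapping a model in or out means editing the profile table, not the routing logic.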

To estimate task type and complexity, we used NVIDIA’s Prompt Task and Complexity Classifier. It classifies prompts into categories like QA, summarization, code generation, and more. It also scores prompts along six dimensions: creativity, reasoning, domain knowledge, contextual knowledge, constraints, and few-shots. From these it produces a weighted overall complexity score.
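A weighted overall score of this kind can be sketched as a plain weighted sum over the six dimensions. The weights and the dictionary keys below are illustrative assumptions, not values taken from this post; check the classifier's model card for the weighting it really uses.

```python
# Illustrative weights (assumed); they sum to 1.0 so the score stays in [0, 1]
# when every dimension is in [0, 1].
WEIGHTS = {
    "creativity": 0.35,
    "reasoning": 0.25,
    "constraints": 0.15,
    "domain_knowledge": 0.15,
    "contextual_knowledge": 0.05,
    "few_shots": 0.05,
}


def complexity_score(dims: dict[str, float]) -> float:
    """Combine per-dimension scores into one weighted complexity score."""
    missing = WEIGHTS.keys() - dims.keys()
    if missing:
        raise ValueError(f"missing dimensions: {sorted(missing)}")
    return sum(WEIGHTS[name] * dims[name] for name in WEIGHTS)


# Hypothetical classifier output for one prompt.
example = {
    "creativity": 0.2,
    "reasoning": 0.8,
    "constraints": 0.5,
    "domain_knowledge": 0.6,
    "contextual_knowledge": 0.1,
    "few_shots": 0.0,
}
print(round(complexity_score(example), 3))
```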

This gave us a structured way to decide when a prompt justified a premium model like Claude Opus 4.1 and when a smaller model like GPT-5-mini would perform just as well.

Now we are working on integrating this with Google’s UniRoute (https://arxiv.org/abs/2502.08773).

Failing to Understand the Exponential, Again

https://www.julian.ac/blog/2025/09/27/failing-to-understand-the-exponential-again/
1•lairv•6m ago•0 comments

QuillSQL – a Rust Relational DB

https://github.com/feichai0017/QuillSQL
1•feichai0017•7m ago•1 comments

Nuclear Fracking: Repeatedly Nuking Yourself for Commercial Reasons [video]

https://www.youtube.com/watch?v=Rsu0lHkOIFg
1•alastairr•7m ago•0 comments

Just using open source isn't radical any more, Europe

https://www.theregister.com/2025/09/26/open_source_in_europe_2025/
1•LorenDB•8m ago•0 comments

Hedge-Fund Stars Are Making So Much Now That They Are Hiring Agents

https://www.wsj.com/finance/investing/hedge-fund-stars-hire-talent-agents-2083cd54
1•bookofjoe•9m ago•1 comments

The stripper who ushered in the subscription-based internet

https://thehustle.co/originals/the-stripper-who-ushered-in-the-modern-subscription-based-internet
1•kansaswriter•10m ago•0 comments

Post by Vada (Windows 11 Forum): SSD Dissappears after waking up from sleep.

https://www.elevenforum.com/t/ssd-dissappears-after-waking-up-from-sleep.40259/
1•sipofwater•10m ago•1 comments

No Evidence of Disease

https://idlewords.com/2012/09/no_evidence_of_disease.htm
1•honzabe•14m ago•0 comments

The Perplexity Search API

https://www.perplexity.ai/hub/blog/introducing-the-perplexity-search-api
1•882542F3884314B•15m ago•0 comments

Show HN: I built a Shopify app that make discounts easy, even on markets

https://apps.shopify.com/fyra-market-sales-and-discounts
1•Fyradev•16m ago•0 comments

The Post-American Order Starts in Riyadh and Islamabad

https://www.bloomberg.com/opinion/articles/2025-09-24/the-post-american-order-starts-in-riyadh-an...
2•nabla9•19m ago•1 comments

ADHD in Adults: The Invisible Rhinoceros

https://pmc.ncbi.nlm.nih.gov/articles/PMC2861517/
2•wonger_•19m ago•0 comments

Impressions of CachyOS

https://www.nickstambaugh.dev/posts/cachyos-impressions
2•sieep•23m ago•0 comments

Mind Maps and the Commonplace Book: The Explorer and the Architect of My Ideas

https://mindthenerd.com/mind-maps-and-the-commonplace-book-the-explorer-and-the-architect-of-my-i...
1•ednite•27m ago•1 comments

Prompt2Tool – 1800 Free AI-Powered Tools in One Platform

https://prompt2tool.com/
1•prompt2tool•30m ago•1 comments

EPA tells some scientists to stop publishing studies

https://www.washingtonpost.com/climate-environment/2025/09/20/epa-scientists-research-publications/
2•geox•35m ago•0 comments

Android will soon run Linux apps better, and that's great for Google's PC plans

https://www.androidauthority.com/android-linux-terminal-gpu-rendering-3601664/
2•sipofwater•36m ago•1 comments

Industry-compatible silicon spin-qubit unit cells exceeding 99% fidelity

https://www.nature.com/articles/s41586-025-09531-9
1•ceolin•37m ago•0 comments

The Rapture Is Happening Right Now!

https://medium.com/luminasticity/the-rapture-is-happening-right-now-7f262539beb8
1•bryanrasmussen•37m ago•0 comments

The current war on science, and who's behind it

https://arstechnica.com/science/2025/09/who-should-we-blame-for-the-current-war-on-science/
1•ZeroGravitas•38m ago•0 comments

Small Data

https://topicpartition.io/definitions/small-data
2•Bogdanp•40m ago•0 comments

3D printing factory to open in Dededo, will produce parts of Navy ships

https://www.guampdn.com/news/3d-printing-factory-to-open-in-dededo-will-produce-parts-of-navy-shi...
1•sipofwater•41m ago•1 comments

Timing Conclusions: GPS, NTP, PTP Timing with Linux

https://scottstuff.net/posts/2025/06/10/timing-conclusions/
1•fanf2•43m ago•0 comments

'Ostrich Effect': Researchers pinpoint the age we start avoiding information

https://medicalxpress.com/news/2025-09-ostrich-effect-age.html
2•pseudolus•43m ago•0 comments

Islands, Airports and the Joy of Being Constrained

https://hknhr.com/constraints.html
1•haakonhr•45m ago•0 comments

A New Wave: From Big Data to Small Data

https://www.fabi.ai/blog/a-new-wave-from-big-data-to-small-data
1•peterdstallion•50m ago•0 comments

Mogan Salon

http://forum.texmacs.cn/t/join-the-mogan-salon/2077
1•amichail•53m ago•0 comments

Designer biobots made from human lung cells

https://engineering.cmu.edu/news-events/news/2025/09/26-ciliabot.html
1•giuliomagnifico•56m ago•0 comments

Autonomous Workflow Agent Architecture

https://agentic-patterns.com/patterns/autonomous-workflow-agent-architecture/
1•nkko•58m ago•0 comments

The untold story of Where is My Train's 100M users

https://newsletter.theindianotes.com/p/the-untold-story-of-where-is-my-trains
1•ahmetd•1h ago•0 comments