frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: LLMRouter – Stop using GPT-4/o1 for everything (16 routing strategies)

https://github.com/ulab-uiuc/LLMRouter
2•tao2024•2h ago

Comments

tao2024•2h ago
OP here. I'm a CS PhD student at UIUC working on User Modeling and Applied ML.

We built LLMRouter because we noticed a gap in the current LLM stack: everyone knows we shouldn't route every query to GPT-4/o1 (it's slow and expensive), but building a reliable router that handles context, reasoning, and user history is surprisingly hard.

Most existing solutions are either simple regex/keyword matching or closed-source APIs. We wanted to build a standard, open-source library that unifies the SOTA.

What LLMRouter actually does: It provides a unified interface to 16+ routing strategies, ranging from lightweight ML to heavy reasoning agents:

Single-Round: Classification-based (KNN, SVM, BERT) and Embedding-based methods.

Multi-Round & Agentic: Routers that "think" before assigning models (CoT reasoning) or break down tasks step-by-step.

Personalized Routing: This is a key focus of our research. The router learns from user interaction history to fit individual preferences (e.g., some users prefer concise answers from faster models, others need detailed reasoning).

The Pipeline: We didn't just ship the model weights. The library includes:

Data Generation: A pipeline to generate synthetic routing data for your specific domain.

Benchmarks: 11 datasets to evaluate router performance.

Deployment: A CLI and Gradio UI to visualize routing decisions in real-time.

In our experiments, we typically see 30–50% cost reduction while maintaining response quality by correctly identifying easy vs. hard queries.

The code is open source (MIT/Apache): https://github.com/ulab-uiuc/LLMRouter

Happy to answer any questions about the implementation details or the specific RL/Ranking algorithms we used!

I built a receipt printer for GitHub issues

https://aschmelyun.com/blog/i-built-a-receipt-printer-for-github-issues/
1•itzlambda•5m ago•1 comments

Documentation for Developers

https://leaddev.com/communication/build-documentation-developers-actually-navigate
1•shehabas•5m ago•0 comments

The ARR Illusion in the Age of AI

https://oswarld.com/eng/insight/250816_ai-arr-illusion-gmv-vs-arr
1•haebom•6m ago•0 comments

Show HN: Magic Input – Use your iPhone as a keyboard and trackpad for your Mac

1•willswire•6m ago•0 comments

Asahi Linux M1 DisplayPort working during CCC #39c3

https://github.com/AsahiLinux/linux/tree/fairydust
3•heredoc•10m ago•1 comments

AI-generated content in Wikipedia – a tale of caution [video]

https://media.ccc.de/v/39c3-ai-generated-content-in-wikipedia-a-tale-of-caution
1•vinni2•11m ago•0 comments

Show HN: Simple Chrome extension to play focus music

https://chromewebstore.google.com/detail/focus-music/bnecaegenddgoleofplogafikcdkckkm
1•404softwarelabs•13m ago•0 comments

My 2025 review as an indie dev

https://xenodium.com/my-2025-review-as-an-indie-dev
1•xenodium•14m ago•0 comments

Sora will make social media creators 'far, far, far less valuable'

https://www.businessinsider.com/lightspeed-partner-sora-creators-far-less-valuable-2025-12
1•bookofjoe•14m ago•0 comments

An Anti-A.I. Movement Is Coming. Which Party Will Lead It?

https://www.nytimes.com/2025/12/29/opinion/ai-democracy.html
2•voxleone•14m ago•1 comments

Show HN: A simple, free Hacker News reader for iOS

https://apps.apple.com/us/app/hacker-news-reader-app/id6502296871
1•togido•15m ago•0 comments

Want to Learn about Timepieces

1•ClaudeGustav2•16m ago•0 comments

When the lights went out, and the shooting started, Y2K [felt] all too real

https://www.theregister.com/2025/12/29/on_call/
1•dijksterhuis•16m ago•0 comments

Show HN: A Simple Geometric Constraint Solver

https://github.com/kasznar/geometric-constraint-solver
2•kasznar•21m ago•0 comments

Show HN: Ornix – Zero-setup folder organizer for macOS

https://kolee.kr/apps/kolee-ornix
1•yahoai•21m ago•1 comments

Happy 16th Birthday, Krebsonsecurity.com

https://krebsonsecurity.com/2025/12/happy-16th-birthday-krebsonsecurity-com/
2•feross•24m ago•0 comments

Why AV1 is not used more broadly (2023)

https://old.reddit.com/r/AV1/comments/17314ik/comment/k44x4lj/
3•tosh•24m ago•0 comments

The processes behind making the combs and cylinders at Reuge

https://www.thenakedwatchmaker.com/making-reugecombs
1•ClaudeGustav2•25m ago•0 comments

AI-Powered (SaaS, App, etc.) Idea Validation System

https://github.com/kzeitar/idea-sieve
1•khalidzeiter•28m ago•1 comments

The New Billionaires of the A.I. Boom

https://www.nytimes.com/2025/12/29/technology/new-billionaires-ai-boom.html
1•thm•32m ago•0 comments

Why Vxlan?

https://goyalankit.com/blog/note-on-vxlan
2•goyalankit•35m ago•0 comments

Wired Magazine Got Hacked

https://www.facebook.com/ethical.hack.group/posts/wired-magazine-got-hacked-23-million-subscriber...
2•nomilk•40m ago•0 comments

Non-Zero-Sum Games

https://nonzerosum.games/
22•8organicbits•43m ago•0 comments

Wormhole send and receive die with SIGILL

https://github.com/magic-wormhole/magic-wormhole/issues/693
1•hggh•45m ago•0 comments

I was tired of FFmpeg, so I made FFmpeg for humans

1•alpbak•47m ago•1 comments

Ask HN: Does enterprise GenAI adoption come from constraint, not intelligence?

1•genum_Lab•53m ago•0 comments

Towards an Algebraic Theory of Context-Free Languages (1996) [pdf]

http://www-igm.univ-mlv.fr/~berstel/Articles/1996AlgebraicTheory.pdf
1•aebtebeten•58m ago•0 comments

Are We Compressed Yet?

https://github.com/xiph/awcy
2•tosh•1h ago•0 comments

The Ascent of the AI Therapist

https://www.technologyreview.com/2025/12/30/1129392/book-reviews-ai-therapy-mental-health/
1•fleahunter•1h ago•0 comments

My Three-Day Retreat in Total Darkness

https://www.nytimes.com/2025/10/21/magazine/dark-retreat-meditation-sensory-deprivation-spiritual...
2•bookofjoe•1h ago•1 comments