Hypermode Model Router Preview – OpenRouter Alternative

https://hypermode.com/blog/introducing-model-router
33•iamtherhino•6mo ago

Comments

jbellis•6mo ago
What I'm seeing with Brokk (https://brokk.ai) is that models are not really interchangeable for code authoring. Even with frontier models like Gemini Pro 2.5 (GP2.5) and Sonnet 3.7, Sonnet is significantly better at following instructions ("don't add redundant comments"), while GP2.5 has more raw intelligence. So we're using litellm to create a unified API to consume, but the premise of "route your requests to whatever model is responding fastest" doesn't seem that attractive.

But OpenRouter is ridiculously popular so it must be very useful for other use cases!

johnymontana•6mo ago
I think the value here is being able to have a unified API to access hosted open source models and proprietary models. And then being able to switch between models without changing any code. Model optionality was one of the factors Hypermode called out in the 12 Factor Agentic App: https://hypermode.com/blog/the-twelve-factor-agentic-app

Also, being able to use models from multiple services and open source models without signing up for another service / bring your own API key is a big accelerator for folks getting started with Hypermode agents.
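
A minimal sketch of what "switching models without changing any code" could look like, assuming an OpenAI-compatible chat completions endpoint like the one in the Python sample further down this thread. The `HYP_MODEL` and `HYP_WKS_KEY` environment variable names and the `build_chat_request` helper are made up for illustration and are not part of any Hypermode API:

```python
import os

# Endpoint copied from the Python sample later in this thread.
ROUTER_URL = "https://models.hypermode.host/v1/chat/completions"


def build_chat_request(prompt, model=None, api_key=None):
    """Return (url, headers, payload) for one chat completion request.

    The model name is read from the HYP_MODEL environment variable
    (a hypothetical name), so swapping models is a config change
    rather than a code change.
    """
    model = model or os.environ.get(
        "HYP_MODEL", "meta-llama/llama-4-scout-17b-16e-instruct"
    )
    api_key = api_key or os.environ.get("HYP_WKS_KEY", "<YOUR_HYP_WKS_KEY>")
    headers = {"Authorization": f"Bearer {api_key}"}
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return ROUTER_URL, headers, payload
```

The returned tuple can be passed straight to `requests.post(url, headers=headers, json=payload)`.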

iamtherhino•6mo ago
Hey! Co-founder of Hypermode here.

Agreed that swapping models for code-gen doesn't make sense. We're mostly indexed on GPT-4.1 for our AgentBuilder product. I haven't found moving between models for code to be easy or super effective.

The most popular use case we've seen from folks is on the iteration/experimentation phase of building an agent/tool. We made ModelRouter originally as an internal service for our "prompt to agent" product, where folks are trying a few dozen models/MCPs/tools/data/etc really quickly as they try to find a local maximum for some automation or job.

0xDEAFBEAD•6mo ago
Are there any of these tools which will use your evals to automatically recommend a model to use? Imagine if you didn't need to follow model releases anymore, and you just had a heuristic that would automatically select the right price/performance tradeoff. Maybe there's even a way to route queries differently to more expensive models depending on how tricky they are.

(This would be more for using models at scale in production as opposed to individual use for code authoring etc.)

jbellis•6mo ago
Yeah, that seems possible, but a dumb preprocessing step won't help and a smart one will add significant latency.

Feels a bit halting-problem-ish: can you tell if a problem is too hard for model A without being smarter than model A yourself?

0xDEAFBEAD•6mo ago
I imagine if your volume is high enough it could be worthwhile to at least check to see if simple preprocessing gets you anywhere.

Basically compare model performance on a bunch of problems, and see if the queries which actually require an expensive model have anything in common (e.g. low Flesch-Kincaid readability, or a bag-of-words approach which tries to detect the frequency of subordinate clauses/potentially ambiguous pronouns, or word rarity, or whatever).

Maybe my knowledge of old-school NLP methods is useful after all :-) Generally those methods tend to be far less compute-intensive. If you wanted to go really crazy on performance, you might even use a Bloom filter to do fast, imprecise counting of words of various types.
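
A rough sketch of the Bloom filter idea, assuming "words of various types" means membership in some precomputed word list. The word list, the filter sizes, and the `count_flagged_words` helper are all invented for illustration. The count can be slightly too high (false positives) but never too low:

```python
import hashlib


class BloomFilter:
    """Fixed-size Bloom filter: no false negatives, occasional false positives."""

    def __init__(self, num_bits=8192, num_hashes=4):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits // 8 + 1)

    def _positions(self, word):
        # Derive one bit position per hash from salted BLAKE2b digests.
        for i in range(self.num_hashes):
            digest = hashlib.blake2b(
                word.encode("utf-8"), digest_size=8, salt=i.to_bytes(2, "little")
            ).digest()
            yield int.from_bytes(digest, "little") % self.num_bits

    def add(self, word):
        for pos in self._positions(word):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, word):
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(word)
        )


def count_flagged_words(query, bloom):
    """Approximate count of query tokens that belong to the flagged set."""
    return sum(1 for token in query.lower().split() if token in bloom)


# Hypothetical word list; a real one would be derived from eval data.
subordinators = BloomFilter()
for w in ["although", "whereas", "unless", "because", "whichever"]:
    subordinators.add(w)
```

For a handful of words a plain `set` would do just as well. The filter only starts to pay off when the word list is large enough that its memory footprint matters.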

Then you could add some old-school, compute-lite ML, like an ordinary linear regression on the old-school-NLP-derived features.
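
Here is one way the feature-plus-regression idea might be sketched with only the standard library. Everything specific is an assumption: the two features (words per sentence and average word length, a crude stand-in for the syllable counts Flesch-Kincaid uses), the five made-up training examples, the 0.5 threshold, and the tier names. A tiny ridge term is added so the normal equations stay solvable, which makes this very slightly different from textbook ordinary least squares:

```python
import re


def features(query):
    """Cheap text features: [bias, words per sentence, average word length]."""
    sentences = [s for s in re.split(r"[.!?]+", query) if s.strip()]
    words = re.findall(r"[A-Za-z']+", query)
    if not words:
        return [1.0, 0.0, 0.0]
    return [
        1.0,
        len(words) / max(len(sentences), 1),
        sum(len(w) for w in words) / len(words),
    ]


def solve(a, b):
    """Solve a.x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit(xs, ys, ridge=1e-6):
    """Least squares via the normal equations (X^T X + ridge*I) w = X^T y."""
    k = len(xs[0])
    xtx = [
        [sum(x[i] * x[j] for x in xs) + (ridge if i == j else 0.0) for j in range(k)]
        for i in range(k)
    ]
    xty = [sum(x[i] * y for x, y in zip(xs, ys)) for i in range(k)]
    return solve(xtx, xty)


# Made-up eval results: 1.0 means the cheap model failed on this query.
training = [
    ("What is Dgraph?", 0.0),
    ("Summarize this note.", 0.0),
    ("Translate hello to French.", 0.0),
    ("Reconcile the contradictory indemnification provisions, notwithstanding "
     "the aforementioned jurisdictional limitations, and enumerate consequences.", 1.0),
    ("Characterize the asymptotic convergence behaviour of the stochastic "
     "approximation procedure under heteroscedastic perturbations.", 1.0),
]
weights = fit([features(q) for q, _ in training], [y for _, y in training])


def route(query, threshold=0.5):
    """Pick a model tier from the predicted difficulty; tier names are placeholders."""
    score = sum(w * f for w, f in zip(weights, features(query)))
    return "expensive-model" if score > threshold else "cheap-model"
```

Five examples prove nothing about real workloads. The point is only that the whole pipeline is a few dot products per query, so it adds essentially no latency.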

Really the win would be for a company like Hypermode to implement this automatically for customers who want it (high volume customers who don't mind saving money).

Actually, a company like Hypermode might be uniquely well-positioned to offer this service to smaller customers as well, if query difficulty heuristics generalize well across different workloads. Assuming they have access to data for a large variety of customers, they could look for heuristics that generalize well.

iamtherhino•6mo ago
I really like this approach.

I think there's a big advantage to be had for folks bringing "old school" ML approaches to LLMs. We've been spending a lot of time looking at the expert systems from the 90s.

Another one we've been looking at is applying some query planning approaches to these systems to see if we can pull responses from cache instead of invoking the model again.

Obviously there's a lot of complexity to identifying where we could apply some smaller ML models or cache-- but it's been a really fun exploration.
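
For the caching half of this, the simplest possible starting point is probably an exact-match cache keyed on the model name plus a normalized prompt. This is far cruder than the query-planning approach described above and is not a description of what Hypermode built. The `ResponseCache` class and the `fake_model` stand-in are invented so the sketch runs offline:

```python
import hashlib


class ResponseCache:
    """Exact-match cache keyed on the model name and a normalized prompt."""

    def __init__(self, call_model):
        self._call_model = call_model  # any callable: (model, prompt) -> str
        self._store = {}
        self.hits = 0

    @staticmethod
    def _key(model, prompt):
        # Lowercase and collapse whitespace so trivial variations share a key.
        normalized = " ".join(prompt.lower().split())
        return hashlib.sha256(f"{model}\n{normalized}".encode("utf-8")).hexdigest()

    def complete(self, model, prompt):
        key = self._key(model, prompt)
        if key in self._store:
            self.hits += 1
            return self._store[key]
        response = self._call_model(model, prompt)
        self._store[key] = response
        return response


# Stand-in for a real inference call.
calls = []


def fake_model(model, prompt):
    calls.append(prompt)
    return f"[{model}] answer to: {prompt.strip()}"


cache = ResponseCache(fake_model)
first = cache.complete("model-a", "What is Dgraph?")
second = cache.complete("model-a", "  what is   DGRAPH? ")  # served from cache
```

Serving a stored answer like this only makes sense when repeated prompts are expected to produce the same answer, for example with deterministic sampling settings.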

0xDEAFBEAD•6mo ago
>We've been spending a lot of time looking at the expert systems from the 90s.

No way. I would definitely be curious to hear more if you want to share.

iamtherhino•6mo ago
We've been playing with that in the background. I can try to shoot you a preview in a few weeks. It works pretty well for reasoning tasks/NLP workloads but for workloads that need a "correct" answer, it's really tough to maintain accuracy when swapping models.

What we've seen work best is making recommendations during agent creation for a given tool/workload and then leaving them somewhat static after creation.

0xDEAFBEAD•6mo ago
That's fair. Maybe you could even send the user an email if you detect a new model release or pricing change which handles their workload for cheaper at comparable quality, to notify them to investigate.

iamtherhino•6mo ago
That's a good idea-- then give them a link to "replay last X inferences with model ABC" so they can do a quick eyeball eval.

0xDEAFBEAD•6mo ago
Sweet, maybe you'll like my other idea in this thread too: https://news.ycombinator.com/item?id=43929194

threeducks•6mo ago
The Python API example looks like it has been written by an LLM. You don't need to import json, you don't need to set the content type and it is good practice to use context managers ("with" statement) to release the connection in case of exceptions. Also, you don't gain anything by commenting variables with the name of the variable.

The following sample (probably) does the same thing and is almost half as long. I have not tested it because there is no signup (EDIT: I was mistaken, there actually is a "signup" behind the login link, which is Google or GitHub login, so the naming makes sense. I confused it with a previously more prominent waitlist link.)

    import requests

    # Your Hypermode Workspace API key
    api_key = "<YOUR_HYP_WKS_KEY>"

    # Use the Hypermode Model Router API endpoint
    url = "https://models.hypermode.host/v1/chat/completions"

    headers = {"Authorization": f"Bearer {api_key}"}

    payload = {
        "model": "meta-llama/llama-4-scout-17b-16e-instruct",
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "What is Dgraph?"},
        ],
        "max_tokens": 150,
        "temperature": 0.7,
    }

    # Make the API request
    with requests.post(url, headers=headers, json=payload) as response:
        response.raise_for_status()
        print(response.json()["choices"][0]["message"]["content"])

iamtherhino•6mo ago
Signups are open: hypermode.com/sign-up

There's a waitlist for our prompt to agent product in the banner. That's a good call to update it to be more clear.

threeducks•6mo ago
Oh, I did not catch that. Sorry!

iamtherhino•6mo ago
Not at all! I'm updating the banner now

iamtherhino•6mo ago
updated our python example too!

KTibow•6mo ago
`post` automatically releases the connection. `with` only makes sense when you use a `requests.Session()`.

threeducks•6mo ago
You are right! https://github.com/psf/requests/blob/c65c780849563c891f35ffc...

The post function calls the request function, which uses its own context manager that will call the close function of the connection object.
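
To make the distinction concrete, here is a sketch of the case where `with` does earn its keep: several requests sharing one `requests.Session()`. The endpoint and response shape are copied from the sample earlier in the thread. The `ask_many` helper and the 30-second timeout are assumptions added for illustration:

```python
import requests

# Endpoint copied from the sample earlier in this thread.
URL = "https://models.hypermode.host/v1/chat/completions"


def ask_many(prompts, api_key, model="meta-llama/llama-4-scout-17b-16e-instruct"):
    """Send several prompts over one pooled connection and collect the replies."""
    answers = []
    # The Session owns the connection pool; leaving the block closes it,
    # even if one of the requests raises.
    with requests.Session() as session:
        session.headers["Authorization"] = f"Bearer {api_key}"
        for prompt in prompts:
            payload = {
                "model": model,
                "messages": [{"role": "user", "content": prompt}],
            }
            response = session.post(URL, json=payload, timeout=30)
            response.raise_for_status()
            answers.append(response.json()["choices"][0]["message"]["content"])
    return answers
```

For a single call, a plain `requests.post(...)` without `with` is enough, as noted above.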

hobo_mark•6mo ago
Is there something like OpenRouter, but for text-to-speech models?

iamtherhino•6mo ago
I haven't seen one yet-- no reason we couldn't do that with Hypermode. I'll do some exploration!

maxbendick•6mo ago
The logo is fairly evocative of the SS insignia.

To explain in the clearest terms: unlike the SS insignia, the lightning bolt in the logo has tapering at the bottom. The second element in the logo, the slash, does not have tapering at the bottom. The general shape of the logo is the same as the SS insignia: two diagonal elements side-by-side (which would be all good on its own). The mind tends to see repetition, so it has a tendency to "mix up" the two elements of the logo. The mind also has a tendency to remember similar things. Putting it all together, the logo has a chance to evoke the SS insignia.

I may just be reading too much Theweleit and W. Reich nowadays, but I think you'll catch some flak for this logo if it becomes recognizable outside the tech milieu.

iamtherhino•6mo ago
Thanks for the feedback-- I can say emphatically, that's not our intention in the least. We chose a lightning bolt to evoke speed, i.e., the "hyper" in Hypermode. I've asked design to take another look at the "H" logo.

maxbendick•6mo ago
Thanks so much for replying. I didn't think it was your intention at all.
