frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Always get the best LLM performance for your $?

3•romain_batlle•3h ago
Hey, I built an inference router that literally makes provider of LLM compete in real-time on speed, latency, price to serve each call. So it works on open and closed model, and for closed model price is fixed so provider only “compete” on speed and latency.

Spent quite some time normalizing APIs, handling tool-calls, and managing prompt caching, but the end result sounds very cool: You always get the absolute best value for your \$ at the exact moment of inference.

Currently runs perfectly on a Roo and Cline fork, and on any OpenAI compatible BYOK app (so kind of everywhere)

Feedback very much welcomed! Please tear it apart: [https://makehub.ai](https://makehub.ai/)

Muntz Metal

https://en.wikipedia.org/wiki/Muntz_metal
1•bookofjoe•55s ago•0 comments

The Collapse of the Boston Bruins' 1981 Move to Salem, New Hampshire

https://www.tandfonline.com/doi/full/10.1080/09523367.2025.2486663
1•PaulHoule•58s ago•0 comments

China Solar Additions Surge to Record 93GW in May Ahead of Deadline

https://www.bloomberg.com/news/articles/2025-06-23/china-solar-additions-surge-to-record-in-may-ahead-of-deadline
1•toomuchtodo•3m ago•0 comments

At Antarctica's midwinter, a look back at continent's history of dark behavior

https://theconversation.com/at-antarcticas-midwinter-a-look-back-at-the-frozen-continents-long-history-of-dark-behavior-253906
1•austinallegro•3m ago•0 comments

Show HN: TNX API – Natural Language Interactions with Your Database

https://www.tnxapi.com/UI/login.php
1•Marten42•5m ago•0 comments

Iran cyberattacks against US biz more likely following air strikes

https://www.theregister.com/2025/06/23/iran_cyberattacks_against_us/
1•rntn•5m ago•0 comments

All Hail the Slop Bowl, Lunch of Our Ancestors

https://www.atlasobscura.com/articles/all-hail-the-slop-bowl
1•ecliptik•6m ago•0 comments

Moonbase Alpha: That time NASA made a meme video game

https://www.spacebar.news/moonbase-alpha-nasa-video-game/
3•todsacerdoti•10m ago•0 comments

Compass Sues to Stop 'Zillow Ban'

https://www.nytimes.com/2025/06/23/realestate/compass-zillow-lawsuit.html
2•randycupertino•11m ago•2 comments

First celestial image unveiled from revolutionary telescope

https://www.bbc.com/news/articles/cj3rmjjgx6xo
1•wood_spirit•11m ago•0 comments

Show HN: OVR, a framework for streaming HTML with AsyncGenerator JSX

https://github.com/rossrobino/ovr
1•robinoross•15m ago•0 comments

Agentic AI Hands-On in Python: MCP, CrewAI and OpenAI Agents SDK

https://www.youtube.com/watch?v=LSk5KaEGVk4
2•jonkrohn•18m ago•0 comments

Crunch time–we'll soon find out if Amazon's launch providers are up to the job

https://arstechnica.com/space/2025/06/crunch-time-well-soon-find-out-if-amazons-launch-providers-are-up-to-the-job/
2•LorenDB•18m ago•0 comments

Introduction to Mechanistic Interpretability

https://www.marioraach.de/blog/mechanistic-interpretability-1
1•mario1870•19m ago•0 comments

Run High-Performance LLM Inference Kernels from Nvidia Using FlashInfer

https://developer.nvidia.com/blog/run-high-performance-llm-inference-kernels-from-nvidia-using-flashinfer/
1•mfiguiere•19m ago•0 comments

Show HN: 11.ai – Talk to Hacker News with your voice (reads comments)

https://11.ai
3•louisjoejordan•20m ago•0 comments

Show HN: Chisel – GPU development through MCP

https://github.com/Herdora/chisel
1•technoabsurdist•21m ago•0 comments

Resurrecting flip phone typing as a Linux driver

https://github.com/FoxMoss/libt9
9•foxmoss•21m ago•1 comments

Vera C. Rubin Observatory First Look: Trifid and Lagoon Nebulae

https://rubinobservatory.org/news/rubin-first-look/trifid-lagoon
2•LorenDB•21m ago•0 comments

The AI Paradox

https://hugston.com/articles/The_AI_Paradox
1•trilogic•22m ago•0 comments

Agents for the Agent

https://ampcode.com/agents-for-the-agent
1•yomismoaqui•22m ago•0 comments

Creating a web based timezone-aware clock without any JavaScript

https://lazy-guy.github.io/blog/clock/
1•thunderbong•24m ago•0 comments

Novel About Selling Your Vision, Raising Venture, and Launching Your Startup

https://feld.com/archives/2025/06/book-fever-pitch-a-novel-about-selling-your-vision-raising-venture-capital-and-launching-your-startup/
1•rmason•24m ago•0 comments

Is "MIT Software License but No AI" Possible?

1•derwoojer•24m ago•2 comments

Levered beta is all you need

https://textql.notion.site/levered-beta-is-all-you-need-20ba769a508880388186ef0c2fa11389
3•mollynpaan•26m ago•0 comments

Thread_pool_hybrid – a faster more scalable MySQL connection handler

https://github.com/Damienkatz/thread_pool_hybrid
2•damienkatz•26m ago•1 comments

Masquerade MCP – the privacy firewall for Claude

https://github.com/postralai/masquerade
2•Sam_Eyob•30m ago•1 comments

Data visualization is a product of NSF–DOE Vera C. Rubin Observatory

https://skyviewer.app/explorer
2•perihelions•30m ago•0 comments

Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

https://arxiv.org/abs/2506.17218
2•badmonster•30m ago•0 comments

Action Potentials for June

https://neurobiology.substack.com/p/action-potentials-for-june-058
2•paulpauper•30m ago•0 comments