frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Inference API that adapts to your SLA and quality constraints

https://models.exosphere.host/
6•spacemnstr42069•2h ago
Hi HN, I'm one of the creators of Exosphere. Think of us like a reliability lab for agents.

Today we are launching Exosphere Flex Inference APIs: Inference APIs should adapt to your constraints, not the other way around.

Usually, when you need to run inference at scale, you are forced into rigid boxes:

1. "Real-time" APIs (Expensive, optimized for <1s latency, prone to 429s).

2. "Batch" APIs (Cheaper, but often force 24-hour windows and rigid file formats).

3. "Self-hosted" (Total control, but high ops overhead).

We built a flexible inference engine that sits in the middle. You define the constraints—SLA (time), Cost, and Quality and the system handles the execution.

Here is how it works under the hood:

1. Flexible SLAs (The "Time" Constraint): Instead of just "now" or "tomorrow," you pass an `sla` parameter (e.g., 60 minutes, 4 hours). Our scheduler bins these requests to optimize GPU saturation across our provider mesh. You trade strict immediacy for up to ~70% lower cost.

2. Reliability Layer (The "Ops" Constraint): We abstract away the error handling. If a provider throws a 429 or 503, you shouldn't have to write a retry loop with backoff jitter. Our infrastructure absorbs these failures and retries internally. We guarantee the request eventually succeeds (within your SLA) or we don't charge you.

3. Built-in Quality Gates (The "Accuracy" Constraint): This is the feature I’m most excited about. You can define an "eval" config in the request (using LLM-as-a-Judge or python scripts). If the output doesn't meet your criteria, our system automatically feeds the failure back into the model and retries it. This moves the "validation loop" from your client code into the infrastructure.

I’d love to hear your thoughts on this approach—specifically, does moving the "retry/eval" loop into the API layer simplify your backend, or do you prefer keeping that logic client-side?

Playground: https://models.exosphere.host/

More Details: https://exosphere.host/flex-inference

A Socialist Now Runs New York. Here's What History Predicts

https://zcashexplained.com/blog/new-york-voted-socialist/
2•privacyadvocate•3m ago•0 comments

Norway zips ahead in EV race as car sales hit 96% electric

https://www.reuters.com/sustainability/climate-energy/norways-new-car-sales-were-96-electric-2025...
2•whynotmaybe•3m ago•0 comments

Gitix.ai

https://gitix.ai/
1•azolf•4m ago•0 comments

Show HN: Test Agents Using Fixtures

https://github.com/eoinmurray/incantx
1•anomancer•5m ago•0 comments

Why are weather forecasting sites so bad?

https://blog.engora.com/2025/12/why-is-weather-forecasting-so-bad.html
2•Vermin2000•5m ago•0 comments

Show HN: TymFlow transforms boring time tracking into a rewarding experience

https://tymflow.vercel.app
1•Jenni_emeka•5m ago•0 comments

Foundation, the most advanced responsive front end framework in the world

https://get.foundation
1•Alifatisk•5m ago•0 comments

OfferGridAI – side-by-side comparison of real estate offers from PDFs

https://offergridai.com
1•beechwood•6m ago•1 comments

Why do teams keep explaining the same delays every sprint?

https://smartguess.is/blog/time-in-status-jira-common-mistakes/
1•bjornbrynjar•7m ago•0 comments

Show HN: Travel Safety Data

https://travelsafetydata.com/
2•ohashi•8m ago•1 comments

Las Médulas

https://en.wikipedia.org/wiki/Las_M%C3%A9dulas
1•thunderbong•8m ago•0 comments

Repair a ship’s hull still in the river in -50˚C (2022)

https://eugene.kaspersky.com/2022/04/26/how-to-repair-the-underside-of-a-ships-hull-still-in-the-...
1•aziaziazi•14m ago•0 comments

Show HN: CheerAd – Let your audience support your website with paid messages

https://cheerad.com/pages/how-to-use.html
1•niyoseris•14m ago•0 comments

From Web to Native: Building Native Apps with SvelteKit and Capacitor

https://bryanhogan.com/blog/web-to-app-sveltekit-capacitor
1•bryanhogan•16m ago•0 comments

The Last of Us – Fighting the EU Surveillance Law Apocalypse [video]

https://media.ccc.de/v/39c3-the-last-of-us-fighting-the-eu-surveillance-law-apocalypse
1•DyslexicAtheist•16m ago•0 comments

39th Chaos Communication Congress Videos

https://media.ccc.de/b/congress/2025
2•Jommi•19m ago•0 comments

Hytale Multiplayer Servers

https://hytalemultiplayer.io
1•kaddir•20m ago•1 comments

Europe has 'lost the internet', warns Belgium's cyber security chief

https://www.ft.com/content/854fcad0-0d39-438b-975b-adf9d8b89827
1•firesteelrain•25m ago•1 comments

Free-floating planets – lonely wanderers in the Milky Way

https://en.uw.edu.pl/free-floating-planets-lonely-wanderers-in-the-milky-way/
1•Vedor•27m ago•1 comments

No Excuse for Silence

https://zenodo.org/records/18122479
1•DavidWishengrad•27m ago•2 comments

Show HN: Verifying Rust implementation logic using Lean 4 as a fuzzing oracle

https://github.com/welltyped-systems/verified-ledger
1•xmaruff•27m ago•1 comments

Replace any x.com link with xcancel.com

https://bsky.app/profile/rcsheets.net/post/3mbfwj34mts2h
3•doener•29m ago•1 comments

View source no longer makes a new request

https://masteringlaravel.io/daily/2026-01-01-view-source-no-longer-makes-a-new-request
1•vincent_s•29m ago•0 comments

New subway stations in Naples are a lesson in art and history

https://www.theglobeandmail.com/world/article-naples-metro-station-art-archeology-trains-achille-...
1•pseudolus•29m ago•1 comments

TrenchBroom 2025.4 Release

https://github.com/TrenchBroom/TrenchBroom/releases/tag/v2025.4
2•klaussilveira•31m ago•0 comments

Fat People Do Go to Heaven

https://kathy4u.substack.com/p/fat-people-do-go-to-heaven
2•TheUtleyPost•31m ago•0 comments

Show HN: Siara

https://github.com/torayeff/siara
1•torayeff•31m ago•1 comments

SD-WAN

https://en.wikipedia.org/wiki/SD-WAN
1•tosh•33m ago•0 comments

Show HN: The Dinner Decider – A bracket tournament to decide what to eat

https://www.thedinnerdecider.au/
1•manulkkase•34m ago•0 comments

Show HN: A minimalist LLM plugin for tmux

https://github.com/hynek-urban/tmux-llm
1•Wuzzy•34m ago•0 comments