frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Why is ML inference still so ad-hoc in practice?

4•krish678•5h ago
Every place I’ve seen run more than a couple of ML models in production ends up with a mess of bespoke inference services: different APIs, different auth, different logging, half-working dashboards, and tribal knowledge holding it all together.

I’ve been building a small side project that tries to standardize just the serving part — a single gateway in front of heterogeneous models (local, managed cloud, different teams) that handles inference APIs, versioning/rollback, auth, basic metrics, and health checks. No training, no AutoML, no “end-to-end MLOps platform”.

Before I sink more time into it, I’m trying to figure out whether this is:

a real gap people quietly paper over with internal glue, or

something that sounds useful but collapses under real-world constraints.

For people actually running ML in prod:

Do you already have an internal inference layer like this?

Where does inference usually go wrong (deployments, versioning, debugging, compliance)?

At what scale does it stop being worth abstracting at all?

Not announcing anything — genuinely curious whether this resonates or if I’m just rediscovering why everyone rolls their own.

Show HN: Gaming Couch – a local multiplayer party game platform for 8 players

https://gamingcouch.com
302•ChaosOp•5d ago•98 comments

Show HN: Xcc700: Self-hosting mini C compiler for ESP32 (Xtensa) in 700 lines

https://github.com/valdanylchuk/xcc700
3•isitcontent•35m ago•0 comments

Show HN: Mergen – A native, local-first SQL client built with Go and Wails

https://github.com/parevo/mergen
2•parevo•45m ago•0 comments

Show HN: GeneGuessr – a daily biology web puzzle

https://geneguessr.brinedew.bio/
66•brinedew•3d ago•13 comments

Show HN: Lamp Carousel – DIY kinetic sculpture powered by lamp heat (2024)

https://evan.widloski.com/posts/spinners/
88•Evidlo•1d ago•16 comments

Show HN: I built a tool to help small teams automate basic analytical tasks

2•LunarFrost88•5h ago•2 comments

Show HN: Domain Search MCP – AI-powered domain availability checker

https://github.com/dorukardahan/domain-search-mcp
2•dorukardahan•6h ago•2 comments

Show HN: Minimalist editor that lives in browser, stores everything in the URL

https://github.com/antonmedv/textarea
451•medv•1d ago•163 comments

Show HN: Exploring Mathematics with Python

https://coe.psu.ac.th/ad/explore/
255•Andrew2565•6d ago•28 comments

Show HN: I embedded 10M StreetView images

https://view.geospot.sdan.io/
12•sdan•18h ago•3 comments

Show HN: Vibium – Browser automation for AI and humans, by Selenium's creator

https://github.com/VibiumDev/vibium
429•hugs•1d ago•120 comments

Show HN: AI Accel,Tension-based pruning framework(40% sparsity, 1.5-2x speedups)

https://github.com/wwes4/AI_Accel_1.5x
2•wwes369•9h ago•0 comments

Show HN: Why is ML inference still so ad-hoc in practice?

4•krish678•5h ago•0 comments

Show HN: CineCLI – Browse and torrent movies directly from your terminal

https://github.com/eyeblech/cinecli
341•samsep10l•3d ago•106 comments

Show HN: I treated my brain like a buggy server and wrote a patch (Shi-Mo Model)

https://github.com/317317317apple-a11y/shi-mo-protocol/blob/main/README.md
14•ShiMo_Protocol•22h ago•5 comments

Show HN: Why many AI-generated websites don't show up on Google

https://pagesmith.ai/seo-for-ai-generated-sites
11•manu_trustdom•23h ago•5 comments

Show HN: Turn raw HTML into production-ready images for free

https://html2png.dev
149•alvinunreal•2d ago•80 comments

Show HN: A Claude Code plugin that catch destructive Git and filesystem commands

https://github.com/kenryu42/claude-code-safety-net
3•kenryu•12h ago•0 comments

Show HN: Fun sketch – Bring your sketches to life

https://funsketch.kigun.org/
7•mishu2•12h ago•2 comments

Show HN: Jmail – Google Suite for Epstein files

https://www.jmail.world
1549•lukeigel•5d ago•359 comments

Show HN: A local-first, reversible PII scrubber for AI workflows

https://medium.com/@tj.ruesch/a-local-first-reversible-pii-scrubber-for-ai-workflows-using-onnx-a...
37•tjruesch•1d ago•13 comments

Show HN: WebPtoPNG – I built a WebP to PNG tool, everything runs in the browser

https://webptopng.cc/
20•akseli_ukkonen•1d ago•19 comments

Show HN: I built an OCI container runtime in Python(for fun)

https://github.com/Kaleab-Ayenew/puncker-rt
5•kalishayish•20h ago•0 comments

Show HN: Kapso – WhatsApp for developers

https://kapso.ai/
45•aamatte•2d ago•25 comments

Show HN: ReadHn - Reading list for top HN posts

https://www.readhn.top/
10•taabishm2•1d ago•0 comments

Show HN: Frame an web synth for desktop or mobile with hand gesture support

https://oyehoy.net/
2•markrai•16h ago•0 comments

Show HN: Kill List–A local-first PWA where tasks deletes if not done by midnight

https://killlist-production.up.railway.app
3•msldiarra•16h ago•5 comments

Show HN: HN Wrapped 2025 - an LLM reviews your year on HN

https://hn-wrapped.kadoa.com?year=2025
311•hubraumhugo•6d ago•153 comments

Show HN: Microsoft Agent Viewer

https://acs-viewer.pages.dev/
9•ellg•1d ago•0 comments

Show HN: Books mentioned on Hacker News in 2025

https://hackernews-readings-613604506318.us-west1.run.app
608•seinvak•4d ago•212 comments