frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Client-side GPU load balancing with Redis and Lua

https://galileo.ai/blog/how-we-boosted-gpu-utilization-by-40-with-redis-lua
7•lneiman•5d ago

Comments

lneiman•5d ago
Author here. We were hitting tail latency and low GPU utilization issues serving SLMs via Triton.

I built a scrappy client-side router using Redis and Lua to track real-time GPU load. It boosted utilization by ~40% and improved latencies.

Happy to hear feedback on the implementation or thoughts on better ways to do this!

pbrumm•1d ago
Have you tried switching it to a job queue where the GPU instances try to keep themselves busy. That way you can auto scale the gpus based on utilization. I find it easier to tune and you can monitor latency and backlogs easier. It does require some async mechanisms to the client but I have found it easier to maintain

12 Days of Shell

https://12days.cmdchallenge.com
83•zoidb•2h ago•23 comments

Show HN: Web app that lets you send email time capsules

https://resurf.me
13•walrussama•45m ago•3 comments

GitHub Actions Has a Package Manager, and It Might Be the Worst

https://nesbitt.io/2025/12/06/github-actions-package-manager.html
164•robin_reala•4h ago•88 comments

Jujutsu Worktrees Are Convenient

https://shaddy.dev/notes/jj-worktrees/
48•nvader•4d ago•28 comments

Turtletoy

https://turtletoy.net/
218•ustad•4d ago•41 comments

Emacs is my new window manager

https://www.howardism.org/Technical/Emacs/new-window-manager.html
116•gpi•2d ago•41 comments

Nango (YC W23) is hiring back-end engineers and dev-rels (remote)

https://jobs.ashbyhq.com/Nango
1•bastienbeurier•21m ago

Damn Small Linux

https://www.damnsmalllinux.org/
123•grubbs•10h ago•33 comments

I failed to recreate the 1996 Space Jam website with Claude

https://j0nah.com/i-failed-to-recreate-the-1996-space-jam-website-with-claude/
467•thecr0w•19h ago•379 comments

Show HN: Lockenv – Simple encrypted secrets storage for Git

https://github.com/illarion/lockenv
35•shoemann•4h ago•8 comments

Bag of words, have mercy on us

https://www.experimental-history.com/p/bag-of-words-have-mercy-on-us
206•ntnbr•13h ago•218 comments

Client-side GPU load balancing with Redis and Lua

https://galileo.ai/blog/how-we-boosted-gpu-utilization-by-40-with-redis-lua
7•lneiman•5d ago•2 comments

Bad Dye Job

https://daringfireball.net/2025/12/bad_dye_job
12•mpweiher•35m ago•0 comments

Dollar-stores overcharge customers while promising low prices

https://www.theguardian.com/us-news/2025/dec/03/customers-pay-more-rising-dollar-store-costs
406•bookofjoe•21h ago•552 comments

Show HN: ReadyKit – Superfast SaaS Starter with Multi-Tenant Workspaces

https://readykit.dev/
64•level09•1w ago•11 comments

Google Titans architecture, helping AI have long-term memory

https://research.google/blog/titans-miras-helping-ai-have-long-term-memory/
521•Alifatisk•23h ago•173 comments

The C++ standard for the F-35 Fighter Jet [video]

https://www.youtube.com/watch?v=Gv4sDL9Ljww
278•AareyBaba•18h ago•322 comments

The fuck off contact page

https://www.nicchan.me/blog/the-f-off-contact-page/
237•OuterVale•3h ago•97 comments

Mechanical power generation using Earth's ambient radiation

https://www.science.org/doi/10.1126/sciadv.adw6833
132•defrost•14h ago•41 comments

I wasted years of my life in crypto

https://twitter.com/kenchangh/status/1994854381267947640
304•Anon84•23h ago•452 comments

An Interactive Guide to the Fourier Transform

https://betterexplained.com/articles/an-interactive-guide-to-the-fourier-transform/
212•pykello•6d ago•37 comments

Solving Rush Hour, the Puzzle (2018)

https://www.michaelfogleman.com/rush/
35•xeonmc•1w ago•5 comments

Survivors Clung to Wreckage for Some 45 Minutes Before U.S. Military Killed Them

https://theintercept.com/2025/12/05/boat-strike-survivors-double-tap/
14•belter•55m ago•2 comments

CATL expects oceanic electric ships in 3 years

https://cleantechnica.com/2025/12/05/catl-expects-oceanic-electric-ships-in-3-years/
128•thelastgallon•1d ago•158 comments

The Anatomy of a macOS App

https://eclecticlight.co/2025/12/04/the-anatomy-of-a-macos-app/
246•elashri•23h ago•73 comments

Einstein: NewtonOS running on other operating systems

https://github.com/pguyot/Einstein
26•fanf2•2h ago•3 comments

Why Leftover Pizza Might Be Healthier

https://www.scientificamerican.com/video/why-leftover-pizza-is-actually-healthier-the-science-of-...
18•Brajeshwar•2h ago•1 comments

How I block all online ads

https://troubled.engineer/posts/no-ads/
236•StrLght•14h ago•204 comments

Scala 3 slowed us down?

https://kmaliszewski9.github.io/scala/2025/12/07/scala3-slowdown.html
237•kmaliszewski•21h ago•137 comments

Nested Learning: A new ML paradigm for continual learning

https://research.google/blog/introducing-nested-learning-a-new-ml-paradigm-for-continual-learning/
132•themgt•21h ago•10 comments