Show HN: Orpheus, An Agent runtime that scales on queue depth and not CPU

https://github.com/arpitnath/orpheus
6•arpitnath42•2h ago

Comments

arpitnath42•1h ago
Hey HN! Over the last year, I’ve been running AI agents in production and kept hitting the same issue. Responses would suddenly take 30+ seconds, even though everything looked fine. CPU was low, memory was fine, no errors. But requests were clearly getting stuck.

The reason turned out to be simple. These agents spend most of their time waiting for AI APIs to respond. While they wait, they barely use the CPU. So the system looks idle even when it’s overloaded. Low CPU doesn’t mean low demand.

I first tried normal autoscaling based on CPU. It never worked. CPU stayed low, so nothing scaled up, and queues kept growing.
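A rough way to see why the CPU signal stays flat while the backlog grows is to put numbers on it. The sketch below is not taken from Orpheus; the function name and every figure in it (10 s of wall-clock time per request, 50 ms of CPU, 4 workers on 4 cores, 1 request per second) are hypothetical values chosen only to illustrate the effect.

```python
def steady_state(arrival_rate, workers, wall_seconds, cpu_seconds, cores):
    """Return (cpu_utilization, backlog_growth_per_second) for I/O-bound workers.

    Each worker handles one request at a time and is blocked for
    `wall_seconds` per request, of which only `cpu_seconds` is real CPU work.
    """
    capacity = workers / wall_seconds        # requests/s the workers can finish
    served = min(arrival_rate, capacity)     # requests/s actually completed
    cpu_utilization = served * cpu_seconds / cores
    backlog_growth = arrival_rate - served   # requests/s piling up in the queue
    return cpu_utilization, backlog_growth


# Hypothetical workload: mostly waiting on a model API.
cpu, growth = steady_state(
    arrival_rate=1.0,   # requests per second
    workers=4,
    wall_seconds=10.0,  # time a worker is occupied per request
    cpu_seconds=0.05,   # CPU actually burned per request
    cores=4,
)
print(f"CPU utilization: {cpu:.1%}, backlog growth: {growth:.1f} req/s")
```

Under these assumptions the workers are fully occupied, CPU utilization is well under 1%, and the queue still grows by more than half a request every second, so a CPU threshold would never fire.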

Then I moved to scaling based on queue size. That worked much better. I tried a few setups using different queues and cloud tools. The idea was right, but the setup was heavy. It needed extra services and cloud-specific configs and still suffered from slow container startups.

What I really wanted was something simpler: scale based on how much work is waiting, react quickly, and avoid complex infrastructure.
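As a minimal sketch of what "scale based on how much work is waiting" can look like, the function below derives a worker count from queue depth alone. It is an illustration, not Orpheus's actual policy: the name `desired_workers` and the parameters `jobs_per_worker`, `min_workers`, and `max_workers` are assumptions made for this example.

```python
import math


def desired_workers(pending, in_flight, jobs_per_worker, min_workers, max_workers):
    """Pick a worker count from outstanding work, ignoring CPU entirely.

    pending         -- jobs waiting in the queue
    in_flight       -- jobs currently being processed
    jobs_per_worker -- how many outstanding jobs one worker should cover
    """
    demand = pending + in_flight
    wanted = math.ceil(demand / jobs_per_worker)
    # Clamp so the pool never drops below the floor or exceeds the ceiling.
    return max(min_workers, min(max_workers, wanted))


# 25 queued + 3 running jobs, 4 jobs per worker, pool bounded to 1..10.
print(desired_workers(pending=25, in_flight=3, jobs_per_worker=4,
                      min_workers=1, max_workers=10))
```

A control loop would call something like this every few seconds and start or stop workers to match the result; how quickly it reacts then depends on worker startup time rather than on any CPU metric.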

After running this approach in production for about seven months, I pulled it out into a small standalone system called Orpheus. It scales based on pending work, starts workers quickly, keeps agent state on disk so it survives restarts, and doesn’t automatically retry crashed jobs (which helps avoid repeating things like emails or payments). It also avoids container overhead and gives each agent a built-in way to connect tools and models.
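The combination of on-disk state and no automatic retries can be sketched as an at-most-once job store. This is a simplified illustration under stated assumptions, not Orpheus's implementation: the `JobStore` class, its JSON file layout, and the status names are all hypothetical. The key idea is to record the attempt on disk before any side effect runs, so that after a crash the job is marked failed instead of being run again.

```python
import json
from pathlib import Path


class JobStore:
    """Hypothetical at-most-once job store backed by a single JSON file."""

    def __init__(self, path):
        self.path = Path(path)
        self.jobs = json.loads(self.path.read_text()) if self.path.exists() else {}

    def _save(self):
        # Write to a temp file, then rename over the old one, so a crash
        # mid-write leaves the previous state file in place.
        tmp = self.path.with_suffix(".tmp")
        tmp.write_text(json.dumps(self.jobs))
        tmp.replace(self.path)

    def submit(self, job_id, payload):
        self.jobs[job_id] = {"status": "pending", "payload": payload}
        self._save()

    def run(self, job_id, handler):
        """Run a pending job once; return False if it was already attempted."""
        job = self.jobs[job_id]
        if job["status"] != "pending":
            return False
        job["status"] = "running"
        self._save()  # persist the attempt BEFORE any side effect happens
        job["result"] = handler(job["payload"])
        job["status"] = "done"
        self._save()
        return True

    def recover(self):
        """After a restart, mark interrupted jobs as failed instead of retrying."""
        interrupted = [jid for jid, j in self.jobs.items() if j["status"] == "running"]
        for jid in interrupted:
            self.jobs[jid]["status"] = "failed"
        if interrupted:
            self._save()
        return interrupted
```

With this shape, a job that crashed halfway through sending an email shows up in `recover()` for a human or a higher-level policy to inspect, and a second call to `run()` for the same job is refused rather than repeating the side effect.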

This isn’t meant to replace Kubernetes. It’s meant for AI agent workloads where CPU-based scaling gives the wrong signal.

Show HN: Image MetaHub – Search Local AI Images by Prompt, Model, LoRA, Seed

https://github.com/LuqP2/Image-MetaHub
1•LucasPi•1m ago•0 comments

The Sazabi Manifesto

https://www.sazabi.com/manifesto
1•shcallaway•2m ago•1 comment

Show HN: Tabstack Research – An API for verified web research (by Mozilla)

4•MrTravisB•3m ago•0 comments

Pick Your Agent: Use Claude and Codex on Agent HQ

https://github.blog/news-insights/company-news/pick-your-agent-use-claude-and-codex-on-agent-hq/
1•abraham•4m ago•0 comments

DHS Hunts Down 67-Year-Old U.S. Citizen Who Criticized Them in Email

https://newrepublic.com/post/206088/homeland-security-67-year-old-us-citizen-criticized-email
2•randycupertino•5m ago•1 comment

Claude Code's /Insights

https://www.natemeyvis.com/claude-codes-insights/
1•ingve•7m ago•0 comments

Release Pandoc 3.9

https://github.com/jgm/pandoc/releases/tag/3.9
4•Tomte•9m ago•1 comment

YC S26 Application: "Attach a coding agent session you're particularly proud of"

1•simplydt•9m ago•0 comments

Why does this site require security questions?

https://txt.texas.gov/getting-started/security-questions-capture
1•newsoftheday•10m ago•1 comment

VS Code for Linux may be hoarding trashed files

https://www.theregister.com/2026/02/04/vs_code_for_linux_trash_fail/
1•u1hcw9nx•10m ago•0 comments

I made an open-source Jupyter alternative

https://github.com/DannyMang/more-compute
1•danielung22•10m ago•0 comments

Map Shows 21 States Where Deaths Now Outnumber Births (2025)

https://www.newsweek.com/map-shows-21-states-where-deaths-now-outnumber-births-2092400
2•toomuchtodo•10m ago•0 comments

The Great Unwind

https://occupywallst.com/yen
2•jart•11m ago•0 comments

Cracking the Clit (2017)

https://logicmag.io/sex/cracking-the-clit/
1•joebig•12m ago•0 comments

MariaDB Cloud BYOA: run managed MariaDB inside your Azure subscription

https://mariadb.com/resources/blog/announcing-mariadb-cloud-byoa-fully-managed-mariadb-in-your-az...
1•alejandro-du•14m ago•0 comments

cad0: A Text-to-CAD Model

https://campedersen.com/cad0
7•ecto•15m ago•1 comment

A2RL – Autonomous Drone Racing Championship [video]

https://www.youtube.com/watch?v=P25BwtepmUk
1•aanet•16m ago•1 comment

My Blogger-Mania Profile

1•agnes-nordic•17m ago•0 comments

Show HN: A free focus app that locks work if you skip breaks

https://www.kensho.zone/
1•bestonearth•17m ago•0 comments

Show HN: tmpo – Local-first CLI time tracker with automatic project detection

https://github.com/DylanDevelops/tmpo
1•dylandevelops•17m ago•0 comments

Ask HN: When will LLMs generate professional-level CAD models?

4•dsrtslnd23•18m ago•1 comment

Who Bit My Border? (2012)

https://archive.nytimes.com/opinionator.blogs.nytimes.com/2012/03/13/who-bit-my-border/
1•thunderbong•18m ago•0 comments

Mazeru: Clojure full-stack server-state using Deno and Datastar

https://codeberg.org/gregorybleiker/mazeru
1•simonpure•18m ago•0 comments

Server CPUs join memory in the supply shortage, pushing up prices

https://www.theregister.com/2026/02/04/server_cpus_memory_shortage/
1•jjgreen•18m ago•0 comments

Feedsmith: Emacs RSS reader with Feedbin sync, pluggable back ends

https://github.com/curtismchale/feedsmith
2•ingve•19m ago•0 comments

Show HN: Workout app for when you're short on time

https://apps.apple.com/il/app/quickworkout-tap-train/id6757982958
2•Benboren•20m ago•1 comment

NASA's Perseverance Rover Completes First AI-Planned Drive on Mars

https://www.jpl.nasa.gov/news/nasas-perseverance-rover-completes-first-ai-planned-drive-on-mars/
2•kgrimes2•20m ago•0 comments

Intel will start making GPUs

https://techcrunch.com/2026/02/03/intel-will-start-making-gpus-a-market-dominated-by-nvidia/
12•SunshineTheCat•20m ago•11 comments

Roblox's 4D creation feature is now available in open beta

https://techcrunch.com/2026/02/04/robloxs-4d-creation-feature-is-now-available-in-open-beta/
1•rbanffy•20m ago•0 comments

Implementation of the Apollo Guidance Computer in an FPGA

https://github.com/mikeakohn/apollo11_fpga
1•PaulHoule•21m ago•0 comments