BPU (Batch Processing Unit) is a lightweight embedded scheduling core focused on keeping output pipelines stable under pressure (UART backpressure, limited bandwidth, bursty producers).
Instead of blocking or growing unbounded queues, it: enforces per-tick byte budgets, coalesces redundant events, degrades gracefully under sustained load, exposes detailed runtime statistics.
The repository includes design notes, flow diagrams, and real execution logs, which makes the runtime behavior very transparent.
Repo: https://github.com/choihimchan/bpu_v2_9b_r1
I’ve been working on an ESP-IDF backend for it, and reading through the docs gave me a lot of ideas about observability and backpressure handling in small systems.
Curious what others think about this approach.