I built ollamon, an htop-style terminal monitor for Ollama nodes. It shows installed and running models, CPU/RAM/disk usage, GPU metrics, latency and request telemetry derived from access logs, and lightweight operational insights, all in a terminal UI. On macOS, GPU data is sourced from agputop. The goal is to make local LLM infrastructure easier to observe without adding heavy dependencies.
https://github.com/hbasria/ollamon
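
For anyone curious how the installed/running model view can be put together, here is a minimal sketch, not ollamon's actual code. It assumes a node on Ollama's default address and uses the Ollama HTTP API endpoints `/api/tags` (installed models) and `/api/ps` (running models). The helper names `fetch_json` and `summarize_models` and the row layout are hypothetical, chosen only for illustration.

```python
import json
from urllib.request import urlopen

# Assumption: Ollama's default listen address; change this for a remote node.
OLLAMA_URL = "http://localhost:11434"


def fetch_json(path, base_url=OLLAMA_URL, timeout=5):
    """GET a JSON document from the Ollama HTTP API."""
    with urlopen(f"{base_url}{path}", timeout=timeout) as resp:
        return json.load(resp)


def summarize_models(tags, ps):
    """Merge /api/tags (installed) and /api/ps (running) payloads into rows.

    Hypothetical helper: one row per installed model, flagged as running
    when the same name also appears in the /api/ps payload.
    """
    running = {m["name"]: m for m in ps.get("models", [])}
    rows = []
    for model in tags.get("models", []):
        live = running.get(model["name"])
        rows.append({
            "name": model["name"],
            # Sizes are reported in bytes; convert to decimal gigabytes.
            "size_gb": round(model.get("size", 0) / 1e9, 2),
            "running": live is not None,
            "vram_gb": round(live.get("size_vram", 0) / 1e9, 2) if live else 0.0,
        })
    return rows


if __name__ == "__main__":
    rows = summarize_models(fetch_json("/api/tags"), fetch_json("/api/ps"))
    for row in rows:
        state = "running" if row["running"] else "idle"
        print(f'{row["name"]:<30} {row["size_gb"]:>7.2f} GB  {state}')
```

Keeping the merge step separate from the HTTP call means it can be exercised with canned payloads, without a live Ollama node.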