I built AluminatiAI to fill that gap. A lightweight Python agent (pynvml, ~0% CPU overhead, ~50MB RAM) samples power draw every 5 seconds and streams to a Next.js + Supabase dashboard. The dashboard converts watts → dollars per job, per GPU, per day. Works on A100, H100, RTX 3090/4090, and Google Colab.
Install is `pip install` + one env var. Under 2 minutes.
As H100 time gets more expensive and teams are running larger sweeps against frontier models, not knowing which run cost $800 is a real problem — not just a nice-to-have dashboard.
Open source: https://github.com/AgentMulder404/aluminatai-landing Website: https://www.aluminatiai.com
Happy to answer questions about the sampling approach or the watt-to-dollar conversion logic.