https://www.janardan.xyz/writing/deconstructing-ai-eval-lab-workings
I got bored of UI work at my day job and wanted to build something. Ended up building a platform that streams KiCad (a PCB design tool) to the browser via VNC, tracks what the user is doing on the board in real time, and uses an LLM to evaluate their process at the end. The idea: coding assessments exist everywhere, but nothing like this for EE/hardware folks. Wanted to see if you could evaluate an engineer by just watching them work. Still rough. VNC latency and lag is real. No proctoring yet (MVP phase). But the core thing works. Blog has the full breakdown of how it's built.