frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

SurgeAI Blog: Human Evals vs. Academic Benchmarks

https://www.surgehq.ai//blog/human-evals-vs-academic-benchmarks
1•Olshansky•5mo ago