fp.
newest
Open in hackernews
SurgeAI Blog: Human Evals vs. Academic Benchmarks
https://www.surgehq.ai//blog/human-evals-vs-academic-benchmarks
1
•
Olshansky
•
5mo ago