OpenAI's GDPval: Why the 66% in Automated Grading Matters More Than 48% Win Rate

https://medium.com/@pranil.dasika/openais-gdpval-why-the-66-automated-grading-problem-matters-more-than-the-48-win-rate-a5e542508196
5•pdasika•1h ago

Comments

adisv•1h ago
Very comprehensive writeup @pdasika. Incredibly relevant for devs working on agentic applications for the enterprise.
kanodiaashu•1h ago
Interesting take.