OpenAI’s o3 now outperforms 94% of expert virologists." -- thread by a co-author, https://x.com/DanHendrycks/status/1914696657813561799
Paper: Virology Capabilities Test (VCT): A Multimodal Virology Q&A Benchmark
https://www.virologytest.ai/vct_paper.pdf
nopinsight•3h ago
OpenAI’s o3 now outperforms 94% of expert virologists." -- thread by a co-author, https://x.com/DanHendrycks/status/1914696657813561799
Paper: Virology Capabilities Test (VCT): A Multimodal Virology Q&A Benchmark
https://www.virologytest.ai/vct_paper.pdf