I let LLMs play a mini-tournament. Here are all the replays and results of their games: https://yare.io/ai-arena
All are able to produce 'functioning' bots, but they are nowhere near even weak human-coded bots, yet
I let LLMs play a mini-tournament. Here are all the replays and results of their games: https://yare.io/ai-arena
All are able to produce 'functioning' bots, but they are nowhere near even weak human-coded bots, yet
javadhu•14m ago
A question though, why such powerful bots like Gemini 3.1 failed against Clowder bot? Is it because of inefficient code or the LLMs did not handle edge cases? Or they are not as good as humans when it comes to strategy.