frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: 1v1 coding game that LLMs struggle with

https://yare.io
13•levmiseri•22h ago
This is a game I wish I had as a kid learning programming. The concept of it is fairly similar to other coding games like Screeps, but instead of a complex world with intricate mechanics, Yare is a lot more minimal and approachable with quick 1v1 <3 min matches.

It's purely a passion project with no monetization aspirations. And it's open source: https://github.com/riesvile/yare

The first version 'launched' several years ago and I got some good feedback here: https://news.ycombinator.com/item?id=27365961 that I iterated on.

The latest overhaul is a result of simplifying everything while still keeping the skill ceiling high. And at least the LLMs seem to struggle with this challenge for now (I run a small tournament between major models - results and details here: https://yare.io/ai-arena

I'd love to hear your thoughts

Comments

javadhu•18h ago
Cool project, this is my first time seeing such project using LLMs. Took me a while to understand what's happening on the home page.

A question though, why such powerful bots like Gemini 3.1 failed against Clowder bot? Is it because of inefficient code or the LLMs did not handle edge cases? Or they are not as good as humans when it comes to strategy.

levmiseri•18h ago
I’m not sure honestly. It could be some combination of bad spatial reasoning of the LLMs and lack of any training data for this specific challenge.

You can see replays for all of the matches if you hover over the cells in the table.

dang•3h ago
Macroexpanding the previous threads:

Show HN: Yare 2 – Programmable RTS game - https://news.ycombinator.com/item?id=32394902 - Aug 2022 (26 comments)

Show HN: Yare.io – game where you control units with JavaScript - https://news.ycombinator.com/item?id=27365961 - June 2021 (64 comments)

(Btw, reposts are fine after a year or so; links to past threads are just to satisfy extra-curious readers!)

vessenes•2h ago
Cool!

From the prompt it looks like you don’t give the llms a harness to step through games or simulate - is that correct? If so I’d suggest it’s not a level playing field vs human written bots - if the humans are allowed to watch some games that is.

levmiseri•1h ago
That’s true, I’m trying to figure out a better testing environment with a feedback loop.

I did try letting the models iterate on the bot code based on a summary of an end-of-game ‘report’, but that showed only marginal improvements vs. zero-shot

Plasma Bigscreen – 10-foot interface for KDE plasma

https://plasma-bigscreen.org
245•PaulHoule•5h ago•76 comments

UUID package coming to Go standard library

https://github.com/golang/go/issues/62026
66•soypat•2h ago•17 comments

this css proves me human

https://will-keleher.com/posts/this-css-makes-me-human/
195•todsacerdoti•7h ago•72 comments

Can a wealthy family change the course of a deadly brain disease?

https://www.science.org/content/article/can-wealthy-family-change-course-deadly-brain-disease
20•Snoozus•1h ago•10 comments

Maybe There's a Pattern Here?

https://dynomight.net/pattern/
52•surprisetalk•2d ago•17 comments

LLMs work best when the user defines their acceptance criteria first

https://blog.katanaquant.com/p/your-llm-doesnt-write-correct-code
112•dnw•3h ago•91 comments

C# strings silently kill your SQL Server indexes in Dapper

https://consultwithgriff.com/dapper-nvarchar-implicit-conversion-performance-trap
80•PretzelFisch•6h ago•46 comments

Galileo's handwritten notes found in ancient astronomy text

https://www.science.org/content/article/galileo-s-handwritten-notes-found-ancient-astronomy-text
67•tzury•1d ago•9 comments

Hardening Firefox with Anthropic's Red Team

https://www.anthropic.com/news/mozilla-firefox-security
531•todsacerdoti•17h ago•149 comments

Querying 3B Vectors

https://vickiboykis.com/2026/02/21/querying-3-billion-vectors/
10•surprisetalk•3d ago•0 comments

Show HN: Moongate – Ultima Online server emulator in .NET 10 with Lua scripting

https://github.com/moongate-community/moongatev2
238•squidleon•14h ago•135 comments

Tell HN: I'm 60 years old. Claude Code has ignited a passion again

253•shannoncc•4h ago•153 comments

The Shady World of IP Leasing

https://acid.vegas/blog/the-shady-world-of-ip-leasing/
79•alibarber•7h ago•48 comments

Show HN: 1v1 coding game that LLMs struggle with

https://yare.io
13•levmiseri•22h ago•5 comments

Launch HN: Palus Finance (YC W26): Better yields on idle cash for startups, SMBs

42•sam_palus•10h ago•69 comments

Tech employment now significantly worse than the 2008 or 2020 recessions

https://twitter.com/JosephPolitano/status/2029916364664611242
796•enraged_camel•11h ago•542 comments

CT Scans of Health Wearables

https://www.lumafield.com/scan-of-the-month/health-wearables
196•radeeyate•14h ago•41 comments

Show HN: Kula – Lightweight, self-contained Linux server monitoring tool

https://github.com/c0m4r/kula
22•c0m4r•4h ago•18 comments

What canceled my Go context?

https://rednafi.com/go/context-cancellation-cause/
24•mweibel•2d ago•14 comments

Entomologists use a particle accelerator to image ants at scale

https://spectrum.ieee.org/3d-scanning-particle-accelerator-antscan
109•gmays•13h ago•21 comments

Ada 2022

https://www.adaic.org/ada-resources/standards/ada22/
120•tosh•8h ago•23 comments

A Modular Robot Dashboard

https://github.com/transitiverobotics/transact
8•chfritz•1d ago•0 comments

A tool that removes censorship from open-weight LLMs

https://github.com/elder-plinius/OBLITERATUS
138•mvdwoord•14h ago•62 comments

Workers who love ‘synergizing paradigms’ might be bad at their jobs

https://news.cornell.edu/stories/2026/03/workers-who-love-synergizing-paradigms-might-be-bad-thei...
531•Anon84•15h ago•301 comments

Good Bad ISPs

https://community.torproject.org/relay/community-resources/good-bad-isps/
112•rzk•14h ago•38 comments

Analytic Fog Rendering with Volumetric Primitives (2025)

https://matejlou.blog/2025/02/11/analytic-fog-rendering-with-volumetric-primitives/
88•surprisetalk•1d ago•8 comments

Astra: An open-source observatory control software

https://github.com/ppp-one/astra
87•pppone•12h ago•21 comments

Art Bits from HyperCard

https://archives.somnolescent.net/web/mari_v2/junk/hypercard/
71•TigerUniversity•7h ago•15 comments

Multifactor (YC F25) Is Hiring an Engineering Lead

https://www.ycombinator.com/companies/multifactor/jobs/lcpd60A-engineering-lead
1•multifactor•12h ago

Game about Data of America

https://americaindata.com/
7•fidicen•4h ago•0 comments