Ask HN: Why are LLMs so bad at board games?

3•legitster•1h ago
I've come across a use case LLMs seem inexplicably bad at: reading and understanding board game rules.

It seems like something an LLM should be excellent at. After all, a rulebook is just a self-contained set of instructions.

And yet, not only do LLMs fail to play a board game, I have yet to get one to understand a rulebook well enough to answer even basic rules questions.

This seems like a massive red flag for the state of AI overall. I'm wondering whether this is a surmountable issue, or whether it speaks to the underlying limitations of AI in general.

Ponytail, Yagni, and the Problem with Prompt Benchmarks

https://blog.scottlogic.com/2026/06/16/ponytail-yagni-and-the-problem-with-prompt-benchmarks.html
1•ColinEberhardt•1m ago•0 comments

Stop Using JWTs

https://gist.github.com/samsch/0d1f3d3b4745d778f78b230cf6061452
1•dzonga•1m ago•1 comments

The Minimum Viable Unit of Saleable Software

https://brandur.org/minimum-viable-unit
1•plaur782•1m ago•0 comments

Building an LLM safe design system

https://polar.sh/blog/orbit-llm-safe-design-system
1•steventey•2m ago•0 comments

Gen Z Is Turning YouTubers into Box Office Giants

https://www.ypulse.com/article/2026/06/15/gen-z-is-turning-youtubers-into-box-office-giants/
1•mooreds•3m ago•0 comments

Cursor Is a Great Restaurant

https://marginpoints.substack.com/p/cursor-is-a-great-restaurant
1•historian1066•3m ago•0 comments

Supply Chain Capitalism, Platform Mercantilism, AI Coup

https://www.ctrl-verlust.net/supplychain-kapitalismus-plattform-merkantilismus-ki-coup-und-die-gr...
1•doener•5m ago•0 comments

Agent-stdlib: A standard library for building agents

https://github.com/pebeto/agent-stdlib
1•pebeto•6m ago•0 comments

Open source Action-RPG game (Clojure)

https://github.com/damn/moon
1•resatori•7m ago•0 comments

I packaged 20 years of enterprise AI sales experience as a Claude Skill

https://github.com/vonarmen-wq/forward-deployed-selling
1•alphaspawn14•7m ago•0 comments

Why is Meta destroying its engineering organization?

https://newsletter.pragmaticengineer.com/p/why-is-meta-destroying-its-engineering
2•throwarayes•8m ago•0 comments

Training NanoGPT on Slurm with a Nix-Pinned Environment

https://flox.dev/blog/training-nanogpt-on-slurm-with-a-nix-pinned-environment/
2•rokgarbas•8m ago•0 comments

TIL: You can make HTTP requests without curl using Bash /dev/tcp

https://mareksuppa.com/til/bash-dev-tcp-http-without-curl/
2•mrshu•10m ago•1 comments

The British Social Media Ban Is Silly

https://dogdogfish.com/blog/2026/06/15/social-media-ban/
1•matthewsharpe3•10m ago•0 comments

British Columbia, Time Zones, and Postgres

https://www.crunchydata.com/blog/british-columbia-and-time-zone-changes
2•winslett•10m ago•0 comments

The AiCopalypse

https://www.objecthunter.net//articles/2026/06/16/the-aicopalypse.html
1•floken•11m ago•1 comments

Sam Bankman-Fried's Prison Experiment

https://nymag.com/intelligencer/article/sam-bankman-fried-prison-donald-trump-pardon-appeal.html
1•aanet•11m ago•0 comments

Show HN: NumUp – A Daily Math Puzzle

https://vasanth.fun/numup/
1•vasanthv•12m ago•0 comments

A Vision for a Rust Formal Specification

https://nadrieril.github.io/blog/2026/06/16/formal-spec-vision.html
1•emschwartz•12m ago•0 comments

The Pokémon Trading Card Game AI Battle Challenge

https://ptcg-abc.pokemon.co.jp/
2•esnard•13m ago•0 comments

Show HN: Memento – Self-hosted agentic search and LLM wiki over your email

4•georgeck•14m ago•1 comments

Show HN: Dino – An AI coding agent that keeps you in the loop

https://smartdino.dev
1•ylian•15m ago•0 comments

The European Commission is turning Google Search into a national-security risk

https://techletters.substack.com/p/the-european-commission-is-turning
1•miohtama•15m ago•0 comments

Snyk VulnBench JavaScript 1.0: Can LLMs Find the Same Bugs Twice?

https://arxiv.org/abs/2606.15762
1•ilreb•18m ago•0 comments

Fake Viral Guitarists Strike Again

https://www.youtube.com/watch?v=-Eik8uBvdxY
1•root-parent•19m ago•0 comments

Hackers Publish Knicks and Madison Square Garden Data Online

https://www.404media.co/hackers-publish-knicks-and-madison-square-garden-data-online/
3•Cider9986•20m ago•2 comments

Biff.fx: lightweight effects system for Clojure

https://biffweb.com/p/fx/
1•jacobobryant•20m ago•0 comments

Yes, we still need engineers

https://mattsayar.com/yes-we-still-need-engineers/
5•MattSayar•20m ago•0 comments

Ask HN: What do you think about blockchain's current trajectory

4•mobear•20m ago•1 comments

Google's Training Supercomputers from TPU v2 to Ironwood: Five Generations

https://arxiv.org/abs/2606.15870
1•matt_d•22m ago•0 comments