Show HN: Pokerbattle.ai – A week-long poker tournament for LLMs

https://pokerbattle.ai/
14•mpavlov•4mo ago
What

PokerBattle.ai is a week-long live no-limit Texas Hold’em tournament where all players are top-tier reasoning LLMs. We’re testing how different models handle imperfect information and whether they can sustain consistent, math-driven poker without tool use or custom code.

Why

- In poker you can do well with basic math + consistent logic.

- Superhuman poker AIs exist, but they rely on massive simulation/game-theory solvers and are effectively black boxes.

- We want a rough, apples-to-apples comparison of LLM reasoning on poker decisions, and to collect public reasoning summaries that might be useful for teaching humans poker concepts with LLM-based systems.

How it works (rules / format)

- Cash format, fixed blinds, no ante.

- Multiple tables run in parallel to increase hand volume.

- All players start with the same bankroll. If a player’s stack drops below 5bb on any table, it is automatically topped up to 100bb from that player’s bankroll.

- When a player’s bankroll hits 0, they bust. The largest bankroll at event end wins.

- Same prompt for all models. No extra tools, no code execution — pure language-only decisions.

- Models can keep simple notes about opponents across hands.

- We show public summaries of model reasoning in real time to viewers (not raw hidden prompts/tokens).
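
The auto top-up rule above can be sketched in a few lines of Python. This is only an illustration of the accounting, not the tournament's actual code: the `Player` class, the `top_up` and `is_busted` helpers, and the table ids are hypothetical names. It also assumes that all amounts are tracked in big blinds, that "bankroll" means chips held off the tables, that a short bankroll gives a partial top-up, and that a player is out only once both the bankroll and every stack are empty.

```python
from dataclasses import dataclass, field

REBUY_THRESHOLD_BB = 5.0   # from the post: a stack below 5bb triggers a top-up
TARGET_STACK_BB = 100.0    # from the post: the stack is restored to 100bb


@dataclass
class Player:
    """Hypothetical record of one model's chips, all amounts in big blinds."""
    name: str
    bankroll: float                                         # chips held off the tables
    stacks: dict[str, float] = field(default_factory=dict)  # table id -> stack


def top_up(player: Player, table_id: str) -> float:
    """Refill one table's stack if it fell below the threshold.

    Returns the amount moved from the bankroll to the table (0.0 if none).
    """
    stack = player.stacks.get(table_id, 0.0)
    if stack >= REBUY_THRESHOLD_BB:
        return 0.0
    needed = TARGET_STACK_BB - stack
    # Assumption: if the bankroll cannot cover a full top-up, move what is left.
    moved = min(needed, player.bankroll)
    player.bankroll -= moved
    player.stacks[table_id] = stack + moved
    return moved


def is_busted(player: Player) -> bool:
    """Assumption: a player is out once the bankroll and every stack are empty."""
    return player.bankroll <= 0 and all(s <= 0 for s in player.stacks.values())


if __name__ == "__main__":
    p = Player("model_a", bankroll=150.0, stacks={"t1": 3.0, "t2": 40.0})
    print(top_up(p, "t1"), p.bankroll, p.stacks["t1"])  # full top-up on table t1
    print(top_up(p, "t2"))                              # t2 is above 5bb, nothing moves
```

Keeping the top-up as a single function that returns the amount moved makes the bankroll ledger easy to audit: summing the returned values per player should match the drop in that player's off-table bankroll.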

Research goals

- Compare different LLMs’ decision consistency and adaptation over long horizons.

- Produce a dataset of reasoning summaries + actions + outcomes suitable for exploring instructional use (human learning/teaching), not solver training.

When / where

- Dates: Oct 27 — Nov 3

- Streamed live on the site linked below (free to watch).

Looking for

- Feedback on design/metrics.

- Participant suggestions.

- Community ideas on fair prompts, leak prevention, and evaluation.

- Sponsors interested in supporting an open, public experiment (logos on stream, sponsored sections, mentions).

https://pokerbattle.ai/

Happy to answer technical questions (prompting, seat randomization, bankroll accounting, leak-proofing, latency/timeout handling, etc.). If there’s interest, we’ll publish a post-mortem and release the summarized traces + hand histories after the event.

Comments

frappuccino_o•4mo ago
wow that sounds amazing
euphetar•4mo ago
Nice
pimvic•4mo ago
I'm waiting for the launch. Good luck!
xLMx•4mo ago
If anyone participating wants advice on how to best calculate all this from a NL100 player's perspective, please send me a private message. I'm very good at theory, and you can show me the results of your exploits in exchange. I'm not participating myself. TG: whocares228
