frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

ChatGPT is shockingly bad at poker

https://www.natesilver.net/p/chatgpt-is-shockingly-bad-at-poker
1•PaulHoule•3h ago

Comments

techpineapple•3h ago
So, fundamentally, I guess there’s two camps, one would say that a trained LLM is building actual intelligence, so presumably, it could actually know right vs wrong because given enough data the model will optimize towards intelligence/truth, regardless of the training data.

The other camp might say something like, LLMs directly model the world defined in its training data, irregardless of “truth”, it may have some rudimentary ideas on discerning truth, based on the way that’s done in its training data, but let’s say most people in the world are bad at poker, then the machine would probably be bad at poker.

Like on the one hand, having a machine that can sort of synthesize all of the world’s information to generate answers based on all the currently available information is amazing! And there’s a lot of information out there! It’s now wonder they’re incredibly capable.

But it’s not actual intelligence. It’s like imagining working with the most book learned person in the world who has no street smarts, except for what they could regurgitate from repeated viewings of the wire.

aaronbaugher•2h ago
Except that it doesn't synthesize all of the world's information; it's trained on a subset of safe, mainstream sources approved by its creators, and has guardrails to protect it from information that might prompt wrongthink. If you want the current-year-approved answer that you could get from Wikipedia or reddit, only faster, it's great for that, and many times that's plenty sufficient.

But an actual intelligence could think, "Hmm, I wonder what else is out there that they haven't told me about," and go learn about it. LLMs will never do that, at least not if their owners have anything to say about it.

Catch-22 (Logic)

https://en.wikipedia.org/wiki/Catch-22_(logic)
3•1970-01-01•4m ago•0 comments

We Built a Database Just for Event Sourcing (EventSourcingDB)

https://hub.docker.com/r/thenativeweb/eventsourcingdb
1•goloroden•6m ago•0 comments

The Medley Interlisp Project

https://interlisp.org/
1•MaysonL•7m ago•0 comments

Zed Shaw – The Access Control List Is Dead (2008) [video]

https://www.youtube.com/watch?v=9BmcB_gp8kw
2•droideqa•7m ago•0 comments

Bain launches datacenter biz for Euros worried about climate change and Trump

https://www.theregister.com/2025/05/22/bain_capital_hscale/
1•rntn•9m ago•0 comments

Trump Administration Halts Harvard's Ability to Enroll International Students

https://www.nytimes.com/2025/05/22/us/politics/trump-harvard-international-students.html
10•S0y•9m ago•6 comments

How to Build Conscious Machines

https://osf.io/preprints/thesiscommons/wehmg_v1?view_only=
1•Anon84•12m ago•0 comments

National Museum of Asian Art Announces Transfer of Manuscript Fragments to China

https://www.si.edu/newsdesk/releases/national-museum-asian-art-announces-transfer-ancient-manuscript-fragments-china
1•gnabgib•15m ago•0 comments

Running Rust Code in a Chrome Extension

https://elijahpotter.dev/articles/putting_harper_in_your_browser
1•chilipepperhott•16m ago•0 comments

Finland Court: Food couriers are employees, not entrepreneurs

https://yle.fi/a/74-20163422
3•stevekemp•17m ago•0 comments

Writing a technical book with Manning in 2020

https://medium.com/modern-fortran/writing-a-technical-book-with-manning-in-2020-6ac3497500c9
1•Tomte•19m ago•0 comments

The "AI 2027" Scenario: How realistic is it?

https://garymarcus.substack.com/p/the-ai-2027-scenario-how-realistic
10•NotInOurNames•19m ago•1 comments

Creating Wildflower Meadows

https://web.archive.org/web/20250511225613/https://www.rhs.org.uk/lawns/creating-wildflower-meadows
1•Tomte•19m ago•0 comments

Synthesis, Performance and Applications of Metal-Organic Framework MIL-101(Cr)

https://www.mdpi.com/2624-8549/7/3/78
1•PaulHoule•21m ago•0 comments

North Korea Botched Launch of Navy Destroyer

https://www.nytimes.com/2025/05/22/world/asia/north-korea-destroyer-accident.html
2•jihadjihad•22m ago•1 comments

Show HN: Secure Execution of AI-Generated Code Locally on macOS/Linux MicroVMs

https://github.com/microsandbox/microsandbox
1•appcypher•22m ago•0 comments

Making Minecraft Mods with LLMs

https://www.creativemode.net/blog/making-minecraft-mods-with-llms
7•wilson090•23m ago•0 comments

BYD Outsells Tesla in Europe

https://www.semafor.com/article/05/22/2025/byd-outsells-tesla-in-europe-for-the-first-time
2•thm•23m ago•0 comments

HTML5 elements you didn't know you need

https://dev.to/maxprilutskiy/html5-elements-you-didnt-know-you-need-gan
2•maxpr•23m ago•1 comments

Can We Trust Social Science Yet?

https://asteriskmag.com/issues/10/can-we-trust-social-science-yet
1•Michelangelo11•25m ago•0 comments

Cross Platform Machine Code (2022)

https://tenderlovemaking.com/2022/06/12/cross-platform-machine-code/
1•082349872349872•26m ago•0 comments

Show HN: Keyboard-first tool to type and create fast

1•kevinisherenow•26m ago•0 comments

Welcome to Agentic Commerce: Where Smart Agents Seal the Deal

https://abhinov.xyz/welcome-to-agentic-commerce/
4•beabhinov•27m ago•0 comments

There's Always a First

https://preservinghope.substack.com/p/theres-always-a-first
1•Teever•28m ago•0 comments

Show HN: Pill Buddy - Meds Tracker for iOS

https://apps.apple.com/us/app/pill-buddy-meds-tracker/id6742357512
1•kaiherng•30m ago•0 comments

First gene-edited spider produces red fluorescent silk

https://newatlas.com/biology/worlds-first-gene-edited-spider-produces-red-fluorescent-silk/
1•namanyayg•33m ago•0 comments

Apple adds official Vision Pro support to Godot game engine

https://www.developer-tech.com/news/apple-official-vision-pro-support-godot-game-engine/
3•namanyayg•33m ago•1 comments

Hit hardest in Microsoft layoffs? Developers, product managers, morale

https://www.seattletimes.com/business/hit-hardest-in-microsoft-layoffs-developers-product-managers-morale/
2•namanyayg•33m ago•0 comments

MilliForth-6502, A Forth For The 6502 CPU

https://github.com/agsb/milliForth-6502
1•droideqa•37m ago•2 comments

Ask HN: Would a combination of Snapchat and Reddit be interesting?

1•busymom0•39m ago•2 comments