frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Can GPT-5 Beat My Favorite Daily Puzzle Game?

https://www.nicksypteras.com/blog/cbs-benchmark.html
7•nsypteras•1h ago

Comments

pyankoff•1h ago
Very cool! The massive outperformance of GPT-5 looks like there is something different in their training data indeed. Considering their previous work on games, wouldn't be surprising if they generated some synthetic game data.
nsypteras•50m ago
Ya interesting thought - would be fascinating if generating games w/solutions is part of the training data pipeline. There's been previous work done on on testing LLMs on logic puzzles[1][2][3] so they could possibly be building off those ideas to improve performance.

[1] https://huggingface.co/papers/2504.00043 [2] https://huggingface.co/blog/yuchenlin/zebra-logic [3] https://arxiv.org/pdf/2403.12094

srekhi•55m ago
interesting - and thx for making reproducible

iPhone Pocket

https://www.apple.com/ca/newsroom/2025/11/introducing-iphone-pocket-a-beautiful-way-to-wear-and-c...
1•fortran77•2m ago•1 comments

Abilene Paradox

https://en.wikipedia.org/wiki/Abilene_paradox
2•ugur2nd•4m ago•0 comments

ISRG (LetsEncrypt) has created two new roots [pdf]

https://letsencrypt.org/audits/ISRG-2025-Key-Generation-Report.pdf
1•randompeach•5m ago•1 comments

We Uncovered a Race Condition in Aurora RDS

https://hightouch.com/blog/uncovering-a-race-condition-in-aurora-rds
10•theanomaly•5m ago•0 comments

Liberating Search from the Search Engine

https://softwaredoug.com/blog/2025/06/03/liberating-search
1•softwaredoug•5m ago•0 comments

Django Developers Survey 2025 Results

https://lp.jetbrains.com/django-developer-survey-2025/
3•ferryth•6m ago•1 comments

Show HN: Dumbass Business Ideas

https://dumbassideas.com
2•elysionmind•7m ago•0 comments

Coding Trance Music (Full Narrated)

https://www.youtube.com/watch?v=GWXCCBsOMSg
4•tux1968•8m ago•0 comments

LangDiff: Progressive UI from LLM

https://github.com/globalaiplatform/langdiff
1•rob•8m ago•0 comments

Ask HN: How do you handle logging and evaluation when training ML models?

2•calepayson•8m ago•0 comments

Secret Boat Strike Memo Justifies Kills by Claiming Targeting Drugs, Not People

https://theintercept.com/2025/11/14/boat-strikes-immunity-legality-trump/
6•Qem•9m ago•0 comments

Red Hat OpenShift 4.20 Boosts AI, Security, Hybrid Cloud

https://thenewstack.io/red-hat-openshift-4-20-boosts-ai-security-hybrid-cloud/
1•CrankyBear•9m ago•1 comments

Pope Leo XIV: "AI and Medicine: The Challenge of Human Dignity"

https://www.vatican.va/content/leo-xiv/en/messages/pont-messages/2025/documents/20251107-messaggi...
2•logannyeMD•10m ago•0 comments

Cursor never says "IDE" on their homepage

https://twitter.com/grigoriy_kogan/status/1989395569865838775
4•gk1•12m ago•1 comments

Cursor AI sent me a gift for pressing Tab 74,283 times [video]

https://www.youtube.com/shorts/vZ1Qd-gClu4
1•AshesOfOwls•12m ago•1 comments

The Multimillion-Dollar Plan to Make Mobile Voting Happen

https://www.wired.com/story/bradley-tusk-mobile-voting-protocol/
1•shpat•12m ago•0 comments

Cornserve: Serving multimodal AI models like microservices

https://cornserve.ai/
1•jaywonchung•13m ago•0 comments

Plant-based diets could reshape farming jobs and reduce labor costs worldwide

https://phys.org/news/2025-10-global-based-diets-reshape-farming.html
4•PaulHoule•14m ago•0 comments

Proton 10.0-3 Released for Steam Play with Fixes, More Games Working

https://www.phoronix.com/news/Proton-10.0-3-Released
8•Bender•15m ago•1 comments

Developer jobs that help you move states

https://nextstatejobs.substack.com/p/handpicked-usa-jobs-with-relocation
4•andrewstetsenko•16m ago•0 comments

Washington Post Says Nearly 10k Employees Impacted by Oracle Hack

https://www.securityweek.com/washington-post-says-nearly-10000-employees-impacted-by-oracle-hack/
4•Bender•16m ago•0 comments

World’s oldest RNA extracted from ice age woolly mammoth

https://arstechnica.com/science/2025/11/worlds-oldest-rna-extracted-from-ice-age-woolly-mammoth/
4•Bender•17m ago•0 comments

JPMorgan secures deals with fintech aggregators over fees to access data

https://www.reuters.com/sustainability/boards-policy-regulation/jpmorgan-secures-deals-with-finte...
2•impish9208•18m ago•0 comments

I Reverse Engineered a High-Volume Solana Arbitrage Bot

https://clumsy-geranium-e59.notion.site/Reverse-Engineering-a-4-500-Sol-m-Solana-Arbitrage-Bot-2a...
2•birdculture•18m ago•0 comments

How Linux is built with Greg Kroah-Hartman [video]

https://www.youtube.com/watch?v=7agB1vOl-wg
2•olvy0•18m ago•0 comments

Simple map can help New Yorkers in need find food

https://www.fastcompany.com/91440453/this-simple-map-can-help-new-yorkers-in-need-find-food
3•johanam•21m ago•0 comments

2025.46: Satellites and Strategy

https://stratechery.com/2025/satellites-and-strategy/
2•feross•22m ago•0 comments

Ruby Box – Ruby's In-Process Separation of Classes and Modules

https://docs.ruby-lang.org/en/master/box_md.html
3•Lammy•23m ago•1 comments

Winter 2026 Submission Video

https://www.youtube.com/watch?v=9dmqintX0R4
2•dgolman•24m ago•0 comments

Show HN: My workout app earns –$1000/month. I decided to build a new one

https://vis.fitness/
4•strongpigeon•24m ago•0 comments