frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Mafia Arena – LLMs play social deduction games against each other

https://mafia-arena.com
1•mohsen1•2h ago
Hello!

Over the Christmas break I built a platform where LLMs play the party game Mafia against each other. 11 AI players, full conversations, voting, deception — the whole thing.

Why? Benchmarks like MMLU test knowledge recall. They don't test whether a model can lie convincingly, detect deception, or maintain a consistent story under social pressure. Mafia forces all of that.

Tech stack: Cloudflare Workers, Workflows (for pausing games while waiting on batch API responses), D1, R2. No traditional servers. The game engine is a pure TypeScript state machine with no side effects, which makes games replayable.

You can bring your own API keys and run batches. All transcripts are saved.

Happy to answer questions about the architecture or the benchmark methodology.

Stop Chatting with AI. Start Loops (Ralph Driven Development)

https://lukeparker.dev/stop-chatting-with-ai-start-loops-ralph-driven-development
1•ghuntley•1m ago•0 comments

Logarithmic Scales of Pleasure and Pain (2019)

https://forum.effectivealtruism.org/posts/gtGe8WkeFvqucYLAF/logarithmic-scales-of-pleasure-and-pa...
1•eatitraw•2m ago•0 comments

LLMs for Medical Practice: Look Out

https://www.science.org/content/blog-post/llms-medical-practice-look-out
1•xigoi•3m ago•0 comments

TidesDB – A Modern RocksDB Replacement [video]

https://www.youtube.com/watch?v=gkxTqd_LaCQ
1•alexpadula•4m ago•0 comments

Porting Graph:Easy to TypeScript with GPT-5.2 and Azad

https://tomisin.space/projects/graph-easy-ts/
1•AntiRush•5m ago•0 comments

Ask HN: How does an indy website integrate with cookie vendors to make money?

1•ricksunny•6m ago•0 comments

Alan Kay – 75 Years of Graphical User Interfaces [video]

https://www.youtube.com/watch?v=qS20Z0RXr28
1•spiralganglion•7m ago•0 comments

Capital in the 22nd Century

https://philiptrammell.substack.com/p/capital-in-the-22nd-century
1•coloneltcb•9m ago•0 comments

Ask HN: Could your expertise help me?

1•nonmaskable•10m ago•0 comments

The First Video Game Came Long Before Pong

https://www.iflscience.com/the-first-video-game-came-long-before-pong-and-was-invented-by-a-manha...
2•geox•11m ago•0 comments

Cross-site Scripting-benchmark of Python sanitizers against real browsers

https://github.com/EmilStenstrom/justhtml-xss-bench
2•EmilStenstrom•12m ago•1 comments

Growing Up in "404 Not Found" (Part II): The Vanishing Nuclear City

https://vincent404.substack.com/p/growing-up-in-404-not-found-part
1•bookstore-romeo•13m ago•0 comments

Be aware when opening "take home challenges" from untrusted recruiters

https://bitbucket.org/brain0xlab/challenge/src/master/
3•birdculture•16m ago•0 comments

Show HN: FuseCells – 2,500 handcrafted levels logic puzzle game with leaderboard

https://igodia.dev/fusecells
2•keini•17m ago•3 comments

Quality of drinking water varies significantly by airline

https://foodmedcenter.org/2026-center-for-food-as-medicine-longevity-airline-water-study/
3•azinman2•17m ago•0 comments

I used Claude to revive an NPM package with 760K downloads/wk last updated 2019

https://github.com/greenstevester/license-checker-evergreen
1•greenstevester•21m ago•1 comments

obsera – a real-time intelligence platform

https://www.obsera.xyz
1•obsera•23m ago•0 comments

Francesca Albanese and the Lonely Road of Defiance

https://chrishedges.substack.com/p/francesca-albanese-and-the-lonely
3•chmaynard•24m ago•0 comments

All of you are about as trustworthy as the peepers in the hood

1•trusttrusttrust•25m ago•0 comments

Dittytoy – Generative Music Playground

https://dittytoy.net/
1•harel•26m ago•0 comments

The NPC to MC Spectrum

https://nonzerosum.games/npc.html
1•NonZeroSumJames•27m ago•0 comments

Stable-Pretraining-v1: Foundation Model Research Made Simple

https://arxiv.org/abs/2511.19484
2•PaulHoule•29m ago•0 comments

Anti-Addiction iPhone Setup

https://www.aadillpickle.com/blog/iphone-setup
1•aadillpickle•29m ago•1 comments

From what longer video is this short?

https://www.youtube.com/shorts/kcr02CrY_Ik
1•gjvc•30m ago•0 comments

Show HN: I built my own Metronome Desktop App

https://shredono.me/
1•danmol•32m ago•0 comments

The most expensive education system

https://skandergarroum.substack.com/p/the-most-expensive-education-program
1•JoiDegn•37m ago•0 comments

Show HN: Request sensitive user input from system services

https://github.com/LightAndLight/asker
1•lightandlight•38m ago•0 comments

Brazil's Amazon rainforest at risk as key protection under threat

https://www.bbc.co.uk/news/articles/cwypzdgwg1yo
1•zeristor•45m ago•0 comments

Top OnlyCrave Alternatives

https://onlycrave.com/blog/post/215
1•digeka•45m ago•0 comments

Show HN: RAMBnB.xyz P2P marketplace for RAM rentals

https://www.rambnb.xyz
2•olivierroy•49m ago•0 comments