LLM Economist – Mechanism Design for Simulated Agent Societies

https://github.com/sethkarten/LLM-Economist

2•milkkarten•3h ago

Comments

milkkarten•3h ago

We simulate large-scale agent societies where heterogeneous personas work, adapt, and vote—governed by an in-context planner optimizing social welfare.

The system models decentralized governance, dynamic tax policy, and institutional evolution—entirely via in-context reinforcement learning, no fine-tuning required.

Full paper (arXiv): https://arxiv.org/abs/2507.15815

slwvx•1h ago

I like the idea of simulating a society! I don't pretend to understand everything that you're doing, so please correct me where I'm wrong below.

The right side of Fig 5a shows that your LLM tool has 80% tax for people making between 0 and $11.6k/year, then drops to about 30% for the next tax bracket, with other tax brackets moving around all over the place. This seems to be designed to induce people to NOT pay taxes.For all its faults, I think the US progressive system is fairly rational and does a pretty good job of inducing people to actually pay taxes [1]; specifically the (effectively) negative tax rate in the US for low-income people gets them in the habit of paying taxes. I.e. whatever underlying model of social welfare you are assuming to get the great social welfare on the right side of Fig 5a seems to not model real people. I wonder if some LLM hallucinations are going on under the hood to create the strange behavior in Fig 5a.

Some questions: You don't seem to model the US system of tax credits; is that right? Also, is there a Saez tax below $47.2k in Fig 5a? What about between $244k and $609k? I.e. is the Saez tax ever under the LLM tax?

[1] https://blogs.worldbank.org/en/governance/why-does-progressi...

milkkarten•47m ago

These are the marginal tax rates not the effective tax rate (e.g. 80% of first $10k, 30% of $10k-20k). We do not model tax credits here. We try to keep the system as simple as possible so that we can effectively evaluate changes. As is, the Economic theory is intractable once we introduce bounded rationality from purely rational. We do think in future work we can potentially work out some smoothness in the overall tax rate but try to let the LLM planner try what it thinks is best in order to help test the in-context optimization capabilities.

Also, while there is a complicated tax code in the US, in our simulation there is no way for agents to avoid paying taxes :)

The Saez tax rates are perturbed from the LLM Economist's tax rates to find the theoretically optimal values according to the economic theory.

Thanks for the interest and I hope that this helps clarify some of the details.

Show HN: Send JSON Data directly to a table straight from your web app code

Try the #1 Photorealistic Image Generator

The Demon that Wears my Face

DuckLake for Busy Engineering Managers

Slow Opportunities for Investing Locally

The Story of Reddit [audio]

Mothers Ruin Software - Independently developled free Mac apps

The Extent of Bladed Terrain on Pluto via Photometric Surface Roughness

Is It Still Disney Magic If It's AI?

Skyscraper-size spikes of methane ice may surround Pluto's equator

Roll Your Own Virtual Ontology with Claude Code

Realizing we needed two sorts of alerts for our temperature monitoring

What's Next for AI at DeepMind

Optimal sizing, control of a grid-connected battery in a stacked revenue model

Banana Production in Iceland

Write Faster with AI – While Keeping It Yours

Google Adds Veo Watermark to All Veo3 Videos Except for Ultra Members

Heavy metals in Nestlé baby food but off the hook for liability

Shroud of Turin image matches low-relief statue–not human body,3D modeling study

The WOBA Paradox: Signal vs. Noise in Baseball Performance Metrics

Show HN: Flash AI Presentation Maker

Are you an American looking to buy a used camera from overseas? Do it quick

LLMs won't save you from bad software engineering culture

SF Fake Pitch Competition

A parser for TypeScript types, written in TypeScript types

Show HN: GPT helped me rebuild a .NET app in 30 mins what took 3 weeks in MFC

Ready or not, age verification is rolling out across the internet

Apple Hiring for 'Answers' Team Working on 'ChatGPT-Like Search'

'Drone followed Gaza colleague home and wiped out his family', says doctor

Advice I Got at PagerDuty