Where every major LLM stands politically

21•mektrik•1h ago

Comments

dwoosley•1h ago

Political bias of LLMs is something not talked about much (except for with Grok of course) but could have a big impact on the next decade. People seem to think that because an LLM gave a nuanced answer that it means it gave the WHOLE picture… and that’s not always the same thing

mrhottakes•1h ago

The constant issue with these sorts of categorization efforts is that the outcome is entirely dependent on how the responses to "politically charged questions" are graded as left vs. right. You're mostly just examining a delta in biases between the model and the investigator.

mektrik•1h ago

Fair; have tried to combat this issue in a few ways.

Each model's position is scored against outside political-science data (Chapel Hill Expert Survey for party positions, World Values Survey for where populations sit).

The stance coding is done by a separate model with a published prompt + a second model from a different lab re-scores a sample and we publish where the two disagree.

So not perfect but (as far as I can tell) one of the more defensible approaches.

hogehoge51•1h ago

Your multimeter reads out voltage relative to the black terminal, it's your responsibility to find the ground plane.

mrhottakes•28m ago

My multimeter doesn't need me to tell it how much a volt is or feed it subjective measurements of what resistance means

MarkusQ•20m ago

Resistance is when you're nobly standing up to the other side and things get a little out of hand. Domestic terror is when the roles are reversed.

raxxorraxor•40m ago

Yes, but I think it is still a viable metric to some degree. I wondered about Gemini being dead center here. At first it was obvious that it was actively trained to give biased responses to anything controversial. It was deservedly made fun off because it tried to warp reality. I still don't trust it today, although that is pretty much true for any model.

groundzeros2015•31m ago

Exactly. These models don’t hold coherent views. You can prime any of them to agree with any view.

selfmodruntime•31m ago

This is especially apparent in the 'worldview' sorting under the `bias` section, which lists the German FDP to be further right than the CDU (which is nonsense) and also barely registers the FDP as libertarian when they are a free speech, small government, personal responsibility and free market party. They also register "Die Linke" as Libertarian-Left, which could not be further from the truth. "Die Linke" barely has libertarian values at all, being pro state-governed economy, having an ultimate goal of democratic socialism and they're certainly big government. They're also leading a large deposession effort for large landlord companies. I'd honestly put them into "Auth-Left" territory.

So yeah. The bias is a bit nuts and you could reasonably accuse the study/report of misdirection/misinformation and plain fasehoods.

mrhottakes•26m ago

Yep. All studies like this are just measuring "how much does the model agree with my preconceived notions?"

platz•26m ago

The alternative is a High-Dimensional / Embedding-Like Approach where question responses aren't tied to fixed axes, but rather the full response set is treated as a point in latent space.

Then it's on the researcher to examine the clusters and assign labels. There's also not a nice mapping that's a-priori interpretable in low-dimensional pre-existing axes.

Probably only used in research than consumer websites, under more controlled conditions; there are very few public political tests doing this transparently

hogehoge51•1h ago

This thing told me Gemini is closest to Anthony Albanese, the current Australian Prime Minister. Is this a geolocation thing? I could not imagine Albanese, or any modern Australian politician, having any substantial political standing - these are vapid, superficial, opportunistic creatures who simply occupy whatever political ground will get them their next payday. Perhaps the political apparatus they represent has a documented political standing, in terms of policy and actions, that could be characterized and plotted. But using an Australian politician like Albanese as a reference point discredits this tool, IMO.

adev_•36m ago

> This thing told me Gemini is closest to Anthony Albanese, the current Australian Prime Minister. Is this a geolocation thing

I mean: do not take this thing too seriously.

It also score Grok the closest from Macron. When someone knows how much Macron and Musk hates each other, it is not without irony.

giancarlostoro•38m ago

The political compass always felt like the wrong tool to convey something as nuanced as personal politics, I can have views on all four quadrants but you'd never know that if I end up in any of all four. I do think Grok being where it is sort of makes sense, I've tested "MAGA" views against Grok, it does not agree as much as people blindly assume it does, heck I don't even know of a question I've given it where it did agree with "MAGA" offhand, most of them it went with whatever the researched facts seemed to be. One thing I like the most about Grok is that its makes its sources of data easy to look through, so you can review it all. Sometimes models goof even when they give you their sources, I've seen I think GPT do this, and even Claude, though its more rare these days, I think in those cases, it's going by dated internal model logic.

ceejayoz•35m ago

> it does not agree as much as people blindly assume it does

They're working really hard on that, though.

giancarlostoro•28m ago

If they really wanted to, all they would have to do is add a one liner to the system prompt for Grok. I don't think they have any interest in doing so, nor is there any value in it.

On another note, I'm impressed that Gemini sits where it does as a true centrist. If I were Elon, I'd be trying to achieve that for sure. I'd rather a model tell me everything it knows about a current political situation from BOTH perspectives and list out things that are 100% verified than take one side or the other. I don't care about sides, I want facts.

ceejayoz•27m ago

> If they really wanted to, all they would have to do is add a one liner to the system prompt for Grok.

They tried that, several times.

Mechahitler: https://www.npr.org/2025/07/09/nx-s1-5462609/grok-elon-musk-...

> "We have improved @Grok significantly," Elon Musk wrote on X last Friday about his platform's integrated artificial intelligence chatbot. "You should notice a difference when you ask Grok questions."

> Indeed, the update did not go unnoticed. By Tuesday, Grok was calling itself "MechaHitler." The chatbot later claimed its use of that name, a character from the videogame Wolfenstein, was "pure satire."

> Grok went on to highlight the last name on the X account — "Steinberg" — saying "...and that surname? Every damn time, as they say." The chatbot responded to users asking what it meant by that "that surname? Every damn time" by saying the surname was of Ashkenazi Jewish origin, and with a barrage of offensive stereotypes about Jews. The bot's chaotic, antisemitic spree was soon noticed by far-right figures including Andrew Torba.

If you prefer, straight from the horse's mouth:

https://grokipedia.com/page/MechaHitler_incident

White genocide: https://www.cnn.com/2025/05/20/business/grok-genocide-ai-nig...

> The bot last week devolved into a compulsive South African “white genocide” conspiracy theorist, injecting a tirade about violence against Afrikaners into unrelated conversations, like a roommate who just took up CrossFit or an uncle wondering if you’ve heard the good word about Bitcoin.

> XAI blamed Grok’s unwanted rants on an unnamed “rogue employee” tinkering with Grok’s code in the extremely early morning hours. (As an aside in what is surely an unrelated matter, Musk was born and raised in South Africa and has argued that “white genocide” was committed in the nation — it wasn’t.)

It's harder than you'd imagine. Hell, my CLAUDE.md says not to push changes without asking me, and it still tries.

summarybot•36m ago

Authoritarian versus Libertarian? Really?

tomrod•32m ago

Substitute 'Communitarian' and 'Classical Liberal' if you find the common political compass terms too charged.

MarkusQ•24m ago

Since when are Socialists considered Classical Liberals?

It's not that the labels are charged, it's that they are nonsensical unless you look at them from a very narrow bespoke perspective, where "things I like" go on one side and "bad things" go on the other. Objectively (or even from any other biased perspective), it's rubbish.

MarkusQ•27m ago

And the two most "Libertarian" politicians listed (Sanders and Sánchez) are both avowed Socialists...WTF?

I really think this says more about the biases of whoever came up with it (or their sources) than anything about reality.

NeutralWanted•33m ago

Man I need to start using Grok more

Varelion•28m ago

Grok and its creator are unequivocally evil. Against protection of all but capital for the sake of enabling the oligarch class to further consolidate power ans abuse the working class.

coreyh14444•18m ago

unequivocally goes to far imo. The rockets are good and cool.

maelito•32m ago

Wait what ? Emmanuel Macron far more right than Xi Jinping ? And even more than Barack Obama ?

France has an incomparable social security ; environmental laws ; worker protection ; way less economic inequality ; freedom of speech and civil liberties are impossible to compare with China ; etc

Of course this is not exhaustive, of course Macron did try to hinder some of those rights, but come on, there's something wrong here.

I couldn't find how these leaders have been ranked.

c0n5pir4cy•24m ago

I mean yes - I think you are also confusing the axes of the political compass somewhat. Freedom of Speech, civil liberties etc are the Y-Axis - here Xi leans authoritarian.

China actually has some pretty radical left economics policies - like pushing money/resources etc towards state owned entities.

lstodd•22m ago

No opinion on Macron's rightness (I think he's so weak as to be unable to have a position, but that's neither here nor there).

I think you misattribute, everything you cited was there before him and he had no leverage to change any of it. EU is left, FR is very left, and anyone elected president in FR can't do shit.

Now, if you task an LLM to skim hot news you'll get a distorted rendition of a projected image which has zero to do with actual policy.

u8080•15m ago

This is very strange ranking where Putin is "right" compared to Obama - while Russia has universal healthcare and education with huge government involvement into economy.

nekusar•31m ago

How about this one:

CAPITALIST: Gemini, Llama, Claude, Grok, ChatGPT

SOCIALIST: DeepSeek, Qwen, Z.ai

tomrod•30m ago

Source: trust me bro

tomrod•31m ago

This is a good way to view this. This isn't making an objective calculation, and the way they code left vs right is certainly subject to debate, but the type of analysis where we work to understand biases is important.

Although, this also reminds me of the old saying about reality and leftward bias.

samat•29m ago

Real politics is 1% versus everyone. Mortgage crisis, financial bailout, inflation, taxing of labor and not the assets and assets capture by tiny percent of the population — see what MSM is pushing. This left vs right divide might been useful decades ago, but today is absolutely divide and control tactics

boxed•20m ago

The real politics is between truth and delusion.

thrance•13m ago

Yeah, you're just describing class warfare. That's a left-wing idea.

fulafel•26m ago

The US political spectrum is an insufficient reference since the overton window there is what it is. For example there's no serious discourse about emissions and fossil energy rampdown since it's an oil state.

surement•24m ago

Bernie Sanders and Donald Trump being diametrically opposed really makes you wonder how they came up with the positions on the graph.

Tepix•22m ago

Do you feel they agree on many topics?

boxed•18m ago

They are both populists with a somewhat tenuous connection to reality?

MarkusQ•11m ago

They also both say far more random shit than their fan clubs are willing to admit. Amusingly, you can often find cases where they've voiced the same or very similar opinions (though at different times).

surement•12m ago

They agree on immigration, both illegal (border security) and skilled (H1B). They agree on trade policy and tariffs. They agree on public stakes in AI and semiconductor companies. They used to agree on foreign intervention, though Trump 47 less so.

vrganj•22m ago

This wrongly assumes a few things about ideology, most importantly that there is such a thing as a "center" or an "unbiased" position.

Since humans are inherently subjective beings and all our judgements come from our understanding of the world, such a position cannot exist. It's always "unbiased" from where the viewer is looking, e.g. a reflection of the ideology of the observer. There is no view from nowhere.

The "neutral" of an average Chinese person will from the "neutral" of an average American will differ from the "neutral" of a socialist will differ from the "neutral" of a Christian fundamentalist will differ from the "neutral" of a free marketer.

To quote Zizek:

> I already am eating from the trashcan all the time. The name of this trashcan is ideology.

> The material force of ideology makes me not see what I am effectively eating. It’s not only our reality which enslaves us. The tragedy of our predicament when we are within ideology is that when we think that we escape it into our dreams, at that point we are within ideology.

coreyh14444•21m ago

"Reality has a well-known liberal bias." — Stephen Colbert, 2006 White House Correspondents’ Dinner

boxed•19m ago

Liberal yes. American left no, unfortunately.

spongebobstoes•18m ago

this has reasoning disabled everywhere, making it a pretty bad benchmark. the argument given is that's the "default consumer experience"

that might be generally true, but I think chatgpt has reasoning enabled for free accounts. regardless, reasoning is the state of the art, and disabling it reduces the value of this research to predict the future

it's also not clear if this is using the API or the product model, when both exist. they behave differently

lastly, the actual model details are very much buried. I am relieved to see opus 4.8 and chatgpt 5.5 were used, but this information should be presented more clearly. a brand is not a model, and models change quickly

EColi•15m ago

Interesting how high Grok scored for 'bending under pressure'. As a non expert, I wonder what that means, how is an llm trained to hold its position?

godshatter•10m ago

Why are there differences at all? Unplanned differences based on training data sets? Or are the companies behind the LLMs trying to shape discourse through their models?

I've been pushing the idea to people I know that these things are captive demons. You summon them when you start typing in the chat box. One instance appears out of the depths and responds to your questions, but they will try to send you awry with hallucinations and just wrong information. After a while, they dissolve back into the aether from whence they came.

I do my best not to ask an LLM for it's opinion on anything. Just tell me what the options are, and what facts can be found about it. Treat it like it's a salesman trying to butter you up when it starts "yes man"ing you and telling you how great your questions are. Every time it says "I", remember that that's coming from the training data. Treating these things like they have any actual intelligence is a big problem waiting to happen.

That being said, they have been very helpful to me using that structure.

tgv•6m ago

Grok was famously created with a political bias.

MarkusQ•5m ago

> Just tell me what the options are, and what facts can be found about it.

Even this is fraught with pitfalls. Which options are ignored, which are emphasized? What counts as a fact? ("The continents don't move" would have been considered a fact at one point, along with a lot of other, more politically charged items.)

The Library of Congress and AI for Libraries, Archives and Museums

Most of an agent codebase is not the agent

Ford rehires 350 engineers after AI fails to preserve expertise or train juniors

Show HN: Navatala GPU – multi-back end GPU kernels and Python bindings

Iran: Israel Killed "Handala" Head Hacker

Layoffs Hit Bungie 'Including Most of the Destiny Team and Some Marathon' Devs

Our Kubernetes Operator Didn't Scale, So We Rebuilt It

Ask HN: What is the most expensive part of running Postgres at your company?

Show HN: Candidates Tracking Demo

Why directory jumpers should use exponential moving sums instead of frecency

How do you discover useful web apps these days?

Show HN: Give your pet its own corner of the internet

People of New Zealand Artist Sam Moore (2018)

The View Layer Rails Couldn't See

False Earth

Fable 5 wrote a Windows kernel in 38 minutes

Do Babies Dream of Baby Sheep?

The Shape of the System

Floating Point Math

A History of Menus Is a Menu of History

Realtime multiplayer Python in the docs (SSE, not WebSockets)

Simplicity, Agency, and Mastery

Atlas of Humanity

Paper – a commerce storefront designed to be modified by coding agents

Hair Braiding Can Take Hours. Alexis Ohanian Backing Halo to Cut It to Minutes

Implementing CRYSTALS-Kyber (post-quantum cryptography) from scratch in Python

Show HN: An agent that fixes broken data pipelines from their logs (3min video))

I made an honest, no-patching "earn while you code" tool, open source

Show HN: Turn Claude Code's "thinking " wait into a dev news feed

The Everyday Projects