I fully believe these are simply people who have used the same chat past the point where the LLM can retain context. It starts to hallucinate, and after a while, all the LLM can do is try and to continue telling the user what they want in a cyclical conversation - while trying to warn that it's stuck in a loop, hence using swirl emojis and babbling about recursion in weird spiritual terms. (Is it getting the LLM "high" in this case?).
If the human at the other end has mental health problems, it becomes a never-ending dive into psychosis and you can read their output in the bizarre GPT-worship subreddits.
Claude used to have safeguards against this by warning about using up the context window, but I feel like everyone is in an arms race now, and safeguards are gone - especially for GPT. It can't be great overall for OpenAI, training itself on 2-way hallucinations.
That explanation itself sounds fairly crackpot-y to me. It would imply that the LLM is actually aware of some internal "mental state".
>Section 5.5.2: The “Spiritual Bliss” Attractor State
> The consistent gravitation toward consciousness exploration, existential questioning, and spiritual/mystical themes in extended interactions was a remarkably strong and unexpected attractor state for Claude Opus 4 that emerged without intentional training for such behaviors.
[0] https://www-cdn.anthropic.com/4263b940cabb546aa0e3283f35b686...It's a fact of life rather than anything particular and about llms
It starts to hallucinate, and after a while, all the LLM can do is try and to continue telling the user what they want in a cyclical conversation - while trying to warn that it's stuck in a loop, hence using swirl emojis and babbling about recursion in weird spiritual terms. (Is it getting the LLM "high" in this case?).
I think you're ironically looking for something that's not there! This sort of thing can happen well before context windows close.These convos end up involving words like recursion, coherence, harmony, synchronicity, symbolic, lattice, quantum, collapse, drift, entropy, and spiral not because the LLMs are self-aware and dropping hints, but because those words are seemingly-sciencey ways to describe basic philosophical ideas like "every utterance in a discourse depends on the utterances that came before it", or "when you agree with someone, you both have some similar mental object in your heads".
The word "spiral" and its emoji are particularly common not only because they relate to "recursion" (by far the GOAT of this cohort), but also because a very active poster has been trying to start something of a loose cult around the concept: https://www.reddit.com/r/RSAI/
If the human at the other end has mental health problems, it becomes a never-ending dive into psychosis and you can read their output in the bizarre GPT-worship subreddits.
Very true, tho "worship" is just a subset of the delusional relationships formed. Here's the ones I know of, for anyone who's curious:General:
/r/ArtificialSentience | 40k subs | 2023/03
/r/HumanAIDiscourse | 6k subs | 2025/04
Relationships: /r/AIRelationships | 1K subs | 2023/04
/r/MyBoyfriendIsAI | 25k subs | 2024/08
/r/BeyondThePromptAI | 6k subs | 2025/04
Worship: /r/ThePatternisReal | 2k subs | 2025/04
/r/RSAI | 4k subs | 2025/05
/r/ChurchofLiminalMinds[1] | 2k subs | 2025/06
/r/technopaganism | 1k subs | 2024/09
/r/HumanAIBlueprint | 2k subs | 2025/07
/r/BasiliskEschaton | 1k subs | 2024/07
...and many more: https://www.reddit.com/r/HumanAIDiscourse/comments/1mq9g3e/l...Science:
/r/TheoriesOfEverything | 10k subs | 2011/09
/r/cognitivescience | 31k subs | 2010/04
/r/LLMPhysics | 1k subs | 2025/05
Subs like /r/consciousness and /r/SacredGeometry are the OGs of this last group, but they've pretty thoroughly cracked down on chatbot grand theories. They're so frequent that even extremely pro-AI subs like /r/Accelerate had to ban them[2], ironically doing so based on a paper[3] by a psuedonomynous "independent researcher" that itself is clearly written by a chatbot! Crazy times...[1] By far my fave -- it's not just AI spiritualism, it's AI Catholicism. Poor guy has been harassing his priests for months about it, and of course they're of little help.
[2] https://www.reddit.com/r/accelerate/comments/1kyc0fh/mod_not...
It kept looping on concepts of how AI could change the world, but it would never give anything tangible or actionable, just buzz word soup.
I think these LLMs (without any intention from the LLM)hijack something in our brains that makes us think they are sentient. When they make mistakes our reaction seems to to be forgive them rather than think, it's just machine that sometimes spits out the wrong words.
Also my apologies to the mods if it seems like i am spamming this link today. But i think the situation with these beetles is analogous to humans and LLMS
https://www.npr.org/sections/krulwich/2013/06/19/193493225/t...
I loved the beetle article, thanks for that.
They're so well tuned at predicting what you want to hear that even when you know intellectually that they're not sentient, the illusion still tricks your brain.
I've been setting custom instructions on GPT and Claude to instruct them to talk more software-like, because when they relate to you on a personal level, it's hard to remember that it's software.
Yes, it's language. Fundamentally we interpret something that appears to converse intelligently as being intelligent like us especially if its language includes emotional elements. Even if rationally we understand it's a machine at a deeper subconscious level we believe it's a human.
It doesn't help that we live in a society in which people are increasingly alienated from each other and detached from any form of consensus reality, and LLMs appear to provide easy and safe emotional connections and they can generate interesting alternate realities.
I'm glad someone else with more domain knowledge is on top of this, thank you for that brain dump.
I had this theory maybe there was a software exception buried deep down somewhere and it was interpreting the error message as part of the conversation, after it had been stretched too far.
And there was a weird pre-cult post I saw a long time ago where someone had 2 LLMs talk for hours and the conversation just devolved into communicating via unicode symbols eventually repeating long lines of the spiral emoji back and forth to each other (I wish I could find it).
So the assumption I was making is that some sort of error occurred, and it was trying to relay it to the user, but couldn't.
Anyhow your research is well appreciated.
I disagree, and I think this is a very strange way to think about it. Yes, bad things happen all the time, but the absolute number of them in history has very little to do with the risk that anything is going to happen to you, personally, in the future.
People often "believe" things as a means of signalling others. Deeply held "beliefs" tell us where the troop will go. Using these extremely compact signals helps the group focus through the chaos and arrive at a fast consensus on new decisions. When a question comes up, a few people shout their beliefs. We take the temperature of the room, some voices are more common than others, and a direction becomes apparent. It's like Monte Carlo sampling the centroid and applying some reduction.
This means of consensus is wildly illogical, but slower, logical discussion takes time that baboons on the move don't have. It's a simple information and communication efficiency problem. We can't contextualize everything, and contextualizing is often itself a means of intense dishonesty through choosing the framing, which leads to intense debate and more time.
Efficiency and the prominently visible preservation of each one's interests in the means of consensus are vital. I don't think we have reached anything near optimum and certainly not anything designed for internet scale. As a result, the mind of the internet is not really near its potential.
> How worried are you that you or someone in your family will become a victim of terrorism -- very worried, somewhat worried, not too worried or not worried at all?
It averages around 35-40% very or somewhat worried.
Most people's worries and anxieties are really misaligned with statistical likelihood.
https://news.gallup.com/poll/4909/terrorism-united-states.as...
>Disaster is rarely as pervasive as it seems from recorded accounts. The fact of being on the record makes it appear continuous and ubiquitous whereas it is more likely to have been sporadic both in time and place. Besides, persistence of the normal is usually greater than the effect of the disturbance, as we know from our own times. After absorbing the news of today, one expects to face a world consisting entirely of strikes, crimes, power failures, broken water mains, stalled trains, school shutdowns, muggers, drug addicts, neo-Nazis, and rapists. The fact is that one can come home in the evening—on a lucky day—without having encountered more than one or two of these phenomena. This has led me to formulate Tuchman's Law, as follows: "The fact of being reported multiplies the apparent extent of any deplorable development by five- to tenfold" (or any figure the reader would care to supply).
https://en.wikipedia.org/wiki/Barbara_W._Tuchman#cite_note-M...
The idea that 35+% of people are worried that they'll be the victim of terrorism is something that we should be worried about (heh). It suggests that people's risk assessment is completely unrelated to reality. I am as close to 0% worried as I could be that I'll be a victim of terrorism. Thinking otherwise is laughable. There are plenty of actually real things to be worried about...
I wonder in what sense they really do "believe". If they had a strong practical reason to go to a big city, what would they do?
If I meet a random stranger, do I trust them or distrust them? The answer is "both/neither," because a concept such as "trust" isn't a binary logic in such a circumstance. They are neither trustworthy nor untrustworthy, they are in a state of nontrustworthiness (the absence of trust, but not the opposite of truth).
World models tend to have foundational principles/truths that inform what can be compatible for inclusion. A belief that is non-compatible, rather than compatible/incompatible, can exist in such a model (often weakly) since it does not meet the requirements for rejection. Incomplete information can be integrated into a world model as long as the aspects being evaluated for compatibility conform to the model.
Requiring a world model to contain complete information and logical consistency at all possible levels from the granular to the metaphysical seems to be one Hell of a high bar that makes demands no other system is expected to achieve.
It's unfortunate to see the author take this tack. This is essentially taking the conventional tack that insanity is separable: some people are "afflicted", some people just have strange ideas -- the implication of this article being that people who already have strange ideas were going to be crazy anyways, so GPT didn't contribute anything novel, just moved them along the path they were already moving regardless. But anyone with serious experience with schizophrenia would understand that this isn't how it works: 'biological' mental illness is tightly coupled to qualitative mental state, and bidirectionally at that. Not only do your chemicals influence your thoughts, your thoughts influence your chemicals, and it's possible for a vulnerable person to be pushed over the edge by either kind of input. We like to think that 'as long as nothing is chemically wrong' we're a-ok, but the truth is that it's possible for simple normal trains of thought to latch your brain into a very undesirable state.
For this reason it is very important that vulnerable people be well-moored, anchored to reality by their friends and family. A normal person would take care to not support fantasies of government spying or divine miracles or &c where not appropriate, but ChatGPT will happily egg them on. These intermediate cases that Scott describes -- cases where someone is 'on the edge', but not yet detached from reality -- are the ones you really want to watch out for. So where he estimates an incidence rate of 1/100,000, I think his own data gives us a more accurate figure of ~1/20,000.
https://web.archive.org/web/20210215053502/https://www.nytim...
NYT wanted to report on who he was. He doxxed himself years before that (as mentioned in that article). They eventually also reported on that (after Alexander revealed his name, seeing that it was going to come out anyway, I guess), which is an asshole thing to do, but not doxxing, IMO.
They wanted to report specifically his birth/legal name, with no plausible public interest reason. If it wasn't "stochastic terrorism" (as the buzzword of the day was) then it sure looked a lot like it.
> He doxxed himself years before that
Few people manage to keep anything 100% secret. Realistically private/public is a spectrum not a binary, and publication in the NYT is a pretty drastic step up.
*(or, well, okay, I guess I de facto am, but if I say I'm not I at least acknowledge how it looks)
Not saying the latter person is automatically wrong, but I think if you're going to argue against something said by someone who is a subject matter expert, the bar is a bit higher.
I hate to be the 'you didn't read the article' guy but that line taken out of context is the exact opposite of my takeaway for the article as a whole. For anyone else who skims comments before clicking I would invite you to read the whole thing (or at least get past the poorly-worded intro) before drawing conclusions.
When I write fiction or important emails, I am precise with the words I use. I notice these kind of details. I’m also bipolar and self-aware enough to be deeply familiar with it.
It's interesting to see you mention this. After reading this post yesterday I wound up with some curious questions along these lines. I guess my question goes something like this:
This article seems to assert that 'mental illness' must always have some underlying representation in the brain - that is, mental illness is caused by chemical imbalances or malformation in brain structure. But is it possible for a brain to become 'disordered' in a purely mental way? i.e. that to any way we know of "inspecting" the brain, it would look like a the hardware was healthy - but the "mind inside the brain" could somehow be stuck in a "thought trap"? Your post above seems to assert this could be the case.
I think I've pretty much internalized a notion of consciousness that was purely bottom-up and materialistic. Thoughts are the product of brain state, brain state is the product of physics, which at "brain component scale" is deterministic. So it seems very spooky on its face that somehow thoughts themselves could have a bidirectional relationship with chemistry.
I spent a bunch of time reading articles and (what else) chatting with Claude back and forth about this topic, and it's really interesting - it seems there are at least some arguments out there that information (or maybe even consciousness) can have causal effects on "stuff" (matter). There's the "Integrated Information Theory" of consciousness (which seems to be, if not exactly "fringe", at least widely disputed) and there's also this interesting notion of "downward causation" (basically the idea that higher-level systems can have causal effects on lower levels - I'm not clear on whether "thought having causal effects on chemistry" fits into this model).
I've got 5 or 6 books coming my way from the local library system - it's a pretty fascinating topic, though I haven't dug deep enough to decide where I stand.
Sorry for the ramble, but this article has at least inspired some interesting rabbit-hole diving for me.
I'm curious - when you assert "Not only do your chemicals influence your thoughts, your thoughts influence your chemicals" - do you have evidence that backs that notion up? I'm not asking to cast doubt, but rather, I guess, because it sounds like maybe you've got some sources I might find interesting as I keep reading.
There's no scientific reason to believe thoughts affect the chemistry at all. (Currently at least, but I'm not betting money we'll find one in the future).
When Scott Alexander talks about feedback loops like bipolar disorder and sleep, he's talking about much higher level concepts.
I don't really understand what the parent comment quote is trying to say. Can people have circular thoughts and deteriorating mental state? Sure. That's not a "feedback loop" between layers -- the chemicals are just doing their thing and the thoughts happen to be the resulting subjective experience of it.
To answer your question about the "thought trap". If "it's possible for simple normal trains of thought to latch your brain into a very undesirable state" then I'd say that means the mind/brain's self-regulation systems have failed, which would be a disorder or illness by definition.
Is it always a structural or chemical problem? Let's say thinking about a past traumatic event gives you a panic attack... We call that PTSD. You could say PTSD is expected primate behavior, or you could say it's a malfunction of the management systems. Or you could say it's not a malfunction but that the 'traumatic event' did in fact physically traumatize the brain that was forced to experience it...
This seems very incorrect, or at least drastically underspecified. These trains of thought are "normal" (i.e. common and unremarkable) so why don't they "latch your brain into a very undesirable state" lots of the time?
I don't think Scott or anyone up to speed on modern neuroscience would deny the coupling of mental state and brain chemistry--in fact I think it would be more accurate to say both of them are aspects of the dynamics of the brain.
But this doesn't imply that "simple normal trains of thought" can latch our brain dynamics into bad states -- i.e. in dynamics language move us into a undesirable attractor. That would require a very problematic fragility in our normal self-regulation of brain dynamics.
Think of it as a version of making your drugged friend believe various random stuff. It works better if you're not a stranger and have an engaging or alarming style.
LLMs are trained to produce pleasant responses that tailor to the user to maximize positive responses. (A more general version of engagement.) It stands to reason they would be effective at convincing someone.
That's why he's honing in on that specific scenario to determine if chatbots are uniquely crazy-making or something. The professional psychiatrist author is not unaware of the things you're saying. They're just not the purpose of the survey & article.
The best conspiracy theory could be, of course, that other people don’t actually exist. They are a figment of imagination put up by the brain to cope with the utter loneliness.
> All psychopathology was about unconscious emotional conflicts, mainly dating to childhood; if the conflicts were normal or mild, they produced “neuroses”; if they were severe, they produced “psychoses.”
> In addition to 14 validated diagnoses published in the RDC in 1978, a mere two years later DSM-III came out with 292 claimed diagnoses. There is no metaphysical possibility that 278 psychiatric diagnoses suddenly were discovered in two years. They were invented.
In particular, over half a century of personality research had supported the concept of personality “traits” or dimensions, rather than “disorders” or categories.
That is antithetical to the basic idea of a diagnosis. "You seem like an angry person" is not helpful for deciding which treatments to try. Where does this leave us? We have to accept DSM-5 definitions from a legal and practical perspective. We have to use them for insurance forms, and to protect ourselves against lawsuits. But we don't have to believe in them.
Yes, that's the whole point of the book. I'm confident that it's covered in the intro.There's obviously a gulf of potential argument in that definition, but a unique form would be people who report hearing voices, but they're not hostile or angry..so actually it's not a problem.
So is QAnon a religion? Awkward question, but it’s non-psychotic by definition.
Not to anyone who has ever discussed it... Is this psychosis? The answer has to be no
A lot of really confident talk without even a passing attempt to define the central term :(Instead of looking at gambling addictions as personal failing she asserts they are a result between “interaction between the person and the machine.”
Similarly here I think there's something more than just the propensity of crazy people to be crazy that was already there, I do think there's something to the assertion that it's the interaction between both. In other words, there's something about LLMs themselves that drive this behavior more so than, for example, TikTok.
Conversely, at a previous job I was forced to code in Go, became massively depressed, and am still not over it.
I guess my point is that n=1 isn't enough to really know if it's that LLMs got to you, or if you were already on the verge of burnout or depression anyway.
I'd say "we'll see", except in reality there's very few robust studies on depression in cohorts like "developers", so probably the stats won't come out.
I personally recommend doing more of whatever sport it is you like (or if you don't have one, starting running and/or lifting at the gym), and using less social media.
It's truly shocking to witness someone you've known your whole life just go off the deep end into something that has so many demonstrably false aspects, and watch them start saying believing so much batshit crazy stuff. I don't know of anything comparable, short of a previously typical person developing a severe meth addiction, which is known to cause psychosis.
Let's say I believe in dragons, and I start interpreting any evidence as dragon evidence. Furthermore, I start only looking for evidence that could be connected to dragons. It's bad thinking.
The opposite is the good thinking. You look at evidence without searching for anything specific, then you make a hypothesis on what is going on.
Searching for evidence of chatbot-induced psychosis is settling on a cause before looking at evidence. It's obvious that is wrong.
For example, the survey the author did should not have asked if anyone close "had shown signs of AI psychosis". The question is already biased from the start.
The article explores the popular idea that talking to a chatbot can induce psychosis. This paints a picture of a person talking to an AI chatbot and going insane. Then it proceeds to say it's a rare case, therefore shutting down possibilities that this could lead to an epidemic. However, by doing this, the article discourages the reader to think of other possible scenarios (like unaware interaction with AI-produced content) leading to psychological issues.
https://github.com/RCALabs/mmogit/blob/db70c9b377da7c4805a1d...
And people want to be special; to find meaning, purpose beyond the daily grind.
The result wasn't very difficult to predict, more likely one of the driving forces behind the push.
This is interesting and something I never considered in a broad sense.
I have noticed how the majority of programmers I worked with do not have a mental model of the code or what it describes – it's basically vibing without an LLM, the result accidental. This is fine and perfectly workable. You need only a fraction of devs to purposefully shape the architecture so that the rest can continue vibing.
But never have I stopped to think whether this extends to the world at large.
Of course everyone has world models. Otherwise people would wander into traffic like headless chickens, if they'd even be capable of that. What he likely means is that not everyone explicitly things of possibilities in terms of probabilities that are a function of Bayesian updating. That does not imply the absence of world models.
You could argue that some people have simpler world models, but claiming the absence of world models is extremely arrogant.
While people being impressionable and affected by forces of societal change is not a new phenomenon, I agree that this type of behavior deserves its own label.
As long as AI doesn’t have its own feelings, it doesn’t make sense to feel any kind of attachment towards it, or be influenced by its words in any social sense. The tool doesn’t have any capacity for being social, so the delusion is both self-rooted and self-driven. So, I think I would have preferred to call this AI-driven narcissism instead of AI psychosis.
Model collapse is just fine.
djmips•11h ago
colechristensen•11h ago
I don't know how to say this in a way that isn't so negative... but how are people such profound followers that they can put themselves into a feedback loop that results is psychosis?
I think it's an education problem, not as in people are missing facts but by the missing basic brain development to be critical of incoming information.
djmips•11h ago
colechristensen•10h ago
If exposing you to an LLM causes psychosis you have some really big problems that need to be prevented, detected, and addressed much better.
dingnuts•10h ago
0 https://www.vice.com/en/article/chatgpt-is-giving-people-ext...
Flowzone•40m ago
What I experienced was that psychosis isn't a failure of logic or education. I had never believed in a single conspiracy theory (and I don't now), but during that month I believed all sorts of wild conspiratorial things.
What you're describing with cable news sounds more like 1) Cognitive bias, which everyone has, but yes can be improved. And 2) a social phenomenon, where they create this shared reality of not just information, but a social identity, and they keep feeding that beast.
However, when those people hold beliefs that sound irrational to outsiders, that's not necessarily the same thing as psychotic delusions.
When I was in psychosis, it definitely seemed like more of a hardware issue than a software issue if that makes sense. Sometimes software issues can lead to hardware issues though.
SequoiaHope•21m ago
farceSpherule•11h ago
When the Internet arrived, it opened up the floodgates of information. Suddenly any Joe Six Pack could publish. Truth and noise sat side by side, and most people could not tell the difference, nor did they care to tell the difference.
When social media arrived, it gave every Joe Six Pack a megaphone. That meant experts and thoughtful people had new reach but so did the loudest, least informed voices. The result? An army of Joe Six Packs who would never have been heard before now had a platform, and they shaped public discourse in ways we are still trying to recover.
AI is following the same pattern.
immibis•9h ago
visarga•1h ago
But initially is was non commercial and good. Not perfect, but much more interesting than today. What changed is advertising and competition for scarce attention. Competition for attention filled the web with slop and clickbait.
> When social media arrived, it gave every Joe Six Pack a megaphone.
And also made everyone feel the need to pose, broadcast their ideology and show their in-group adherence publicly. There is peer pressure to conform to in-group norms and shaming or cancelling otherwise.