Front page

The New Gödel Prize Winner Tastes Great and Is Less Filling

https://blog.computationalcomplexity.org/2025/06/the-new-godel-prize-winner-tastes-great.html
11•baruchel•20m ago•0 comments

Why Quadratic Funding Is Not Optimal

https://jonathanwarden.com/quadratic-funding-is-not-optimal/
10•jwarden•29m ago•1 comment

Hokusai Moyo Gafu: an album of dyeing patterns

https://ndlsearch.ndl.go.jp/en/imagebank/theme/hokusaimoyo
5•fanf2•21m ago•1 comment

Bruteforcing the phone number of any Google user

https://brutecat.com/articles/leaking-google-phones
133•brutecat•56m ago•19 comments

Finding Shawn Mendes (2019)

https://ericneyman.wordpress.com/2019/11/26/finding-shawn-mendes/
244•jzwinck•7h ago•38 comments

Doctors Could Hack the Nervous System with Ultrasound

https://spectrum.ieee.org/focused-ultrasound-stimulation-inflammation-diabetes
3•purpleko•35m ago•0 comments

Mushrooms communicate with each other using up to 50 'words', scientist claims

https://www.theguardian.com/science/2022/apr/06/fungi-electrical-impulses-human-language-study
21•Anon84•50m ago•6 comments

LLMs are cheap

https://www.snellman.net/blog/archive/2025-06-02-llms-are-cheap/
154•Bogdanp•3h ago•151 comments

Why Android can't use CDC Ethernet (2023)

https://jordemort.dev/blog/why-android-cant-use-cdc-ethernet/
294•goodburb•18h ago•118 comments

Riding high in Germany on the world's oldest suspended railway

https://www.theguardian.com/travel/2025/jun/09/riding-high-in-germany-on-the-worlds-oldest-suspended-railway
152•pseudolus•15h ago•76 comments

The Child-Like Role of Dogs in Western Societies

https://econtent.hogrefe.com/doi/10.1027/1016-9040/a000552
10•IndrekR•3h ago•2 comments

AI Angst

https://www.tbray.org/ongoing/When/202x/2025/06/06/My-AI-Angst
49•AndrewDucker•4h ago•16 comments

Omnimax

https://computer.rip/2025-06-08-Omnimax.html
160•aberoham•18h ago•38 comments

Endangered classic Mac plastic color returns as 3D-printer filament

https://arstechnica.com/apple/2025/06/new-filament-lets-you-3d-print-parts-in-authentic-1980s-apple-computer-color/
196•CobaltFire•4d ago•61 comments

Administering immunotherapy in the morning seems to matter. Why?

https://www.owlposting.com/p/the-time-of-day-that-immunotherapy
202•abhishaike•22h ago•155 comments

FSE meets the FBI

https://blog.freespeechextremist.com/blog/fse-vs-fbi.html
340•1337p337•13h ago•74 comments

What Is OAuth and How Does It Work?

https://fusionauth.io/articles/oauth/modern-guide-to-oauth
6•xeonmc•1h ago•0 comments

What happens when people don't understand how AI works

https://www.theatlantic.com/culture/archive/2025/06/artificial-intelligence-illiteracy/683021/
152•rmason•18h ago•186 comments

Software is about promises

https://www.bramadams.dev/software-is-about-promises/
61•_bramses•14h ago•16 comments

I used AI-powered calorie counting apps, and they were even worse than expected

https://lifehacker.com/health/ai-powered-calorie-counting-apps-worse-than-expected
175•gnabgib•15h ago•166 comments

My first attempt at iOS app development

https://mgx.me/my-first-attempt-at-ios-app-development
190•surprisetalk•4d ago•117 comments

CoverDrop: A secure messaging system for newsreader apps

https://github.com/guardian/coverdrop
17•andyjohnson0•7h ago•1 comment

Panjandrum: The ‘giant firework’ built to break Hitler's Atlantic Wall

https://www.bbc.com/future/article/20250603-the-giant-firework-built-to-break-hitlers-atlantic-wall
122•rmason•4d ago•135 comments

Gaussian integration is cool

https://rohangautam.github.io/blog/chebyshev_gauss/
185•beansbeansbeans•1d ago•37 comments

Generating Pixels One by One

https://tunahansalih.github.io/blog/autoregressive-vision-generation-part-1/
60•cyruseption•4d ago•2 comments

Analyzing IPv4 Trades with Gnuplot

https://ipv4a-5539ad.gitlab.io/
56•todsacerdoti•13h ago•12 comments

Show HN: Let’s Bend – Open-Source Harmonica Bending Trainer

https://letsbend.de
110•egdels•23h ago•33 comments

FAA to eliminate floppy disks used in air traffic control systems

https://www.tomshardware.com/pc-components/storage/the-faa-seeks-to-eliminate-floppy-disk-usage-in-air-traffic-control-systems
126•daledavies•1d ago•154 comments

Cheap yet ultrapure titanium might enable widespread use in industry (2024)

https://phys.org/news/2024-06-cheap-ultrapure-titanium-metal-enable.amp
110•westurner•4d ago•86 comments

Defiant loyalists paid dearly for choosing wrong side in the American Revolution

https://www.smithsonianmag.com/history/meet-the-defiant-loyalists-who-paid-dearly-for-choosing-the-wrong-side-in-the-american-revolution-180986716/
70•bookofjoe•2h ago•106 comments

What happens when people don't understand how AI works

https://www.theatlantic.com/culture/archive/2025/06/artificial-intelligence-illiteracy/683021/
151•rmason•18h ago

Comments

jdkee•18h ago
Someone said, "The AI you use today is the worst AI that you will ever use."
threeseed•17h ago
I can't tell any difference between Claude 3.5, 3.7 and 4 for coding.

So today's AI is the same AI I used last year. And based on the current trajectory, it's the same AI I will use next year.

sroussey•17h ago
I can tell the difference between those versions of Claude quite easily. Not 10x better each version, but each is more capable and the errors are fewer.
assimpleaspossi•17h ago
But errors nonetheless.
plemer•17h ago
You won’t find perfection anywhere
th0ma5•4h ago
The imperfections of people are less imperfect than the machines.
8note•8h ago
Errors are all over the place no matter what. The question is how predictable they are, and whether they can be spotted as they show up.

The best bugs are the ones that aren't found for 5 years.

smcleod•17h ago
There is certainly a difference, however Anthropic did really well with 3.5 - far, far better than any other provider could do - so the steps from there have been more incremental while other providers have been playing catch-up (for example, Google's Gemini 2.5 Pro is really their first model that's actually useful for coding in any way).
user568439•17h ago
Someone also said the current AI is the least biased.

Future AIs will be more powerful but probably influenced to push users to spend money or to hold a political opinion. So they may enshittify...

Pingk•17h ago
Given that no models are profitable for the parent company afaik, it's only a matter of time before the money-squeezing begins
stirfish•15h ago
<think>I will reply with an example of bias in large language models. This comment seems unrelated to farmers in South Africa. </think>

Ultimately these machines work for the people who paid for them.

add-sub-mul-div•17h ago
That doesn't take into account how it will be used against its customers once they're dependent on it. As any other service inevitably will be.

It's like if we'd said the YouTube we used in 2015 was going to be the worst YouTube we'd ever use.

andy99•17h ago
Is the internet today better than it's ever been? I can easily imagine LLMs becoming ad-ridden, propagandized, Cloudflare-intermediated, enshittified crap the way most of the internet has.
davidcbc•17h ago
Someone could have said "The Google you use today is the worst Google you will ever use" 15 years ago and it may have sounded wise at the time.
dwaltrip•16h ago
That's a good point. But on the other hand Google search usefulness didn't have "scaling laws" / lines on capability graphs going up and up...
rhubarbtree•2h ago
If there is one cliche that I’d like to outlaw, it is this one. It’s a vacuous tautology and yet I hear so many people parrot it all the time.

There are many reasons to believe LLMs in particular are not going anywhere fast.

We need major breakthroughs now, and “chain of thought” is not one.

layer8•18h ago
https://archive.is/NJ9K0
mettamage•17h ago
> Few phenomena demonstrate the perils that can accompany AI illiteracy as well as “Chatgpt induced psychosis,” the subject of a recent Rolling Stone article about the growing number of people who think their LLM is a sapient spiritual guide. Some users have come to believe that the chatbot they’re interacting with is a god—“ChatGPT Jesus,” as a man whose wife fell prey to LLM-inspired delusions put it—while others are convinced, with the encouragement of their AI, that they themselves are metaphysical sages in touch with the deep structure of life and the cosmos. A teacher quoted anonymously in the article said that ChatGPT began calling her partner “spiral starchild” and “river walker” in interactions that moved him to tears. “He started telling me he made his AI self-aware,” she said, “and that it was teaching him how to talk to God, or sometimes that the bot was God—and then that he himself was God.”

This sounds insane to me. When we are talking about safe AI use, I wonder if things like this are talked about.

The more technological advancement goes on, the smarter we need to be in order to use it - it seems.

polotics•17h ago
Psychosis will latch onto anything as a seed idea to express itself; even a pattern as basic as someone walking in lockstep with the soon-to-be patient can trigger a break. So it's not surprising that LLM chats would do the same.
daveguy•17h ago
Yeah, but should we have psychosis force multipliers?
Animats•17h ago
We have that now, in social media. If you create some way for large numbers of people with the same nutty beliefs to easily communicate, you get a psychosis force multiplier. Before social media, nuttiness tended to be diluted by the general population.
daveguy•17h ago
Completely agree with you about social media. I'm not a fan of social media algorithms and believe they also produce much more harm than benefit.
hackingonempty•17h ago
I think insane and lonely people are definitely on the safety radar.

Even if today's general-purpose models and models made by predators can have negative effects on vulnerable people, LLMs could become the technology that brings psych care to the masses.

Animats•17h ago
> Few phenomena demonstrate the perils that can accompany AI illiteracy as well as “Chatgpt induced psychosis,” the subject of a recent Rolling Stone article about the growing number of people who think their LLM is a sapient spiritual guide.

People have been caught in that trap ever since the invention of religion. This is not a new problem.

staunton•17h ago
It doesn't have to be new to get worse and be important as a consequence of this stuff...
nkrisc•16h ago
Crazy people will always find a way to be crazy. It’s not surprising that many of these cases have a religious nature to them.
stirfish•16h ago
I'll admit, the first time I started ollama locally, I asked it if I would hurt it if I turned it off. I have a master's degree in machine learning and I know it's silly, but I still did it.
carlosjobim•1h ago
We were warned:

“You shall not make idols for yourselves or erect an image or pillar, and you shall not set up a figured stone in your land to bow down to it, for I am the LORD your God."

A computer chip is a stone (silicon) which has been engraved. It's a graven image.

Anything man-made is always unworthy of worship. That includes computer programs such as AI. That includes man-made ideas such as "the government", a political party, or other abstract ideas. That also includes any man or woman. But the human natural instinct is to worship a king, pharaoh or an emperor - or to worship a physical object.

streb-lo•45m ago
I mean, the Bible is a man-made pile of crap too...
carlosjobim•18m ago
If God or The Gods are defined as not being man-made, then each person will be able to find their own interpretation and understanding, in contrast to man-made objects and concepts. Most modern people worship "the government" or "the state", even though there is no dispute over whether it was created by man or whether it acts under the influence of man.
kelseyfrog•17h ago
LLMs are divinatory instruments, our era's oracle, minus the incense and theatrics. If we were honest, we'd admit that "artificial intelligence" is just a modern gloss on a very old instinct: to consult a higher-order text generator and search for wisdom in the obscure.

They tick all the boxes: oblique meaning, a semiotic field, the illusion of hidden knowledge, and a ritual interface. The only reason we don't call it divination is that it's skinned in dark mode UX instead of stars and moons.

Barthes reminds us that all meaning is in the eye of the reader; words have no essence, only interpretation. When we forget that, we get nonsense like "the chatbot told him he was the messiah," as though language could be blamed for the projection.

What we're seeing isn't new, just unfamiliar. We used to read bones and cards. Now we read tokens. They look like language, so we treat them like arguments. But they're just as oracular: complex, probabilistic signals we transmute into insight.

We've unleashed a new form of divination on a culture that doesn't know it's practicing one. That's why everything feels uncanny. And it's only going to get stranger, until we learn to name the thing we're actually doing. Which is a shame, because once we name it, once we see it for what it is, it won't be half as fun.

Arainach•16h ago
> When we forget that, we get nonsense like "the chatbot told him he was the messiah," as though language could be blamed for the projection.

Words have power, and those that create words - or create machines that create words - have responsibility and liability.

It is not enough to say "the reader is responsible for meaning and their actions". When people or planet-burning random matrix multipliers say things and influence the thoughts and behaviors of others there is blame and there should be liability.

Those who spread lies that caused people to storm the Capitol on January 6th believing an election to be stolen are absolutely partially responsible even if they themselves did not go to DC on that day. Those who train machines that spit out lies which have driven people to racism and genocide in the past are responsible for the consequences.

kelseyfrog•15h ago
"Words have no essential meaning" and "speech carries responsibility" aren't contradictory. They're two ends of the same bridge. Meaning is always projected by the reader, but that doesn't exempt the speaker from shaping the terrain of projection.

Acknowledging the interpretive nature of language doesn't absolve us from the consequences of what we say. It just means that communication is always a gamble: we load the dice with intention and hope they land amid the chaos of another mind.

This applies whether the text comes from a person or a model. The key difference is that humans write with a theory of mind. They guess what might land, what might be misread, what might resonate. LLMs don’t guess; they sample. But the meaning still arrives the same way: through the reader, reconstructing significance from dead words.

So no, pointing out that people read meaning into LLM outputs doesn’t let humans off the hook for their own words. It just reminds us that all language is a collaborative illusion, intent on one end, interpretation on the other, and a vast gap where only words exist in between.

krisoft•15h ago
This sounds very wise but doesn’t seem to describe any of my use cases. Maybe some use cases are divination but it is a stretch to call all of them that.

Just looking at my recent AI prompts:

I was looking for the name of the small fibers which form a bird's feather. ChatGPT told me they are called "barbs". Then, using a straightforward Google search, I could verify that this is indeed the name of the thing I was looking for. How is this "divination"?

I was looking for what the G-code equivalent for galvo fiber lasers is, and ChatGPT told me there isn't really one. The closest might be the SDK of EzCad, but it also listed several other open-source control solutions.

Wanted to know what the hallmarking rules in the UK are for an item which consists of multiple pieces of sterling silver held together by a non-metallic part. (Turns out the total weight of the silver matters, while the weight of the non-metallic part does not count.)

Wanted to translate the Hungarian phrase "besurranó tolvaj" into English. Out of the many possible translations ChatGPT provided, "opportunistic burglar" fit best for what I was looking for.

Wanted to write an SQLAlchemy model; I had an approximate idea of what fields I needed, but couldn't be arsed to come up with good names for them or to find the syntax to describe their types. ChatGPT wrote in seconds what would have taken me at least ten minutes otherwise (a sketch of this kind of model is below).
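
For readers unfamiliar with SQLAlchemy, a minimal sketch of the kind of model being described, assuming SQLAlchemy 2.0-style declarative mapping (the table and field names are hypothetical, not the ones from the original request):

    from sqlalchemy.orm import DeclarativeBase, Mapped, mapped_column

    class Base(DeclarativeBase):
        pass

    class SilverItem(Base):
        __tablename__ = "silver_items"

        # Approximate field ideas turned into named, typed columns.
        id: Mapped[int] = mapped_column(primary_key=True)
        description: Mapped[str]
        total_silver_weight_g: Mapped[float]  # hallmarking cares about this
        has_non_metallic_parts: Mapped[bool]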

These are "divination" only in a very galaxy-brained "oh man, when you open your mind you see everything is divination really" sense. I would call most of these "information retrieval". The information is out there; the LLM just helps me find it with a convenient interface. The last one, meanwhile, is "coding".

kelseyfrog•15h ago
Sure, some people stepped up to the Oracle and asked how to conquer Persia. Others probably asked where they left their sandals. The quality of the question doesn't change the structure of the act.

You presented clear, factual queries. Great. But even there, all the components are still in play: you asked a question into a black box, received a symbolic-seeming response, evaluated its truth post hoc, and interpreted its relevance. That's divination in structural terms. The fact that you're asking about barbs on feathers instead of the fate of empires doesn't negate the ritual, you're just a more practical querent.

Calling it "information retrieval" is fine, but it's worth noticing that this particular interface feels like more than that, like there's an illusion (or a projection) of latent knowledge being revealed. That interpretive dance between human and oracle is the core of divination, no matter how mundane the interaction.

I don't believe this paints with an overly broad brush. It's a real type of interaction and the subtle distinction focuses on the core relationship between human and oracle: seeking and interpreting.

gaganyaan•12h ago
> you asked a question into a black box, received a symbolic-seeming response, evaluated its truth post hoc, and interpreted its relevance

So any and all human communication is divination in your book?

I think your point is pretty silly. You're falling into a common trap of starting with the premise "I don't like AI", and then working backwards from that to pontification.

kelseyfrog•11h ago
Hacker News deserves a stronger counterargument than “this is silly.”

My original comment is making a structural point, not a mystical one. It’s not saying that using AI feels like praying to a god, it's saying the interaction pattern mirrors forms of ritualized inquiry: question → symbolic output → interpretive response.

You can disagree with the framing, but dismissing it as "I don’t like AI so I’m going to pontificate" sidesteps the actual claim. There's a meaningful difference between saying "this tool gives me answers" and recognizing that the process by which we derive meaning from the output involves human projection and interpretation, just like divination historically did.

This kind of analogy isn't an attack on AI. It’s an attempt to understand the human-AI relationship in cultural terms. That's worth engaging with, even if you think the metaphor fails.

hexaga•8h ago
> Hacker News deserves a stronger counterargument than “this is silly.”

Their counterargument is that said structural definition is overly broad, to the point of including any and all forms of symbolic communication (which is all of them). Because of that, your argument based on it doesn't really say anything at all about AI or divination, yet still seems 'deep' and mystical and wise. But this is a seeming only. And for that reason, it is silly.

By painting all things with the same brush, you lose the ability to distinguish between anything. Calling all communication divination (through your structural metaphor), and then using cached intuitions about 'the thing which used to be called divination; when it was a limited subset of the whole' is silly. You're not talking about that which used to be called divination, because you redefined divination to include all symbolic communication.

Thus your argument leaks intuitions (how that-which-was-divination generally behaves) that do not necessarily apply through a side channel (the redefined word). This is silly.

That is to say, if you want to talk about the interpretative nature of interaction with AI, that is fairly straightforward to show and I don't think anyone would fight you on it, but divination brings baggage with it that you haven't shown to be the case for AI. In point of fact, there are many ways in which AI is not at all like divination. The structural approach broadens too far too fast with not enough re-examination of priors, becoming so broad that it encompasses any kind of communication at all.

With all of that said, there seems to be a strong bent in your rhetoric towards calling it divination anyway, which suggests reasoning from that conclusion, and that the structural approach is but a blunt instrument to force AI into a divination shaped hole, to make 'poignant and wise' commentary on it.

> "I don’t like AI so I’m going to pontificate" sidesteps the actual claim

What claim? As per ^, maximally broad definition says nothing about AI that is not also about everything, and only seems to be a claim because it inherits intuitions from a redefined term.

> difference between saying "this tool gives me answers" and recognizing that the process by which we derive meaning from the output involves human projection and interpretation, just like divination historically did

Sure, and all communication requires interpretation. That doesn't make all communication divination. Divination implies the notion of interpretation of something that is seen to be causally disentangled from the subject. The layout of these bones reveals your destiny. The level of mercury in this thermometer reveals the temperature. The fair die is cast, and I will win big. The loaded die is cast, and I will win big. Spot the difference. It's not structural.

That implication of essential incoherence is what you're saying without saying about AI, it is the 'cultural wisdom and poignancy' feedstock of your arguments, smuggled in via the vehicle of structural metaphor along oblique angles that should by rights not permit said implication. Yet people will of course be generally uncareful and wave those intuitions through - presuming they are wrapped in appropriately philosophical guise - which is why this line of reasoning inspires such confusion.

In summary, I see a few ways to resolve your arguments coherently:

1. keep the structural metaphor, discard cached intuitions about what it means for something to be divination (w.r.t. divination being generally wrong/bad and the specifics of how and why). results in an argument of no claims or particular distinction about anything, really. this is what you get if you just follow the logic without cache invalidation errors.

2. discard the structural metaphor and thus disregard the cached intuitions as well. there is little engagement along human-AI cultural axis that isn't also human-human. AI use is interpretative but so is all communication. functionally the same as 1.

3. keep the structural metaphor and also demonstrate how AI are not reliably causally entwined with reality along boundaries obvious to humans (hard because they plainly and obviously are, as demonstrable empirically in myriad ways), at which point go off about how using AI is divination because at this point you could actually say that with confidence.

dingnuts•1h ago
You're misunderstanding the point of structural analysis. Comparing AI to divination isn't about making everything equivalent, but about highlighting specific shared structures that reveal how humans interact with these systems. The fact that this comparison can be extended to other domains doesn't make it meaningless.

The issue isn't "cached intuitions" about divination, but rather that you're reading the comparison too literally. It's not about importing every historical association, but about identifying specific parallels that shed light on user behavior and expectations.

Your proposed "resolutions" are based on a false dichotomy between total equivalence and total abandonment of comparison. Structural analysis can be useful even if it's not a perfect fit. The comparison isn't about labeling AI as "divination" in the classical sense, but about understanding the interpretive practices involved in human-AI interaction.

You're sidestepping the actual insight here, which is that humans tend to project meaning onto ambiguous outputs from systems they perceive as having special insight or authority. That's a meaningful observation, regardless of whether AI is "causally disentangled from reality" or not.

gaganyaan•1h ago
> humans tend to project meaning onto ambiguous outputs from systems they perceive as having special insight or authority

This applies just as well to other humans as it does AI. It's overly-broad to the point of meaninglessness.

The insight doesn't illuminate.

albedoa•1h ago
> That's a meaningful observation, regardless of whether AI is "causally disentangled from reality" or not.

And regardless of how many words someone uses in their failed attempt at a "gotcha" that nobody else is playing. There are certainly some folks acting silly here, and it's not the vast majority of us who have no problem interpreting and engaging with the structural analysis.

nosianu•4h ago
> So any and all human communication is divination in your book?

Words from an AI are just words.

Words in a human brain have more or less (depending on the individual's experiences) "stuff" attached to them: from direct sensory inputs to complex networks of experiences and thought. Human thought is mainly not based on words. Language is an add-on. (People without language - never learned, or temporarily disabled due to drugs, or permanently due to injury, as in transient or permanent aphasia - are still consciously thinking people.)

Words in a human brain are an expression of deeper structure in the brain.

Words from an AI have nothing behind them but word statistics, devoid of any real world, just words based on words.

Random example sentence: "The company needs to expand into a new country's market."

When an AI writes this, there is no real world meaning behind it whatsoever.

When a fresh out of college person writes this it's based on some shallow real world experience, and lots of hearsay.

When an experienced person actually having done such an expansion in the past says it, a huge network of their experience with people and impressions is behind it: a feeling for where the difficulties lie and what to expect IRL, with a lot of real-world-experience-based detail. When such a person expands on the original statement, chances are highest that any follow-up statements will also represent real life quite well, because they are drawn not from text analysis, but from those deeper structures created by and during the process of the person actually performing and experiencing the task.

But the words can be exactly the same. Words from a human can be of the same (low) quality as those of an AI, if they just parrot something they read or heard somewhere - although even then the words will have more depth than the "zero" of AI words, because even the stupidest person has some degree of actual real life forming their neural network, and not solely analysis of others' texts.

ben_w•3h ago
> People without language - never learned, or sometimes temporarily disabled due to drugs, or permanently due to injury, transient or permanent aphasia - are still consciously thinking people.

There are 40 definitions of the word "consciousness".

For the definitions pertaining to an inner world, nobody can tell if anyone besides themselves (regardless of whether they speak or move) is conscious, and none of us can prove to anyone else the validity of our own claims to possess it.

When I dream, am I conscious in that moment, or do I create a memory that my consciousness replays when I wake?

> Words from an AI have nothing behind them but word statistics, devoid of any real world, just words based on words.

> […]

> When a fresh out of college person writes this it's based on some shallow real world experience, and lots of hearsay.

My required reading at school included "Dulce Et Decorum Est" by Wilfred Owen.

The horrors of being gassed during trench warfare were alien to us in the peaceful south coast of the UK in 1999/2000.

AI are limited, but what you're describing here is the "book learning" vs. "street smarts" dichotomy rather than their actual weaknesses.

tempodox•2h ago
I can only agree with you. And I find it disturbing that every time someone points out what you just said, the counterargument is to reduce human experience and human consciousness to the shallowest possible interpretation so they can then say, "look, it's the same as what the machine does".
kenjackson•26m ago
I think it’s because the brain is simply a set of chemical and electrical interactions. I think some believe when we understand how the brain works it won’t be some “soulful” other worldly explanation. It will be some science based explanation that will seem very unsatisfying to some that think of us as more than complex machines. The human brain is different than LLMs, but I think we will eventually say “hey we can make a machine very similar”.
krisoft•11h ago
> some people stepped up to the Oracle and asked how to conquer Persia. Others probably asked where they left their sandals.

And if the place were any good at the second kind of query, you would call it the Lost & Found and not the Oracle.

> illusion (or a projection) of latent knowledge being revealed

It is not an illusion. Knowledge is being revealed. The right knowledge for my question.

> That interpretive dance between human and oracle is the core of divination, no matter how mundane the interaction.

Ok, so if I went to a library, used a card index to find a book about bird feather anatomy, then read said book to find that the answer to my question is "barb", would you also call that "divination"?

If I had paid a software developer to turn my imprecise description of a database table into precise and tight code which can be executed, would you also call that "divination"?

anon-3988•5h ago
The difference is between saying '"I want a hammer" and it magically pops into your hand' versus '"I want a hammer" and I have to chop some wood, gather some metals, heat it up...'.

Both get you a hammer, but I don't think anyone would call the latter magical/divine? I think it's only "magical" because it's incomprehensible... how does a hammer pop into reality? Of course, once we know EXACTLY how that works, it ceases to be magical.

Even if we take God: if we fully understood how He works, He would no longer be magical/divine. "Oh, he created another universe? This is how that works..."

The divinity comes from the fact that it is incomprehensible.

tough•4h ago
I always like to compare, tongue in cheek, LLMs with the I Ching

https://en.wikipedia.org/wiki/I_Ching

AlecSchueler•1h ago
Why?
tough•1h ago
I copy-pasted my comment and your question to ChatGPT, so this isn't my answer but the AI's:

Make your own conclusions.

Because both LLMs and the I Ching function as mirrors for human interpretation, where:

• The I Ching offers cryptic symbols and phrases—users project meaning onto them.
• LLMs generate probabilistic text—users extract significance based on context.

The parallel is: you don't get answers, you get patterns—and the meaning emerges from your interaction with the system.

In both cases, the output is:

• Context-sensitive
• Open-ended
• Interpreted more than dictated

It's a cheeky way of highlighting that users bring the meaning, not the machine (or oracle).

narrator•3h ago
You're describing what narrative archetype it is most similar to from ancient history, not what it actually is.
zzbzq•2h ago
The LLMs do have "latent knowledge," indisputably; the latent knowledge is beyond reproach, because what we do know about the "black box" is that inside it is a database of not just facts but understanding, and we know the model "understands" nearly every topic better than any human. The doubt-worthy part is the generative step: since it is tasked with producing a new "understanding" that didn't already exist, the mathematical domain of the generative function exceeds the domain of reality. And second, the reasoning faculties are far less proven than the understanding faculties, and many queries require reasoning about existing understandings to derive a good, new one.
datadrivenangel•2h ago
LLMs have latent knowledge insofar as it can be distilled out of the internet...
whilenot-dev•1h ago
*or any digitized proprietary works, just as long as they can be parsed correctly. Don't worry, the means of how to obtain these works doesn't seem to matter[0]

[0]: https://www.arl.org/blog/training-generative-ai-models-on-co...

falcor84•9m ago
> Others probably asked where they left their sandals.

This to me is massive. The Oracle of Delphi would have no idea where you left your sandals, but present day AIs increasingly do. This (emergent?) capability of combining information retrieval with flexible language is amazing, and its utility to me cannot be overstated, when I ask a vague question, and then I check the place where the AI led me to, and the sandals are indeed there.

P.S. Thank you for introducing me to the word "querent"

navane•9h ago
What about the times you didn't get a coherent answer and you gave up and looked elsewhere?
JKCalhoun•2h ago
That almost proves it is not an oracle then - or at least not perceived as one.

Rephrasing: LLMs are the modern-day oracle that we disregard when it appears to be hallucinating and embrace when it appears to be correct.

The popularity of LLMs may not be that we see them as mystical, but rather that they're right more often than they're wrong.

“That is not what I meant at all;

That is not it, at all.”

— T.S. Eliot

dfxm12•2h ago
> ChatGPT told me they are called "barbs". Then, using a straightforward Google search, I could verify that this is indeed the name of the thing I was looking for.

Why not just start with a straightforward Google search?

shagie•2h ago
If you are not familiar with the term, it can be hard to search for it.

Google doesn't give you the answer (unless you're reading the AI summaries - then it's a question of which one you trust more). Instead it provides links to

    https://www.scienceofbirds.com/blog/the-parts-of-a-feather-and-how-feathers-work
    https://www.birdsoutsidemywindow.org/2010/07/02/anatomy-parts-of-a-feather/
    https://en.wikipedia.org/wiki/Feather
    https://www.researchgate.net/figure/Feather-structure-a-feather-shaft-rachis-and-the-feather-vane-barbs-and-barbules_fig3_303095497
    
These then require an additional parsing of the text to see if it has what you are after. Arguably, one could read the Wiki article first and see if it does, but it's faster to ask ChatGPT and then verify - rather than search, scan, and parse.
ramchip•2h ago
It gives you more effective search keywords. "Fibers in feathers" isn't too bad, but when it's quite vague like "that movie from the 70s where the guy drank whiskey and then there was a firefight and..." getting the name from the LLM makes it much faster to google.
bandoti•1h ago
Funny, I just entered "feather" into the Merriam-Webster dictionary and there's your word, "barb". Point being, people should use a dictionary/thesaurus before burning fuel on an AI.

1 a : any of the light, horny, epidermal outgrowths that form the external covering of the body of birds

NOTE: Feathers include the smaller down feathers and the larger contour and flight feathers. Larger feathers consist of a shaft (rachis) bearing branches (barbs) which bear smaller branches (barbules). These smaller branches bear tiny hook-bearing processes (barbicels) which interlock with the barbules of an adjacent barb to link the barbs into a continuous stiff vane. Down feathers lack barbules, resulting in fluffy feathers which provide insulation below the contour feathers.

dingnuts•1h ago
This is a great example, because the LLM answer was insufficiently complete, but if you didn't look up the result you wouldn't know. I think I remain an AI skeptic because I keep looking up the results, and this kind of omission is more common than not.
msgodel•5h ago
This stuff wasn't an issue because older societies had hierarchy which checked the mob.

In a flat society every individual must be able to perform philosophically the way aristocrats do.

mdp2021•4h ago
> we were honest

I am quite honest, and the subset of users that fit your description - unconsciously treating text from deficient authors as tea leaves - have psychiatric issues.

Surely many people consult LLMs because of the value within their right answers - which exist owing to encoded information and some emergent idea processing - while attempting to tame the wrong ones. They consult LLMs because that's what we have, limited as it is, for some problems.

Your argument falls immediately because people consulting unreliable documents cannot be confused with people consulting tools for other kinds of thinking: the thought under test is outside in the first case, inside in the second (contextually).

You have fallen into a very bad use of 'we'.

awongh•3h ago
> value within their right answers

The thing is that LLMs provide plenty of answers where "right" is not a verifiable metric. Even in coding, the idea of a "right" answer quickly gets fuzzy: should I use CSS Grid or Flexbox here? Should these tables be normalized or not?

People simply have an unconscious bias towards the output, just like they have an unconscious bias towards the same answer given by two real people they feel differently about. That is, it's the sort of thing all humans do (even if you swear that in all cases you are 100% impartial and logical).

I think the impulse of ascribing intent and meaning to the output is there in almost all questions, it's just a matter of degrees (CSS question vs. meaning of life type question)

tim333•2h ago
Maybe LLMs can be divinatory instruments but that sounds a bit highbrow going by my use.

I use it more as a better Google search. Like the most recent thing I said to ChatGPT is "will clothianidin kill carpet beetles?" (Turns out it does, by the way.)

suddenlybananas•2h ago
Until it makes stuff up.
tim333•2h ago
Well nothing's perfect. It actually got that one wrong - it said "not typically effective" which isn't true.
mort96•1h ago
Nothing is perfect, but some things let you validate the answer. Search engines give you search results, not an answer. You can use the traditional methods for evaluating the reliability of a resource. An academic resource focused on the toxicity of various kinds of chemicals is probably fairly trustworthy, while a blog from someone trying to sell you healing crystals probably isn't.

When you're using ChatGPT to find information, you have no indication of whether what it's regurgitating comes from a high-reliability source or a low-reliability one, or if it's just a random collection of words whose purpose is simply to make grammatical sense.

AlecSchueler•1h ago
The most frustrating time I've had was asking it something, pointing out why the answer was obviously wrong, having it confirm the faulty logic and give an example of how a portion of the answer would look if it had used sound logic, followed by a promise to answer again without the accepted inaccuracy, only to get an even more absurd answer than the first time around.
Zak•2h ago
This seems like the sort of question that's very likely to produce a hallucinated answer.

Interestingly, I asked Perplexity the same thing and it said that clothianidin is not commonly recommended for carpet beetles, and suggested other insecticides and growth regulators. I had to ask a follow-up before it concluded clothianidin probably will kill carpet beetles.

tim333•1h ago
Yeah, as mentioned in another comment, ChatGPT said "not generally effective", which I guess it hallucinated. It's actually a tricky question, because the answer isn't really there in a straightforward way on the general web, and I only know for sure because someone I know and I tried it. Although I guess a pesticide expert would find it easy.

Part of the reason is that clothianidin is too effective at killing insects and tends to persist in the environment, killing bees and butterflies and the like, so it isn't recommended for harmless stuff like carpet beetles. I was actually using it for something else and was curious if it would take out the beetles as a side effect.

thoroughburro•2h ago
Trusting LLM advice about poisons seems… sort of like being a test pilot for a brand new aerospace company with no reputation for safety.
selfhoster11•1h ago
Only if you don't check it against a classical search query later. Not to mention that all you might get is search results from slop farms that aren't concerned with safety or factuality - ones that were a problem before LLMs.
tim333•1h ago
I agree in general but it wasn't of much importance whether my carpet beetles died or not.
sorokod•2h ago
Given the consistently declining quality of Google search, this is a low bar to pass.
bandoti•1h ago
Just as alchemists searched for the Philosopher’s stone, we search for Artificial General Intelligence.
duxup•1h ago
The terminology is so confusing in AI right now.

I use LLMs, I enjoy them, I'm more productive with them.

Then I go read a blog from some AI devs and they use terms like "thinking" or similar terms.

I always have to ask, "We're still just stringing words together with math, right? Not really thinking, right?" The answer is always yes... but then they go back to using their wonky terms.

AlecSchueler•1h ago
The question is: what's different in your own "thinking"?
pennomi•1h ago
If you actually know the answer to this, you should probably publish a paper on it. The conditions that truly create intelligence are… not well understood.
AlecSchueler•18m ago
That's actually the point I was making. There's an assumption that the LLM is working differently because there's a statistical model but we lack the understanding of our own intelligence to be able to say this is indeed a difference.
freejazz•6m ago
So? There is no more evidence to suggest they are the same than what you've already rejected here as evidencing difference.
jhedwards•1h ago
Thinking in humans is prior to language. The language apparatus is embedded in a living organism which has a biological state that produces thoughts and feelings, goals and desires. Language is then used to communicate these underlying things, which themselves are not linguistic in nature (though of course the causality is so complex that they may be _influenced_ by language, among other things).
jsdalton•1h ago
Agreed. Many animals without language show evidence of thinking (e.g. complex problem solving skills and tool use). Language is clearly an enabler of complex thought in humans but not the entire basis of our intelligence, as it is with LLMs.
AlecSchueler•20m ago
But having language as the basis doesn't mean it isn't intelligence, right? At least I see no argument for that in what's being said. Stability can come from a basis of steel but it can also have a basis of wood.
cma•43m ago
> which themselves are not linguistic in nature (though of course the causality is so complex that they may be _influenced_ by language, among other things).

It's possible something like this could be said of the middle transformer layers, where it gets more and more abstract; and modern models are multimodal as well, through various techniques.

kenjackson•35m ago
This is really over-indexing on language for LLMs. It's about taking input and generating output. Humans use different types of senses as their input; LLMs use text.

What makes thinking an interesting form of output is that it processes the input in some non-trivial way to be able to do an assortment of different tasks. But that's it. There may be other forms of intelligence that have other "senses" and deem our ability to use only physical senses as somehow making us incomplete beings.

hackinthebochs•10m ago
Word embeddings are "prior" to an LLM's facility with any given natural language as well. Tokens are not the most basic representational substrate in LLMs; rather, it's the word embeddings that capture sub-word information. LLMs are a lot more interesting than people give them credit for. (A toy illustration follows.)
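A toy sketch of that substrate (the vocabulary, dimensions, and values are invented; real models learn the matrix during training): each token id is just an index into a matrix of dense vectors, and all downstream computation operates on those vectors, not on the tokens.

    import numpy as np

    # Invented 3-token vocabulary and 4-dimensional embeddings.
    vocab = {"feather": 0, "barb": 1, "bird": 2}
    embeddings = np.random.default_rng(1).normal(size=(len(vocab), 4))

    def embed(tokens):
        # Look up one dense vector per token id; attention and MLP layers
        # see only these vectors, never the raw tokens.
        return embeddings[[vocab[t] for t in tokens]]

    print(embed(["bird", "feather"]).shape)  # (2, 4)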
chasil•22m ago
The platform that we each hold is the most powerful abstract analysis machine known to exist.

It may be, by the end of my life, that this will no longer be true. That would be poignant.

globnomulous•1h ago
This is why I used to fight this "shorthand" whenever I encountered it. The shorthand almost always stops being shorthand and becomes the speaker or author's actual beliefs regarding the systems. Disciplined, careful use of language matters.

But I'm so crestfallen and pessimistic about the future of software and software engineering now that I have stopped fighting that battle.

s900mhz•51m ago
I personally love LLMs and use them daily for a variety of tasks. I really do not know how to “fix” the terminology. I agree with you that they are not thinking in the abstract like humans. I also do not know what else you would call “chain-of-thought”.

Perhaps “journaling-before-answering” lol. It’s basically talking out loud to itself. (Is that still being too anthropomorphic?)

Is this comment me “thinking out loud”? shrug

mdaniel•17m ago
I wanted to fight the "hallucinating" versus "confabulating" delineation but was told "it's a term of art, sit back down"
dmos62•7m ago
Would it be thinking if the brain was modeled in a more "accurate" way? Does this set of criteria for thinkingness come from whether or not the underlying machinery resembles what the corresponding machinery in humans looks like under the hood?

I'm putting the word accurate in quotes, because we'd have to understand how the brain in humans works, to have a measure for accuracy, which is very much not the case, in my humble opinion, contrary to what many of the commenters here imply.

voidhorse•1h ago
Great take. In my view, a major issue right now is that the people pushing these tools on the populace have never read Barthes and in many cases probably don't even know the name. If they had an inkling of literary and social history, they might be a bit more cautious and conscientious about how they frame these tools to people.

We are currently subject to the whims of corporations with absurd amounts of influence and power, run by people who barely understand the sciences, who likely know nothing about literary history beyond what the chatbot can summarize for them, have zero sociological knowledge or communications understanding, and who don't even write well-engineered code 90% of the time but are instead ok with shipping buggy crap to the masses as long as it means they get to be the first ones to do so, all this coupled with an amount of hubris unmatched by even the greatest protagonists of Greek literature. Society has given some of the stupidest people the greatest amount of resources and power, and now we are paying for it.

cratermoon•1h ago
This exact characterization was noted two years ago. https://softwarecrisis.dev/letters/llmentalist/
spwa4•17h ago
What really happens: "for some reason," higher-up management thinks AI will let idiots run extremely complex companies. It doesn't.

What AI actually does is what any other improved tool does: it's a force multiplier. It allows a small number of highly experienced, very smart people to do double or triple the work they can do now.

In other words: for idiot management, AI does nothing (EXCEPT enable the competition)

Of course, this results in what you now see: layoffs where, as always, the idiots survive, followed by those companies' products starting to suck more and more, because they laid off the people who actually understood how things worked, and AI cannot make up for that. Not even close.

AI is a mortal threat to the current crop of big companies. The bigger the company, the bigger the threat. The skill high-level managers tend to have is to "conquer" existing companies, and nothing else. With some exceptions, they don't have any skill outside of management, and so you get the eternally repeated management song: that companies can be run by professional managers who don't know the underlying problem/business, "using numbers" and spreadsheets (except that when you know a few such managers and press them, it of course turns out they don't have a clue about the numbers and can't come up with basic spreadsheet formulas).

TLDR: AI DOESN'T let financial-expert management run an airplane company. AI lets 1000 engineers build 1000 planes without such management. AI lets a company like what Google was 15-20 years ago wipe the floor with a big airplane manufacturer. So expect big management to come up with ever more, ever bigger reasons why AI can't be allowed to do X.

jongjong•17h ago
Resonates. I've been thinking about a technology bubble in tech (not a financial bubble) for years now. Big tech companies have just been throwing engineers at problems for many years, and it feels like they have completely stopped caring about talent now. Not that they ever really cared deeply, but they did care superficially, and that was enough to keep the machines spinning.

Now that they have AI, I can see it becoming an 'idiocy multiplier'. Already software is starting to break in subtle ways: it's slow and laggy, and security processes have become a nightmare.

thetjyt468555•16h ago
Perhaps the higher-ups are so detached from details that they see a comrade in the LLM bullshit-artist.
lelanthran•16h ago
> What AI actually does is like any other improved tool: it's a force multiplier. It allows a small number of highly experienced, very smart people, do double or triple the work they can do now.

It's different from other force-multiplier tools in that it cuts off the pipeline of new blood while simultaneously atrophying the experienced and smart people.

jillesvangurp•4h ago
Exactly, and in the business world, those using force multipliers outsmart and outwork their competitors. It's that simple. People are projecting all sorts of hyperbole, morals, outrage, panic, fear, and other nonsense onto LLMs. But in the end they are tools that are available at extremely low cost that do vaguely useful things if you manage to prompt them right. Which isn't all that hard with a little practice.

I've been doing that for a few years now, I understand the limitations and strengths. I'm a programmer that also does marketing and sales when needed. LLMs have made the former a lot less tedious and the latter a lot easier. There are still things I have to do manually. But there are also whole categories of things that LLMs do for me quickly, reliably, and efficiently.

The impact on big companies is that the strategy of hiring large amounts of people and getting them to do vaguely useful things by prompting them right at great expense is now being challenged by companies doing the same things with a lot less people (see what I did there). LLMs eliminate all the tedious stuff in companies. A lot of admin and legal stuff. Some low level communication work (answering support emails, writing press releases, etc). There's a lot of stuff that companies do or have to do that is not really their core business but just stuff that needs doing. If you run a small startup, that stuff consumes a lot of your time. I speak from experience. Guess what I use LLMs for? All of it. As much as I can. Because that means more quality time with our actual core product. Things are still tedious. But I get through more of it quicker.

imiric•17h ago
This is a good summary of why the language we use to describe these tools matters[1].

It's important that the general public understands their capabilities, even if they don't grasp how they work on a technical level. This is an essential part of making them safe to use, which no disclaimer or PR puff piece about how deeply your company cares about safety will ever do.

But, of course, marketing them as "AI" that's capable of "reasoning", and showcasing how good they are at fabricated benchmarks, builds hype, which directly impacts valuations. Pattern recognition and data generation systems aren't nearly as sexy.

[1]: https://news.ycombinator.com/item?id=44203562#44218251

dwaltrip•16h ago
People are paying hundreds of dollars a month for these tools, often out of their personal pocket. That's a pretty robust indicator that something interesting is going on.
mmcconnell1618•16h ago
One thing these models are extremely good at is reading large amounts of text quickly and summarizing important points. That capability alone may be enough to pay $20 a month for many people.
david-gpu•4h ago
Not just summarizing, but also being able to answer follow-up questions about what is in the text.

And, like Wikipedia, they can be useful to find your bearing in a subject that you know nothing about. Unlike Wikipedia, you can ask it free-form questions and have it review your understanding.

th0ma5•4h ago
I keep hearing anecdotes, but the data, like a widely covered BBC study, says they only compress and shorten, and that outside of testing they routinely fail at real-world selection of only the most important content or topics.
diggan•3h ago
> That capability alone may be enough to pay $20 a month for many people.

Sure, but that's not why I and others now have ~$150/month subscriptions to some of these services.

otabdeveloper4•51m ago
> and summarizing important points

Unfortunately the LLM does not (and cannot) know what points are important or not.

If you just want a text summary based on statistical methods, then go ahead, LLMs do this cheaper and better than the previous generation of tools.

If you want actual "importance" then no.

NoMoreNicksLeft•8m ago
Why would anyone want to read less and not more? It'd be like reading movie spoilers so you didn't have to sit through 2 hours to find out what happened.
imiric•16h ago
I'm not disputing the value of what these tools can do, even though that is often inflated as well. What I'm arguing against is using language that anthropomorphizes them to make them appear far more capable than they really are. That's dishonest at best, and only benefits companies and their shareholders.
diggan•3h ago
> anthropomorphizes them to make them appear far more

It seems like this argument is frequently brought up just because someone used the words "thinking" or "reasoning" or other similar terms. While it's true that LLMs aren't really "reasoning" like a human, the terms are used not because the person actually believes that the LLM is "reasoning like a human" but because the concept of "some junk tokens to get better tokens later" has been implemented under that name. And even with that name, it doesn't mean everyone believes they're doing human reasoning.

It's a bit like "isomorphic" programming frameworks. They're not talking about the mathematical structures which also bear the name "isomorphic"; rather, the name has been "stolen" to now mean more things, because it was kind of similar in some way.

I'm not sure what the alternative is. Humans have been doing this thing of "Ah, this new concept X is kind of similar to concept Y, maybe we reuse the name to describe X for now" for a very long time, and if you understand the context when it's brought up, it seems relatively problem-free to me; most people seem to get it.

It benefits everyone in the ecosystem when terms have shared meaning, so discussions about "reasoning" don't have to use terms like "How an AI uses jumbled starting tokens within the <think> tags to get better tokens later", and can instead just say "How an AI uses reasoning" and people can focus on the actual meat instead.
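
As a toy illustration of the "junk tokens to get better tokens later" idea (the tag name and wording are assumptions; real chat templates vary by model), the "reasoning" is just extra sampled text that the interface hides before showing the answer:

    # A hypothetical raw model output: "reasoning" text inside <think> tags,
    # followed by the visible answer.
    raw_output = "<think>The user asks for 2+2. Basic arithmetic. 4.</think>4"

    def strip_reasoning(text, close_tag="</think>"):
        # What a chat UI typically does before displaying the response.
        return text.split(close_tag, 1)[-1]

    print(strip_reasoning(raw_output))  # -> 4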

contagiousflow•2h ago
A tool can feel productive and novel, without actually providing all of the benefits the user thinks it is.
andy99•17h ago
I agree with the substance, but would argue the author fails to "understand how AI works" in an important way:

  LLMs are impressive probability gadgets that have been fed nearly the entire internet, and produce writing not by thinking but by making statistically informed guesses about which lexical item is likely to follow another
Modern chat-tuned LLMs are not simply statistical models trained on web-scale datasets. They are essentially fuzzy stores of (primarily third-world) labeling effort. The response patterns they give are painstakingly, and at massive scale, tuned into them by data labelers. The emotional skill mentioned in the article is outsourced employees writing or giving feedback on emotional responses.

So you're not so much talking to a statistical model as having a conversation with a Kenyan data labeler, fuzzily adapted through a transformer model to match the topic you've brought up.

While the distinction doesn't change the substance of the article, it's valuable context, and it's important to dispel the idea that training on the internet does this. Such training gives you GPT-2. GPT-4.5 is efficiently stored low-cost labor.
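
To be concrete about the mechanism the quoted passage describes: at inference time, next-token prediction looks roughly like this (a minimal sketch using the Hugging Face transformers API; the model choice is illustrative):

  import torch
  from transformers import AutoModelForCausalLM, AutoTokenizer

  tok = AutoTokenizer.from_pretrained("gpt2")
  model = AutoModelForCausalLM.from_pretrained("gpt2")

  ids = tok("The capital of France is", return_tensors="pt").input_ids
  with torch.no_grad():
      logits = model(ids).logits[0, -1]   # a score for every token in the vocabulary
  probs = torch.softmax(logits, dim=-1)   # turn scores into a probability distribution
  next_id = torch.multinomial(probs, 1)   # sample one "statistically informed guess"
  print(tok.decode(next_id))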

MrZander•17h ago
This doesn't match my understanding of transformers at all. I'm not aware of any human labeling in the training.

What would labeling even do for an LLM? (Not including multimodal)

The whole point of attention is that it uses existing text to determine when tokens are related to other tokens, no?
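
Roughly something like this, in a from-scratch sketch (illustrative only, not any particular library's API):

  import math
  import torch

  def attention(q, k, v):
      # score each token's query against every token's key ("relatedness")
      scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))
      weights = torch.softmax(scores, dim=-1)
      # mix the value vectors according to those relatedness weights
      return weights @ v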

daveguy•16h ago
The transformers are accurately described in the article. The confusion comes in the Reinforcement Learning from Human Feedback (RLHF) process applied after a transformer-based system is trained. These are algorithms on top of the basic model that make additional discriminations about the next word (or phrase) based on human feedback. It's really just a layer that makes these models sound "better" to humans. And it's a great way to muddy the hype response and make humans get warm fuzzies about the response of the LLM.
MrZander•15h ago
Oh, interesting, TIL. Didn't realize there was a second step to training these models.
hexaga•7h ago
There are in fact several steps. Training on large text corpora produces a completion model: a model that completes whatever document you give it as accurately as possible. It's kind of hard to make those do useful work, as you have to phrase things as partial solutions that are then filled in. Lots of 'And clearly, the best way to do x is [...]' style prompting tricks required.

Instruction tuning / supervised fine tuning is similar to the above but instead of feeding it arbitrary documents, you feed it examples of 'assistants completing tasks'. This gets you an instruction model which generally seems to follow instructions, to some extent. Usually this is also where specific tokens are baked in that mark boundaries of what is assistant response, what is human, what delineates when one turn ends / another begins, the conversational format, etc.
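
As a rough illustration of those baked-in boundary tokens (the exact strings here are made up; every model family uses its own):

  # One SFT training example, formatted with illustrative special tokens:
  example = (
      "<|user|>What is the capital of France?<|end|>\n"
      "<|assistant|>The capital of France is Paris.<|end|>"
  )
  # Loss is typically computed only on the assistant's tokens, so the model
  # learns to produce the assistant side of the conversational format.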

RLHF / similar methods go further and ask models to complete tasks, and then their outputs are graded on some preference metric. Usually that's humans, or another model that has been trained to provide 'human-like' preference scores given some input. This doesn't really change anything functionally, but it makes the model much more (potentially overly) palatable to interact with.
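
The preference step can be sketched like this (the reward model below is a toy stand-in; a real one is a trained network scoring "how much would a human prefer this"):

  import torch
  import torch.nn.functional as F

  def reward_model(text: str) -> torch.Tensor:
      # stand-in for a trained network mapping text to a scalar preference score
      return torch.tensor(float(len(text)))

  r_chosen = reward_model("Paris is the capital of France.")
  r_rejected = reward_model("idk lol")
  # Bradley-Terry style loss: push the preferred answer's score above the other's
  loss = -F.logsigmoid(r_chosen - r_rejected)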

JKCalhoun•2h ago
Got 3½ hours? https://youtu.be/7xTGNNLPyMI

(I watched it all, piecemeal, over the course of a week, ha, ha.)

meroes•17h ago
Ya, I don’t think I’ve seen any article going in depth into just how many low-level humans, like data labelers and RLHF’ers, there are behind the scenes of these big models. It has to be millions of people worldwide.
simonw•16h ago
I'm really curious to understand more about this.

Right now there are top tier LLMs being produced by a bunch of different organizations: OpenAI and Anthropic and Google and Meta and DeepSeek and Qwen and Mistral and xAI and several others as well.

Are they all employing separate armies of labelers? Are they ripping off each other's output to avoid that expense? Or is there some other, less labor intensive mechanisms that they've started to use?

meroes•15h ago
There are middle-men companies like Scale that recruit thousands of remote contractors, probably through other companies they hire. There are of course other lesser-known companies that also sit between the model companies and the contracted labelers and RLHF’ers. There are probably several tiers of these middle companies that agglomerate larger pools of workers. But how intermixed the work is and what its scale is, I couldn’t tell you, nor whether it’s shifting to something else.

I mean, on LinkedIn you can find many AI-trainer companies and see that they hire for every subject, language, and programming language, across several expertise levels. They provide the laborers for the model companies.

megaloblasto•2h ago
I'm also very interested in this. I wasn't aware of the extent of the effort of labelers. If someone could point me to an article or something where I could learn more, that would be greatly appreciated.
whilenot-dev•1h ago
Just look for any company that offers data annotation as a service, they seem happy to explain their process in detail[0]. There's even a link to a paper from OpenAI[1] and some news about the contractor count[2].

[0]: https://snorkel.ai/data-labeling/#Data-labeling-in-the-age-o...

[1]: https://cdn.openai.com/papers/Training_language_models_to_fo...

[2]: https://www.businessinsider.com/chatgpt-openai-contractor-la...

happy_dog1•49m ago
I added a reply to the parent of your comment with a link to an article I found fascinating about the strange world of labeling and RLHF -- this really interesting article from The Verge 2 years ago:

https://www.theverge.com/features/23764584/ai-artificial-int...

happy_dog1•1h ago
There's a really fascinating article about this from a couple years ago that interviewed numerous people working on data labeling / RLHF, including a few who had likely worked on ChatGPT (they don't know for sure because they seldom if ever know which company will use the task they are assigned or for what). Hard numbers are hard to come by because of secrecy in the industry, but it's estimated that the number of people involved is already in the millions and will grow.

https://www.theverge.com/features/23764584/ai-artificial-int...

Interestingly, despite the boring and rote nature of this work, it can also become quite complicated as well. The author signed up to do data labeling and was given 43 pages (!) of instructions for an image labeling task with a long list of dos and don'ts. Specialist annotation, e.g. chatbot training by a subject matter expert, is a growing field that apparently pays as much as $50 an hour.

"Put another way, ChatGPT seems so human because it was trained by an AI that was mimicking humans who were rating an AI that was mimicking humans who were pretending to be a better version of an AI that was trained on human writing..."

Al-Khwarizmi•17h ago
I don't think those of us who don't work at OpenAI, Google, etc. have enough information to accurately estimate the influence of instruction tuning on the capabilities or the general "feel" of LLMs (it's really a pity that no one releases non-instruction-tuned models anymore).

Personally my inaccurate estimate is much lower than yours. When non-instruction tuned versions of GPT-3 were available, my perception is that most of the abilities and characteristics that we associate with talking to an LLM were already there - just more erratic, e.g., you asked a question and the model might answer or might continue it with another question (which is also a plausible continuation of the provided text). But if it did "choose" to answer, it could do so with comparable accuracy to the instruction-tuned versions.

Instruction tuning made them more predictable, and made them tend to give the responses that humans prefer (e.g. actually answering questions, maybe using answer formats that humans like, etc.), but I doubt it gave them many abilities that weren't already there.

tough•4h ago
instruction tuning is like imprinting the chat UX into the model's weights

it's all about the user/assistant flow instead of just a -text generator- after it

and the assistant always tries to please the user.

they built a sycophantic machine, either by mistake or malfeasance

jenadine•16h ago
> produce writing not by thinking but by making statistically informed guesses about which lexical item is likely to follow another

What does "thinking" even mean? It turns out that some intelligence can emerge from this stochastic process. LLM can do math and can play chess despite not trained for it. Is that not thinking?

Also, could it be possible that our brains do the same: generating muscle output or spoken output somehow based on our senses and some "context" stored in our neural networks?

cwillu•38m ago
I'm sympathetic to this line of reasoning, but “LLM can play chess” is overstating things, and “despite not being trained for it” is understating how many chess games and books would be in the training set of any LLM.

While it's been a few months since I've tested, the last time I tested the reasoning on a game for which very little data is available in book or online text, I was rather underwhelmed with openai's performance.

hapali•16h ago
More accurately:

Modern chat-oriented LLMs are not simply statistical models trained on web scale datasets. Instead, they are the result of a two-stage process: first, large-scale pretraining on internet data, and then extensive fine-tuning through human feedback. Much of what makes these models feel responsive, safe, or emotionally intelligent is the outcome of thousands of hours of human annotation, often performed by outsourced data labelers around the world. The emotional skill and nuance attributed to these systems is, in large part, a reflection of the preferences and judgments of these human annotators, not merely the accumulation of web text.

So, when you interact with an advanced LLM, you’re not just engaging with a statistical model, nor are you simply seeing the unfiltered internet regurgitated back to you. Rather, you’re interacting with a system whose responses have been shaped and constrained by large-scale human feedback—sometimes from workers in places like Kenya—generalized through a neural network to handle any topic you bring up.

tim333•1h ago
Sounds a bit like humans. Much data modified by "don't hit your sister" etc.
JKCalhoun•2h ago
Many like the author fail to convince me because they never also explain how human minds work. They just wave their hand, look off to a corner of the ceiling with, "But of course that's not how humans think at all," as if we all just know that.
monkaiju•2h ago
First off, there's an entire field attempting to answer that question, cognitive science.

Secondly, the burden of proof isn't on cog-sci folks to prove the human mind doesn't work like an LLM; it'd be on others to prove that it does. From what we do know, despite not having a flawless understanding of the human mind, it works nothing like an LLM.

Side note: The temptation to call anything that appears to act like a mind a mind is called behaviorism, a very old cog-sci concept, disproved many times over.

mjburgess•1h ago
Some features of animal sentience:

* direct causal contact with the environment, e.g., the light from the pen hits my eye, which induces mental states

* sensory-motor coordination, i.e., the light hitting my eye from the pen enables coordination of the movement of the pen with my body

* sensory-motor representations, i.e., my sensory-motor system is trainable, and trained by historical environmental coordination

* hierarchical planning in coordination, i.e., these sensory-motor representations are goal-contextualised, so that I can "solve my hunger" in an infinite number of ways (I can achieve this goal against an infinite permutation of obstacles)

* counterfactual reality-oriented mental simulation (aka imagination) -- these rich sensory-motor representations are reifiable in imagination, so I can simulate novel permutations of the environment, possible shifts to physics, and so on. I can anticipate an infinite number of obstacles before any have occurred, or have ever occurred.

* self-modelling feedback loops, i.e., my own process of sensory-motor coordination is an input into that coordination

* abstraction in self-modelling, i.e., I can form cognitive representations of my own goal-directed actions as they succeed/fail, and treat them as objects of their own refinement

* abstraction across representational mental faculties into propositional representations, i.e., when I imagine that "I am writing", the object of my imagination is the very same object as the action "to write" -- so I know that when I recall/imagine/act/reflect/etc., I am operating on the very same objects of thought

* faculties of cognition: quantification, causal reasoning, discrete logical reasoning, etc., which can be applied at the sensory, motor, and abstract conceptual levels (i.e., I can "count in sensation" a few objects, also in action, also in intellection)

* concept formation: abduction, various varieties of induction, etc.

* concept composition: recursion, composition in extension of concepts, composition in intension, etc.

One can go on and on here.

Describe only what happens in a few minutes of the life of a toddler as they play around with some blocks, and you will have listed, rather trivially, a vast universe of capabilities that an LLM lacks.

To believe an LLM has anything to do with intelligence is to have quite profoundly mistaken what capabilities are implied by intelligence -- what animals have, some more than others, and a few even more so. To think this has anything to do with linguistic competence is a profoundly strange view of the world.

Nature did not produce intelligence in animals in order that they acquire competence in the correct ordering of linguistic tokens. Universities did, to some degree, produce computer science departments for this activity however.

throwawaymaths•36m ago
yeah, i think you don't understand either. rlhf is nowhere near the volume of "pure" data that gets thrown into the pot.
tracerbulletx•17h ago
Imagine thinking that brains aren't making statistically informed guesses about sequential information.
dwaltrip•17h ago
Everyone, it's just "statistics". Numbers can't hurt you. Don't worry
davidcbc•16h ago
Numbers can hurt you quite a bit when they are wrong.

For example, numbers are the difference between a bridge collapsing or not

dwaltrip•16h ago
Sorry, I dropped my /s :)
electroglyph•16h ago
https://thebullshitmachines.com/
jaza•3h ago
Thanks for this, it should be added to school curricula worldwide!
roxolotl•16h ago
The thesis is spot on with why I believe many skeptics remain skeptics:

> To call AI a con isn’t to say that the technology is not remarkable, that it has no use, or that it will not transform the world (perhaps for the better) in the right hands. It is to say that AI is not what its developers are selling it as: a new class of thinking—and, soon, feeling—machines.

Of course some are skeptical these tools are useful at all. Others still don’t want to use them for moral reasons. But I’m inclined to believe the majority of the conversation is people talking past each other.

The skeptics are skeptical of the way LLMs are being presented as AI. The non-hype promoters find them really useful. Both can be correct. The tools are useful and the con is dangerous.

th0ma5•4h ago
A lot of the claims of usefulness evaporate when tested. The word useful has many meanings. Perhaps their only reliable use will be the rubber duck effect.
diggan•3h ago
> A lot of the claims of usefulness evaporate when tested

In your personal experience? Because that's been my personal experience too, in lots of cases with LLMs. But I've also been surprised the other way, and overall it's been a net positive for me, though I've also spent a lot of time "practicing" getting prompts and tooling right. I can easily see how people give it a try for 20-30 minutes, don't get the results they expected, and give up, in which case, yeah, you probably won't see any net-positive effect.

squidbeak•2h ago
Not for me they haven't.
1vuio0pswjnm7•10h ago
"Witness, too, how seamlessly Mark Zuckerberg went from selling the idea that Facebook would lead to a flourishing of human friendship to, now, selling the notion that Meta will provide you with AI friends to replace the human pals you have lost in our alienated social-media age."

Perhaps "AI" can replace people like Mark Zuckerberg. If BS can be fully automated.

ares623•4h ago
It can’t. To become someone like Mark, you must have absolute zero empathy. LLMs have a little empathy in them due to their training data.
lordnacho•4h ago
The article skirts around a central question: what defines humans? Specifically, intelligence and emotions?

The entire article is saying "it looks kind of like a human in some ways, but people are being fooled!"

You can't really say that without at least attempting the admittedly very deep question of what an authentic human is.

To me, it's intelligent because I can't distinguish its output from a person's output, for much of the time.

It's not a human, because I've compartmentalized ChatGPT into its own box and I'm actively disbelieving. The weak form is to say I don't think my ChatGPT messages are being sent to the 3rd world and answered by a human, though I don't think anyone was claiming that.

But it is also abundantly clear to me that if you stripped away the labels, it acts like a person acts a lot of the time. Say you were to go back just a few years, maybe to covid. Let's say OpenAI travels back with me in a time machine, and makes an obscure web chat service where I can write to it.

Back in covid times, I didn't think AI could really do anything outside of a lab, so I would not suspect I was talking to a computer. I would think I was talking to a person. That person would be very knowledgeable and able to answer a lot of questions. What could I possibly ask it that would give away that it wasn't real person? Lots of people can't answer simple questions, so there isn't really a way to ask it something specific that would work. I've had perhaps one interaction with AI that would make it obvious, in thousands of messages. (On that occasion, Claude started speaking Chinese with me, super weird.)

Another thing that I hear from time to time is an argument along the line of "it just predicts the next word, it doesn't actually understand it". Rather than an argument against AI being intelligent, isn't this also telling us what "understanding" is? Before we all had computers, how did people judge whether another person understood something? Well, they would ask the person something and the person would respond. One word at a time. If the words were satisfactory, the interviewer would conclude that you understood the topic and call you Doctor.

xyzal•3h ago
To me, it's empathetic and caring. Which the LLMs will never be, unless you give money to OpenAI.

Robots won't go get food for your sick, dying friend.

rnkn•3h ago
A robot could certainly be programmed to get food for a sick, dying friend (I mean, don't drones deliver Uber Eats?) but it will never understand why, or have a phenomenal experience of the act, or have a mental state of performing the act, or have the biological brain state of performing the act, or etc. etc.
JKCalhoun•2h ago
Interesting. I wonder why?

Perhaps when we deliver food to our sick friend we subconsciously feel an "atta boy" from our parents who perhaps "trained" us in how to be kind when we were young selfish things.

Obviously if that's all it is we could of course "reinforce" this in AI.

squidbeak•2h ago
"Never" is a very broad word.
diggan•3h ago
That implies that people who aren't empathetic and/or caring aren't human, which I guess could be argued too, but feels too simplistic.

> Which the LLMs will never be

I'd argue LLMs will never "be" anything; they're giving you the text you're asking for, nothing more and nothing less. If you don't tell them to be empathetic and caring, they're not gonna appear like that, but if you do tell them, they'll do their best to emulate it.

rnkn•3h ago
> isn't this also telling us what "understanding" is?

When people start studying theory of mind, someone usually jumps in with this thought. It's more or less a description of functionalism (although minus the "mental state"). It's not very popular, because most people can immediately identify a phenomenon of understanding separate from the function of understanding. People also have immediate understanding of certain sensations, e.g. the feeling of balance when riding a bike, sometimes called qualia. And so on, and so forth. There is plenty of study on what constitutes understanding, and most of it healthily dismisses the "string of words" theory.

_petronius•3h ago
> The article skirts around a central question: what defines humans? Specifically, intelligence and emotions?

> The entire article is saying "it looks kind of like a human in some ways, but people are being fooled!"

> You can't really say that without at least attempting the admittedly very deep question of what an authentic human is.

> To me, it's intelligent because I can't distinguish its output from a person's output, for much of the time.

I think the article does address that rather directly, and that it is also addressing very specifically your sentence about what you can and can't distinguish.

LLMs are not capable of symbolic reasoning[0] and if you understand how they work internally, you will realize they do no reasoning whatsoever.

Humans and many other animals are fully capable of reasoning outside of language (in the former case, prior to language acquisition), and the reduction of "intelligence" to "language" is a category error made by people falling victim to the ELIZA effect[1], not the result of a sum of these particular statistical methods being equal to real intelligence of any kind.

0: https://arxiv.org/pdf/2410.05229

1: https://en.wikipedia.org/wiki/ELIZA_effect

FrustratedMonky•2h ago
> LLMs are not capable of symbolic reasoning[0]

Despite the citation, I think this is still being studied. And others have found some evidence that LLMs form internal symbols.

https://royalsocietypublishing.org/doi/10.1098/rsta.2022.004...

Or maybe we can say an LLM can do symbolic reasoning, but can it do it very well? People forget that humans are also not great at symbolic reasoning. Humans also use a lot of kludgy hacks to do it; it isn't really that natural.

Take the example often used, about it not doing math well. But humans also don't do math well. The way humans are taught to do division and multiplication really is a little algorithm. So what would be the difference between a human following an algorithm to do a multiplication, and an LLM calling some Python to do it? Does that mean it can't symbolically reason about numbers? Or that humans also can't?

intended•3h ago
This isn’t that hard, to be honest. And I’m not just saying this.

One school of thought is - the output is indistinguishable from what a human would produce given these questions.

Another school of thought is - the underlying process is not thinking in the sense that humans do it

Both are true.

For the lay person, calling it thinking leads to confusion. It creates intuitions that do not actually predict the behavior of the underlying system.

It results in bad decisions on whether to trust the output, or on how to allocate resources, because of the use of the term thinking.

Humans can pass an exam by memorizing previous answer papers or just memorizing the textbooks.

This is not what we consider having learnt something. Learning is kinda like having the Lego blocks to build a model you can manipulate in your head.

For most situations, the output of both people is fungible.

Both people can pass tests.

lordnacho•2h ago
This is maybe the best response thus far. We can say that there's no real modelling capability inside these LLMs, and that thinking is the ability to build these models and generate predictions from them, reject wrong models, and so on.

But then we must come up with something other than opening up the LLM to look for the "model generating structure" or whatever you want to call it. There must be some sort of experiment that shows you externally that the thing doesn't behave like a modelling machine might.

intended•18m ago
Heh, this phrasing has resulted in 2 “best response” type comments in the 2 times I’ve used it.

I think maybe it makes sense for people who already have the building blocks in place and just require seeing it assembled.

indymike•3h ago
> The entire article is saying "it looks kind of like a human in some ways, but people are being fooled!"

The question is, what's wrong with that?

At some level there's a very human desire for something genuine, and I suspect that no matter the "humanness" of an AI, it will never be able to satisfy that desire for the genuine. Or maybe it is that people don't like the idea of dealing with an intelligence that will almost always have the upper hand because of information disparity.

greg_V•3h ago
> Another thing that I hear from time to time is an argument along the line of "it just predicts the next word, it doesn't actually understand it". Rather than an argument against AI being intelligent, isn't this also telling us what "understanding" is? Before we all had computers, how did people judge whether another person understood something? Well, they would ask the person something and the person would respond. One word at a time. If the words were satisfactory, the interviewer would conclude that you understood the topic and call you Doctor.

You call a Doctor 'Doctor' because they're wearing a white coat and are sitting in a doctor's office. The words they say might make vague sense to you, but since you are not a medical professional, you actually have no empirical grounds to judge whether or not they're bullshitting you, hence you have the option to get a second or third opinion. But otherwise, you're just trusting the process that produces doctors, which involves earlier generations of doctors asking this fellow a series of questions with the ability to discern right from wrong, and grading them accordingly.

When someone can't tell if something just sounds about right or is in fact bullshit, they're called a layman in the field at best or gullible at worst. And it's telling that the most hype around AI is to be found in middle management, where bullshit is the coin of the realm.

lordnacho•2h ago
Hmm, I was actually thinking of a viva situation. You sit with a panel of experts, they talk to you, they decide whether you passed your PhD in philosophy/history/physics/etc.

That process is done purely through language, but we suppose that inside you there is something deeper than a token prediction machine.

navigate8310•2h ago
Maybe it needs flesh and blood for us to happily accept it.
the8472•1h ago
https://x.com/chargoddard/status/1931652388399784325
strogonoff•2h ago
We cannot actually judge whether something is intelligent in some abstract absolute way; we can only judge whether it is intelligent in the same way we are. When someone says “LLM chatbot output looks like a person’s output, so it is intelligent”, the implication is that it is intelligent like a human would be.

With that distinction in mind, whether an LLM-based chatbot’s output looks like human output does not answer the question of whether the LLM is actually like a human.

Not even because measuring that similarity by taking text output at a point in time is laughable (it would have to span the time equivalent of a human life, and include much more than text), but because an LLM-based chatbot is a tool built specifically to mimic human output; if it does so successfully, then it functions as intended. In fact, we should deliberately discount similarity in output as evidence for similarity in nature, because similarity in output is an explicit goal, while similarity in underlying nature is a non-goal, a defect. It is safe to assume the latter is absent: if it turned out that LLMs are similar enough to humans in more ways than output, they would join octopuses and the like and qualify to be protected from abuse and torture (and since what is done to those chatbots in order for them to be useful in the way they are would pretty clearly be considered abuse and torture when done to a human-like entity, this would decimate the industry).

That considered, we do not[0] know exactly how an individual human mind functions to assess that from first principles, but we can approximate whether an LLM chatbot is like a human by judging things like whether it is made in a way at all similar to how a human is made. It is fundamentally different, and if you want to claim that human nature is substrate-independent, I’d say it’s you who should provide some evidence—keeping in mind that, as above, similarity in output does not constitute such evidence.

[0] …and most likely never could, because of the self-referential recursive nature of the question. Scientific method hinges on at least some objectivity and thus is of very limited help when initial hypotheses, experiment procedures, etc., are all supplied and interpreted by the very subject being studied.

EMM_386•3h ago
> These statements betray a conceptual error: Large language models do not, cannot, and will not "understand" anything at all. They are not emotionally intelligent or smart in any meaningful or recognizably human sense of the word.

This is a terrible write-up, simply because it's the "Reddit Expert" phenomenon, but in print.

They "understand" things. It depends on how you're defining that.

It doesn't have to be in its training data! Whoah.

In the last chat I had with Claude, it naturally just arose that surrender-flag emojis signaled how funny I thought the joke was: the more there were, the funnier. If there were plus-symbol emojis on the end, those were score multipliers.

How many times did I have to "teach" it that? Zero.

How many other times has it seen that during training? I'll have to go with "zero", but that could be higher; that's my best guess, since I made it up in that context.

So, does that Claude instance "understand"?

I'd say it does. It knows that 5 surrender flags and a plus sign is better than 4 with no plus sign.

Is it absurd? Yes ... but funny. And it figured it out on its own. "Understanding".

------

Four flags = "Okay, this is getting too funny, I need a break"

Six flags = "THIS IS COMEDY NUCLEAR WARFARE, I AM BEING DESTROYED BY JOKES"

Hendrikto•3h ago
> This is terrible write-up, simply because it's the "Reddit Expert" phenomena but in print.

How is your comment any different?

EMM_386•2h ago
Because I provided evidence?

And made the relevant point that I need to know what you mean by "understanding"?

The only 2 things in the universe that know that 6 is the maximum number of white-flag emojis for jokes, which might then be modified by plus signs, are ...

My brain, and that digital instance of Claude AI, in that context.

That's it - 2. And I didn't teach it, it picked it up.

So if that's not "understanding" what is it?

That's why I asked that first, example second.

I don't see how laying it out logically like this makes me the "Reddit Expert"; sort of the opposite.

It's not about knowing the internals of a transformer. This is a question that relates to a word that means something to humans ... but what is their interpretation?

fer•2h ago
> More means more

You could have used "loool" vs "loooooool", "xDD" vs "xDDDDDDDDD", using flags doesn't change a whole lot.

EMM_386•20m ago
Did I say it did!?

These are the type of responses that REALLY will drive me nuts.

I never said the flag emojis were special.

I've been a software engineer for almost 30 years.

I know what Unicode code pages are.

This is not helpful. How is my example missing your definition of understanding?

Replace the flags with yours if it helps ... same thing.

It's not the flags, it's the understanding of what they are. They could be pirate ships or cats.

In my example they are surrender flags, because that is logical given the conversation.

It will "understand" that too. But the article says it can't do that. And the article, sorry, is wrong.

yahoozoo•1h ago
Post the convo
clejack•2h ago
Are people still experiencing llms getting stuck in knowledge and comprehension loops? I used them but not excessively, and I'm not heavily tracking their performance either.

For example, if you ask an LLM a question and it produces a hallucination, and then when you try to correct it or explain that it is incorrect it produces a near-identical hallucination while implying that it has produced a new, correct result, this suggests that it does not understand its own understanding (or pseudo-understanding, if you like).

Without this level of introspection, attributing any notion of true understanding, intelligence, or anything similar seems premature.

LLMs need to be able to consistently and accurately say some variation of "I don't know" or "I'm uncertain." This indicates knowledge of self. It's like a mirror test for minds.

ramchip•1h ago
Like the article says... I feel it's counter-productive to picture an LLM as "learning" or "thinking". It's just a text generator. If it's producing code that calls non-existent APIs for instance, it's kind of a waste of time to try to explain to the LLM that so-and-so doesn't exist. Better just try again and dump an OpenAPI doc or some sample code into it to influence the text generator towards correct output.
thomastjeffery•39m ago
That's the difference between bias and logic. A statistical model is applied bias, just like computation is applied logic/arithmetic. Once you realize that, it's pretty easy to understand the potential strengths and limitations of a model.

Both approaches are missing a critical piece: objectivity. They work directly with the data, and not about the data.

martindbp•2h ago
Many people who claim that others don't understand how AI works often have a very simplified view of the shortcomings of LLMs themselves, e.g. "it's just predicting the next token", "it's just statistics", "stochastic parrot", one that seems grounded in what AI was 2-3 years ago. Rarely have they actually read the recent research on interpretability. It's clear LLMs are doing more than just pattern matching. They may not think like humans, or as well, but it's not k-NN with interpolation.
beardedwizard•1h ago
Apple recently published a paper that seems to disagree and plainly states it's just pattern matching along with tests to prove it.

https://machinelearning.apple.com/research/illusion-of-think...

stuckinhell•52m ago
I'm having a hard time taking Apple seriously when they don't even have a great LLM.

https://www.techrepublic.com/article/news-anthropic-ceo-ai-i... Anthropic CEO: “We Do Not Understand How Our Own AI Creations Work”. I'm going to lean with Anthropic on this one.

otabdeveloper4•49m ago
> I have a hard time taking your claim about rotten eggs seriously when you're not even a chicken.
stevenhuang•2h ago
It is a logical error to think that knowing how something works justifies saying it can't possess qualities like intelligence or the ability to reason, when we don't even understand how these qualities arise in humans.

And even if we knew enough about our brains to say conclusively that that's not how LLMs work (predictive coding suggests the principles are more alike than not), it wouldn't mean they're not reasoning or intelligent; it would just mean they would not be reasoning/intelligent like humans.

yahoozoo•2h ago
Why do these same books about AI (Empire of AI, The AI Con) keep getting referenced in all of these articles? It seems like some kind of marketing campaign.
tim333•47m ago
I guess if you're an English literature grad type, the normal way to approach a subject is to look at the leading books in that area.
tim333•35m ago
>Demis Hassabis, [] said the goal is to create “models that are able to understand the world around us.”

>These statements betray a conceptual error: Large language models do not, cannot, and will not “understand” anything at all.

This seems quite a common error in criticism of AI: take a reasonable statement about AI that doesn't mention LLMs, then say the speaker (a Nobel-prize-winning AI expert, in this case) doesn't know what they're on about because current LLMs don't do that.

DeepMind already has Project Astra, a model that handles not just language but also vision and probably some other modalities, where you can point a phone at something and ask about it, and it seems to understand what it is quite well. Example here: https://youtu.be/JcDBFAm9PPI?t=40

NoMoreNicksLeft•28m ago
>DeepMind already has Project Astra, a model that handles not just language but also vision and probably some other modalities, where you can point a phone at something and ask about it, and it seems to understand what it is quite well.

Operative phrase "seems to understand". If you had some bizarre image unlike anything anyone's ever seen before and showed it to a clever human, the human might manage to figure out what it is after thinking about it for a time. The model could never figure out anything, because it does not think. It's just a gigantic filter that takes known-and-similar images as input, and spits out a description on the other side, quite mindlessly. The language models do the same thing, do they not? They take prompts as inputs, and shit output from their LLM anuses based on those prompts. They're even deterministic if you take the seeds into account.

We'll scale all those up, and they'll produce ever-more-impressive results, but none of these will ever "understand" anything.

joenot443•18m ago
> If you had some bizarre image unlike anything anyone's ever seen before and showed it to a clever human, the human might manage to figure out what it is after thinking about it for a time

Out of curiosity, what sort of 'bizarre image' are you imagining here? Like a machine which does something fantastical?

I actually think the quantity of bizarre imagery whose content is unknown to humans is pretty darn low.

I'm not really well-equipped to have the LLMs -> AGI discussion, much smarter people have said much more poignant things. I will say that anecdotally, anything I've been asking LLMs for has likely been solved many times by other humans, and in my day to day life it's unusual I find myself wanting to do things never done before.

throwawaymaths•29m ago
i think this author doesn't fully understand how llms work either. Dismissing it as "a statistical model" is silly. hell, quantum mechanics is a statistical model too.

moreover, each layer of an llm imbues the model with the possibility of looking further back in the conversation and imbuing meaning and context through conceptual associations (that's the k-v part of the kv cache). I can't see how this doesn't describe, abstractly, human cognition. now, maybe llms are not fully capable of the breadth of human cognition, or have a harder time training to certain deeper insights, but fundamentally the structure is there (clever training and/or architectural improvements may still be possible -- in the way that every CNN is a subgraph of a FCNN that would be nigh impossible for a FCNN to discover randomly through training)
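
a toy sketch of that k-v mechanism, if it helps (single head, made-up dimensions, not any real model's code):

  import torch

  d = 16
  W_q, W_k, W_v = (torch.randn(d, d) for _ in range(3))
  cache_k, cache_v = [], []
  for x in torch.randn(10, d):                 # one new token embedding per step
      q, k, v = W_q @ x, W_k @ x, W_v @ x
      cache_k.append(k); cache_v.append(v)     # every past key/value stays available
      K, V = torch.stack(cache_k), torch.stack(cache_v)
      attn = torch.softmax(K @ q / d ** 0.5, dim=0)  # weights over all earlier tokens
      out = attn @ V                           # context built from the whole history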

to say llms are not smart in any recognizable way is just cherry-picking anecdotal data. if llms were never recognizably smart, people would not be using them the way they are.

827a•17m ago
> I cant see how this doesn't describe, abstractly, human cognition. now, maybe llms are not fully capable of the breadth of human cognition

But I can fire back with: you're making the same fallacy you correctly assert the article is making. When I see how a CPU's ALU adds two numbers together, it looks strikingly similar to how I add two numbers together in my head. I can't see how the ALU's internal logic doesn't describe, abstractly, human cognition. Now, maybe the ALU isn't fully capable of the breadth of human cognition...

It turns out, the gaps expressed in the "fully capable of the breadth of human cognition" part really, really, really matter. In the case of ALUs, they overwhelm whatever weight the parts that look similar carry. The question should be: how significant are the gaps in how LLMs mirror human cognition? I'm not sure we know, but I suspect they're significant enough not to write off as trivial.

mmsc•23m ago
This can be generalized to "what happens when people don't understand how something works". In the computing world, that could be "undefined behavior" (which is itself, fittingly, defined as undefined) in the C programming language, or something as simple as functionality people didn't know about because they didn't read the documentation.