"Modern LLMs now use a default temperature of 1.0, and I theorize that higher value is accentuating LLM hallucination issues where the text outputs are internally consistent but factually wrong." [0]
I think this need to bullshit is probably inherent in LLMs. It’s essentially what they are built to do: take a text input and transform it into a coherent text output. Truth is irrelevant. The surprising thing is that they can ever get the right answer at all, not that they bullshit so much.
No one is calling the crap that shows up in JPEGs "hallucinations" or "bullshit"; it's commonly accepted side effects of the compression algorithm that makes up shit that isn't there in the original image. Now we're doing the same lossy compression with language and suddenly it's "hallucinations" and "bullshit" because it's so uncanny.
It's why AI output is meaningless for everyone except the querant. No one cares about your horoscope. AI shares every salient feature with divination, except the aesthetics. The lack of candles, robes, and incense - the pageantry of divination means a LOT of people are unable to see it for what it is.
We live in a culture so deprived of meaning we accidentally invented digital tea readings and people are asking it if they should break up with their girlfriend.
Great. Implement it, benchmark, slower. In some cases much slower. I tell ChatGPT it's slower, and it confidently tells me of course it's slower, here's why.
The duality of LLMs, I guess.
CGT: The tallest tree in Texas is a 44 foot tall tree in ...
Me: No it's not! The tallest tree is a pine in East Texas!
CGT: You're right! The tallest tree in Texas is probably a Loblolly Pine in East Texas; they grow to a height of 100–150', but some have been recorded to be 180' or more.
Me: That's not right! In 1890 a group of Californians moved to Houston and planted a Sequoia, it's been growing there since then, and is nearly 300 feet tall.
CGT: Yes, correct. In the late 19th century, many Sequoia Sempervirens were planted in and around Houston.
...
I mean, come on; I already spew enough bullshit, I don't need an automated friend to help out!
dewarrn1•4h ago
ActorNightly•3h ago