frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

ChatGPT Is Still a Bullshit Machine

https://gizmodo.com/chatgpt-is-still-a-bullshit-machine-2000640488
25•01-_-•3h ago

Comments

eurekin•3h ago
Unusual for such outlets to take jabs at prominent companies. Normally, they are much more lenient. Interesting
nadermx•3h ago
I don’t think comparing a LLM to a calculator is necessarily apt. If anything i'd say you can use these LLM's as a reflection of you. If you think Alabama has an R. Then it's not maths fault it tries to find an answer that matches your persistence, especially since I'm sure somewhere in its training set alabamer exists.
perching_aix•3h ago
I'd personally liken it to expecting planes to fly like birds do.
flax•2h ago
Perhaps this is a good analogy, in which case I'd prefer they stop advertising it as a better/faster/cheaper bird. Speaking as a metaphorical bird, it clearly cannot do well what I do. It does do it poorly at a remarkable speed though.

So what is the software development task that this plane excels at? Other than bullshitting one's manager.

chowells•3h ago
When the marketing tells us it's like talking to a PhD in the relevant field on any topic, it's worth pointing out that's only true if the PhD in question has recently suffered severe head trauma.
danbruc•3h ago
Then it's not maths fault it tries to find an answer that matches your persistence, especially since I'm sure somewhere in its training set alabamer exists.

It is not supposed to find an answer that matches my persistence, its supposed to tell the truth or admit that it does not know. And even if there is an alabamer in the training set, that is either something else, not a US state, or a misspelling, in neither case should it end up on the list.

seba_dos1•2h ago
No, it is supposed to find an answer that matches your persistence. That's what it does, and understanding that is the key to understanding its strengths and weaknesses. Otherwise you may just keep drinking the investors' kool-aid and pretend that it's a tool that's supposed to tell the truth. That's not what it does, that's not how it works and it's a safe bet that's not how it's gonna work in foreseeable future.
drooby•3h ago
What are folks uploading an article that's the equivalent of supermarket tabloid junk?

You just like the title?

bigyabai•3h ago
This is one of the first (and nicest) editorials in a long line of "ChatGPT never delivered on it's promises" you will start seeing soon.
mjd•3h ago
We already know the system is really bad at spelling. I have Claude configured to periodically remind me “By the way, I think there are ** n's in 'banana'”, so I don't forget what I am dealing with. It has never gotten this right.

But that doesn't mean that it is not extremely useful. It only means I shouldn't ask it to spell stuff.

If a human is unable to count the n's in 'banana' we expect them to be barely functional. Articles like this one try to draw the same inference about the LLM: it can't count 'n's, so it must not be able to do anything else either.

But it's a bad argument, and I'm tired of hearing it.

yogurtboy•2h ago
I don't disagree with your first point, that it's not still extremely useful despite its flaws. I absolutely use it to build project outlines, write code snippets, etc.

Your overall conclusion though seems a little free of context. Average people (i.e. my mom googling something) absolutely do not have the wherewithal to keep track of the various pros and cons of the underlying system that generates the magical giant blue box at the top of their search that has all the answers. They are being deliberately duped by the salesmen-in-chief of these giant companies, as are all of their investors.

thomassmith65•2h ago
It's as much that LLMs are bad at counting letters in words as it is that humans are good at it.

LLMs are also bad at many things that humans don't notice immediately.

That is a problem because it leads humans to trust LLMs with tasks at which LLMs currently are bad, such as picking stocks, screening job applicants, providing life advice...

dragonwriter•2h ago
The particular problem (and one that AI firms marketing approaches have actively leveraged and made worse[0]) is that correlations between capacities that humans are used to from observing other humans do not hold for LLMs, so assumptions about what an LLM should be able to do based what is observed to do and what a human ovserved to do that qould also be expected to be capable of do not hold even as loose rules of thumb.

[0] e.g., by promoting AIs as having equivalent capacities of humans of various education levels because they could pass tests that were part of the standards for, and correlate for humans with other abilities of, people with that educational background.

drweevil•1h ago
It's a reminder that LLMs are not reasoning machines. LLMs are very useful in many cases, but one should not treat them as if they can reason.

Spindle

https://blog.tangled.sh/ci
2•todsacerdoti•4m ago•0 comments

Trump to blame for high cost of living, Americans say in new poll

https://www.theguardian.com/business/2025/aug/01/trump-inflation-cost-of-living-poll
3•PaulHoule•5m ago•0 comments

Show HN: I Made a "Block Blast " Solver

https://www.adriclumma.com/projects/blockBlastSolver/
2•xFixItNow•6m ago•0 comments

China's overlapping tech-industrial ecosystems

https://www.high-capacity.com/p/chinas-overlapping-tech-industrial
1•walterbell•8m ago•0 comments

Essential Books on the Science of Reading

https://journal.imse.com/ten-essential-books-the-science-of-reading/
1•mindcrime•9m ago•0 comments

Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

https://arxiv.org/abs/2508.01191
1•nnx•14m ago•0 comments

Indigenous Runner Wins 63K Ultramarathon After Walking 14 Hours to Starting Line

https://mymodernmet.com/candelaria-rivas-ramos-ultramarathon-runner/
1•bookofjoe•17m ago•0 comments

HappyX – Macro-oriented asynchronous web-framework

https://github.com/HapticX/happyx
1•TheWiggles•17m ago•0 comments

'I Feel Like I'm Going Crazy': ChatGPT Fuels Delusional Spirals

https://www.wsj.com/tech/ai/i-feel-like-im-going-crazy-chatgpt-fuels-delusional-spirals-ae5a51fc
2•jrflowers•25m ago•0 comments

Benchmarks Show Speculative Decoding Needs the Right Draft Model for 3× Gains

https://www.bentoml.com/blog/3x-faster-llm-inference-with-speculative-decoding
1•bbzjk7•32m ago•0 comments

Cosmic Ray Bit Flips and the Hidden Risk at Scale

https://cside.dev/blog/cosmic-ray-bit-flips-and-the-hidden-risk-at-scale
3•s-mon•32m ago•0 comments

Founder Virtues, per the Thiel Fellowship

https://venki.dev/notes/thiel-founder-virtues
2•venkii•36m ago•0 comments

The Calculus of Grit (2011)

https://www.ribbonfarm.com/2011/08/19/the-calculus-of-grit/
2•venkii•41m ago•0 comments

China Tells Brokers to Stop Touting Stablecoins to Cool Frenzy

https://www.bloomberg.com/news/articles/2025-08-08/china-tells-brokers-to-stop-touting-stablecoins-to-cool-frenzy
2•TMWNN•42m ago•0 comments

Ask HN: Why do readmes still use $ in copy-pasteable commands?

2•garyfirestorm•43m ago•2 comments

Alzheimer's Breakthrough: Lithium Reverses Memory Loss in Mice

https://www.sciencealert.com/alzheimers-breakthrough-lithium-reverses-memory-loss-in-mice
2•amichail•46m ago•1 comments

Sam Altman does damage control: GPT-5 rollout's unpopular changes will be undone

https://xcancel.com/sama/status/1953893841381273969
4•alecco•46m ago•0 comments

Future AI bills of $100k/yr per Dev

https://blog.kilocode.ai/p/token-growth-indicates-future-ai
2•tirumario•47m ago•0 comments

Apollo 13 moon mission leader James Lovell dies at 97

https://apnews.com/article/james-lovell-dies-obituary-apollo-13-astronaut-ed08c1efc0a74fbd9d47868ff9983a23
3•divbzero•49m ago•1 comments

Google commits $1B for AI training at US universities

https://www.reuters.com/world/us/google-commits-1-billion-ai-training-us-universities-2025-08-06/
2•rbanffy•49m ago•0 comments

Atom aims to create a U.S. rival to China's open-source AI technology

https://www.washingtonpost.com/politics/2025/08/05/atom-project-open-source-ai-china/
2•rbanffy•50m ago•0 comments

3D printing and AI used to slash nuclear reactor component construction time

https://www.tomshardware.com/3d-printing/3d-printing-and-ai-used-to-slash-nuclear-reactor-component-construction-time-from-weeks-to-days-pioneers-hail-new-era-of-nuclear-construction
1•rbanffy•51m ago•0 comments

NextCoder by Microsoft — LLM performing on par with GPT-4o on complex benchmarks

https://huggingface.co/microsoft/NextCoder-32B
3•maxloh•56m ago•0 comments

High costs and thin margins threatening AI coding startups

https://techcrunch.com/2025/08/07/the-high-costs-and-thin-margins-threatening-ai-coding-startups/
2•gpi•59m ago•0 comments

Hacker News Extension

https://github.com/sea2ocean/hn-quickview
2•alexmonami•1h ago•1 comments

GPT-5 is better, but isn't giant leap forward for vision

https://blog.roboflow.com/reflections-on-gpt-5-vision-capabilities/
2•jswandev•1h ago•0 comments

Little-known leguminous plant can increase beef production by 60% (2022)

https://www.embrapa.br/en/busca-de-noticias/-/noticia/75361634/little-known-leguminous-plant-can-increase-beef-production-by-60
17•littlexsparkee•1h ago•1 comments

Pedestrians walk faster than NYC crosstown bus

https://www.fox5ny.com/news/nyc-midtown-bus-transportation-mamdani-lander-vote-traffic
3•impish9208•1h ago•0 comments

Hospital fined after patient files used as snack bags

https://www.bangkokpost.com/thailand/general/3080090/hospital-fined-after-patient-files-used-as-snack-bags
1•gnabgib•1h ago•0 comments

What to know about cardiac catheterization vs. angiogram

https://www.medicalnewstoday.com/articles/cardiac-catheterization-vs-angiogram
1•teleforce•1h ago•0 comments