* Meeting notes which was read accurately from a handwritten note (impressive!) but the summary hallucinated information that was completely made up. * Running omplex pytorch benchmarks while getting the simple parts of it completely wrong. We're talking getting variants of y=f(wx+b), which is what was being compared. All the graphs and visualizations look very convincing, but the details of what's tested completely bonkers.
Is there a petition to bring o3 back? Please? At least it was obvious when it failed.
Womp womp. Frustrating.
Then ‘roll back’ to the real version - but only for paid users.
Imagine how much worse it’d have gone if they called it GPT-4o lite and gave that to free users only and kept 4o for paid only.
Maybe it will make more people subscribe?
But it will make people cancel their subs too - I miss o3
The same issue exists with a bunch of other types of image output from ChatGPT - graphs, schematics, organizational charts, etc. It's been getting better at generating images which look like the type of image you requested, but the accuracy of the contents hasn't kept up.
The limitations of what was believed to be by many as a path to AGI/ASI are becoming more clearly apparent.
Difficult to say how much room for improvement there is, or to have a definite answer regarding the usefulness and economic impact of those models, but what we're seeing now is not exponential improvement.
This is not going to rewrite and improve itself, or to cure cancer, unify physics or any kind of scientific or technological breakthrough.
For coders is is merely a dispensable QoL improvement.
careful. I too am pessimistic on the generative AI hype, but you seem even more so, to the point where it’s making you biased and possibly uninformed.
Today’s news from BBC, 6 hours ago. “AI designs antibiotics for gonorrhoea and MRSA superbugs”
https://www.bbc.com/news/articles/cgr94xxye2lo
> Now, the MIT team have gone one step further by using *generative AI* to design antibiotics in the first place for the sexually transmitted infection gonorrhoea and for potentially-deadly MRSA (methicillin-resistant Staphylococcus aureus).
…
> "We're excited because we show that generative AI can be used to design completely new antibiotics," Prof James Collins, from MIT, tells the BBC.
Generative AI is a lot of things. LLM’s in particular (subset of generative AI) are somewhat useful, but nowhere near as useful as what Sam claims. And i guess LLM’s specifically - if we focus on chatgpt, will not be solving cancer lol.
So we agree that Sam is selling snake oil. :)
Just wanted to point out that a lot of the fundamental “tech” is being used for genuinely useful things!
This is the natural progression of mass market business where cost savings is valued and quality is not. If you as a customer want a higher quality product, you are left to the edges of the market of boutique, bespoke, upscale experiences which are only able to be offered at high quality because their scale is small and more manageable in all metrics and their existence against the walmarts of their industry is dependent on being at a higher quality offering.
blibble•2h ago
but no con can endure forever
dismalaf•2h ago
The wall is very obvious now though.
dinkblam•2h ago
no one in the industry could have believed that
stephc_int13•2h ago
I am not in the industry but I've been following closely and I am usually skeptical, but while I erred on the side of "this is just a tool" I also wondered "what if?" more than once.
Herring•2h ago
https://metr.org/blog/2025-03-19-measuring-ai-ability-to-com...