That NotebookLM podcast was about the most unpleasant way I can imagine to consume content. Reading transcripts of live talks is already pretty annoying because speech is less concise than the written word. Having it re-expanded by a robot voice back into audio to be read to me just makes it even more unpleasant.
Also, it's sort of perverse that we're going audio -> transcript -> fake audio. "YC has said the official video will take a few weeks to release" - I mean, shouldn't one of the 100 AI startups solve this for them?
Anyway, maybe it's just me... I'm the kind of guy who got a cynical chuckle at the airport the other week when I saw a "magazine of audiobooks".
The voices sounded REALLY good the first time I used it, but they sounded exactly the same every time after that, and I became underwhelmed.
https://vocaroo.com/1nZBz5hdjwEh
As a bonus, it's hilarious in its own right.
I don't think it's the 4th wave of pioneering a new dawn of civilization, but it's clear LLMs will remain useful when applied correctly.
It felt like that was the direction for a while, but in the last year or so, the gap seems to have widened. I'm curious whether this is my perception or validated by some metric.
Another way to put it: it usually takes a little while for open source projects to catch up, but once they do, they gain traction quite quickly over their closed source counterparts.
The time horizons will be different as they always are, but I believe it will happen eventually.
I'd also argue that browsers got complicated pretty fast; a far cry from libhtml in a few short years.
[0]: I contend that the most useful applications of this technology will not be the generalized ChatGPT interface but specialized, highly tuned models that don't need the scope of generalized querying
One crazy thing is that since I keep all my PIM data in git as flat text, I now have essentially "Siri for Linux" too if I want it. It's a great example of what Karpathy was talking about, where improvements in the ML model have consumed the older decision trees and coded integrations.
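The glue is trivial, by the way. A minimal sketch of the idea, assuming an Ollama server on the default port (the paths, model tag, and question are just illustrative, not my actual setup):

    import glob
    import os
    from openai import OpenAI

    # Any OpenAI-compatible endpoint works; here, a local Ollama server.
    client = OpenAI(base_url="http://localhost:11434/v1", api_key="unused")

    # Slurp the flat-text PIM files straight out of the git checkout.
    notes = "\n".join(
        open(p).read() for p in glob.glob(os.path.expanduser("~/pim/*.txt"))
    )

    resp = client.chat.completions.create(
        model="qwen3:8b",  # illustrative model tag
        messages=[{
            "role": "user",
            "content": f"Here are my notes:\n{notes}\n\nWhen is my next appointment?",
        }],
    )
    print(resp.choices[0].message.content)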
I'd highly recommend /nothink in the system prompt. Qwen3 is not good at reasoning and tends to get stuck in loops until it fills up its context window.
My current config is qwen2.5-coder-0.5b for my editor plugin and qwen3-8b for interactive chat and aider. I use nibble (4-bit) quants for everything. 0.5b is not enough for something like aider; 8b is too much for interactive editing. I'd also recommend shrinking the ring context in the neovim plugin if you use it, since the default is 32k tokens, which takes forever and generates a ton of heat.
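For reference, here's roughly where /nothink goes if you talk to the model through an OpenAI-compatible endpoint like Ollama's (a sketch; the model tag and prompts are illustrative):

    from openai import OpenAI

    client = OpenAI(base_url="http://localhost:11434/v1", api_key="unused")

    resp = client.chat.completions.create(
        model="qwen3:8b",  # illustrative tag for the 8b chat model
        messages=[
            # /nothink in the system prompt disables Qwen3's reasoning mode,
            # which otherwise tends to loop until the context window fills.
            {"role": "system", "content": "/nothink You are a concise coding assistant."},
            {"role": "user", "content": "Why does this raise IndexError: list index out of range?"},
        ],
    )
    print(resp.choices[0].message.content)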
I stick by my general thesis that OSS will eventually catch up, or that the gap will be so small that only frontier applications will benefit from using the most advanced models.
If you read the talk you can find out this and more :)
One bundles "AGI" with broken promises and bullshit claims of "benefits to humanity" and "abundance for all" while at the same time taking jobs away, with the goal of achieving 10% global unemployment in the next 5 years.
The other is an overpromised scam wrapped up in worthless minted "tokens" on a slow blockchain (Ethereum).
Terms like "Software 3.0", "Web 3.0" and even "AGI" are all bullshit.
Reminds me of work where I spend more time figuring out how to run repos than actually modifying code. A lot of my work is focused on figuring out the development environment and deployment process - all with very locked down permissions.
I do think LLMs are likely to change the industry considerably, as LLM-guided rewrites are sometimes easier than adding a new feature or fixing a bug, especially if the rewrite is into something more LLM-friendly (e.g., a popular framework). Each rewrite makes the code more Claude-codeable or Cursor-codeable, ready to iterate even faster.
> imagine that the inputs for the car are on the bottom, and they're going through the software stack to produce the steering and acceleration
> imagine inspecting them, and it's got an autonomy slider
> imagine works as like this binary array of a different situation, of like what works and doesn't work
--
Software 3.0 is imaginary. All in your head.
I'm kidding, of course. He's hyping because he needs to.
Let's imagine together:
Imagine it can be proven to be safe.
Imagine it being reliable.
Imagine I can pre-train on my own cheap commodity hardware.
Imagine no one using it for war.
The danger I see is related to the psychological effects caused by humans using LLMs on other humans. I don't think that's a scenario anyone is giving much attention to, and it's not that bad (it's bad, but not end-of-the-world bad).
I totally think we should all build it. To be trained from scratch on cheap commodity hardware, so that a lot of people can _really_ learn it and quickly become literate in it. The only true way of democratizing it. If it's not that way, it's a scam.
"Q: What does your name (badmephisto) mean?
A: I've had this name for a really long time. I used to be a big fan of Diablo2, so when I had to create my email address username on hotmail, I decided to use Mephisto as my username. But of course Mephisto was already taken, so I tried Mephisto1, Mephisto2, all the way up to about 9, and all were taken. So then I thought... "hmmm, what kind of characteristic does Mephisto possess?" Now keep in mind that this was about 10 years ago, and my English language dictionary consisted of about 20 words. One of them was the word 'bad'. Since Mephisto (the brother of Diablo) was certainly pretty bad, I punched in badmephisto and that worked. Had I known more words it probably would have ended up being evilmephisto or something :p"
Unbelievable. Perhaps some techies should read Goethe's Faust instead of Lord of the Rings.
If you want to scoff at anyone, scoff at 1990s Blizzard Entertainment for using those names in that way
I think it's a bit early to change your mind here. We love your 2.0; let's wait a while longer till the dust settles so we can see clearly, and then up the revision number.
In fact, I'm a bit confused about the number AK has in mind. Does anyone else know how he arrived at Software 2.0?
I remember a talk by Professor Sussman where he suggested we don't know how to compute, yet[1].
I was thinking he meant this:
Software 0.1 - Machine Code/Assembly Code
Software 1.0 - HLLs with Compilers/Interpreters/Libraries
Software 2.0 - Language comprehension with LLMs
If we are calling weights 2.0 and NNs with libraries 3.0, then shouldn't we account for functional and OO programming in the numbering scheme?
Nerds are good at the sort of reassuring arithmetic that can make people confident in an idea or investment. But oftentimes that math misses the forest for the trees, and we're left betting the farm on a profoundly bad idea like Theranos or DogTV. Hey, I guess that's why it's called Venture Capital and not Recreation Investing.
If anything it seemed like the middle ground between AI boosters and doomers.
Maybe they didn't, and it's just your perception.
Software 3.0 isn't about using AI to write code. It's about using AI instead of code.
So not Human -> AI -> Create Code -> Compile Code -> Code Runs -> The Magic Happens. Instead, it's Human -> AI -> The Magic Happens.
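To make the contrast concrete, a toy sketch (the model name and prompt are stand-ins, not anything from the talk):

    from openai import OpenAI

    client = OpenAI()

    # Software 1.0: a human writes the logic.
    def sentiment_v1(text: str) -> str:
        return "positive" if "good" in text.lower() else "negative"

    # Software 3.0: the model *is* the logic; no classifier code exists.
    def sentiment_v3(text: str) -> str:
        resp = client.chat.completions.create(
            model="gpt-4o-mini",  # illustrative
            messages=[{"role": "user",
                       "content": f"One word, positive or negative: {text}"}],
        )
        return resp.choices[0].message.content.strip().lower()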
Started learning metal guitar seriously to forget about industry as a whole. Highly recommended!
This is why I think the AI industry is mostly smoke and mirrors. If these tools are really as revolutionary as they claim they are, then they should be able to build better versions of themselves, and we should be seeing exponential improvements of their capabilities. Yet in the last year or so we've seen marginal improvements based mainly on increasing the scale and quality of the data they're trained on, and the scale of deployments, with some clever engineering work thrown in.
3 to 5 companies instead of the hundreds of thousands who sell software now.
Recursive self-improvement is literally the endgame scenario - hard takeoff, singularity, the works. Are you really saying you're dissatisfied with the progress of those tools because they didn't manage to end the world as we know it just yet?
If the former, then yes, singularity. The only hope is its "good will" (wouldn't bet on that) or the off switch.
If the latter, you still need more workers (programmers or whatever they'll be called) due to increased demand for compute solutions.
That's too coarse of a choice. It's better than people at an increasingly large number of distinct tasks. But it's not good enough to recursively self-improve just yet, though it is doing it indirectly: it's useful enough to aid researchers and businesses in creating the next generation of models. So in a way, the recursion and the resulting exponent are already there; we're just in such early stages that it looks like linear progress.
I think a wall will be hit with this eventually, much like there was with visual recognition in the mid-2010s[0]. It will continue to improve, but not exponentially.
To be fair, I am bullish that it will make white-collar work fundamentally different, but smart companies will use it to accelerate their workforce's productivity, reliability, and delivery, not simply cut labor to the bone, despite that seemingly being every CEO's wet dream right now.
[0]: remember when everyone was making demos and apps that would identify objects and such, and all the facial augmentation stuff? My general understanding is that the tech is now in the incremental improvement stage. I think LLMs will hit the same stage in the near term and likely hover there for quite a while.
I'm personally 50/50 on this prediction at this point. It doesn't feel like we have enough ingredients for end-to-end recursive self-improvement in the next 5 years, but the overall pace is such that I'm hesitant to say it's not likely either.
Still, my reply was to the person who seemed to say they won't be impressed until they see AIs "able to build better versions of themselves" and "exponential improvements of their capabilities" - to this I'm saying, if/when it happens, it'll be the last thing that they'll ever be impressed with.
> remember when everyone was making demos and apps that would identify objects and such, and all the facial augmentation stuff? My general understanding is that the tech is now in the incremental improvement stage.
I thought that a) this got boring, and b) all those advancements got completely blown away by multimodal LLMs and other related models.
My perspective is that we had a breakthrough across the board in this a couple years ago, after the stuff you mentioned happened, and that isn't showing signs of slowing down.
The progress has been adequate and expected, save for a very few cases such as generative image and video, which have exceeded my expectations.
Before we reach the point where AI is self-improving on its own, we should go through stages where AI is being improved by humans using AI. That is, if these tools are capable of reasoning and are able to solve advanced logic, math, and programming challenges as shown in benchmarks, then surely they must be more capable of understanding and improving their own codebases with assistance from humans than humans could do alone.
My point is that if this was being done, we should be seeing much greater progress than we've seen so far.
Either these tools are intelligent, or they're highly overrated. That wouldn't mean they can't be useful, just not to the extent that they're being marketed.
Yes, and we've actually been able to witness the dubious contributions that Copilot has made on public Microsoft repositories.
I kind of expect that from someone heading a company that appears to have sold the farm on an AI gamble. It's interesting to see a similar viewpoint here (all biases considered).
What does this mean? An LLM is used via a software interface. I don’t understand how “take software out of the loop” makes any sense when we are using reprogrammable computers.
https://leanpub.com/patterns-of-application-development-usin...
> LLMs make mistakes that basically no human will make, like, you know, it will insist that 9.11 is greater than 9.9, or that there are two r's in strawberry. These are some famous examples.
But you answered it: It’s a stupid mistake a human makes when trying to mock the stupid mistakes that LLMs make!
I love Andrej, but come on.
Writing what was essentially punch cards 70 years ago, writing C 40 years ago, and writing Go or TypeScript or Haskell 10 years ago: these are all very different activities.
The main thing that changed about programming is the social/political/bureaucratic side.
great name already
They want to onboard as many people as possible onto their stuff and make them as dependent on it as possible, so the switching costs are higher.
It's the classic scam. Look at what Meta is doing now that it has reached the end of the line and is trying to squeeze people for profitability:
- Bringing Ads to WhatsApp: https://apnews.com/article/whatsapp-meta-advertising-messagi...
- Desperately trying by any illegal means possible to steal your data: https://localmess.github.io/
- Firing all the people who built their empire: https://www.thestreet.com/employment/meta-rewards-executives...
- Enabled ethnic cleansing in multiple instances: https://www.amnesty.org/en/latest/news/2022/09/myanmar-faceb...
If you can't see the total moral bankruptcy of Big Tech, you gotta be blind. Don't Be Evil my ass. To me, LLMs have only one purpose: dumb down the population, make people doubt what's real and what's not, and enrich the tech overlords while our societies drown in the garbage they create.
It takes mouse clicks, sends them to the LLM, and asks it to render static HTML+CSS of the output frame. HTML+CSS is basically a JPEG here; the original implementation WAS JPEG, but diffusion models can't do accurate enough text yet.
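The core loop is tiny. Roughly this, sketched here with an OpenAI-style client (the prompt wording and model name are stand-ins, not the original code):

    from openai import OpenAI

    client = OpenAI()

    def next_frame(current_html: str, x: int, y: int) -> str:
        """Ask the model to 'render' the next UI frame as static HTML+CSS."""
        resp = client.chat.completions.create(
            model="gpt-4o",  # illustrative
            messages=[{"role": "user", "content": (
                "You are the display of a computer.\n"
                f"Current frame:\n{current_html}\n"
                f"The user clicked at ({x}, {y}).\n"
                "Reply with only the HTML+CSS of the next frame."
            )}],
        )
        return resp.choices[0].message.content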
My conclusions from doing this project and interacting with the result were: if LLMs keep scaling in performance and cost, programming languages are going to fade away. The long-term future won't be LLMs writing code, it'll be LLMs doing direct computation.
There's so much demand around this, people are just super eager to get the information. I can understand why, because it was my favorite talk as well :)
> The more reliance we have on these models, which already is, like, really dramatic
Please point me to a single critical component anywhere that is built on LLMs. There's absolutely no reliance on models, and ChatGPT being down has absolutely no impact on anything besides teenagers not being able to cheat on their homework and LLM wrappers not being able to wrap.
Software 2.0? 3.0? Why stop there? Why not Software 1911.1337? We went through crypto, NFTs, Web 3.0, and now LLMs are hyped as if they are frigging AGI (spoiler: LLMs are not designed to be AGI, and even if they were, you sure as hell won't be the one to use them to your advantage, so why are you so irrationally happy about it?).
Man, this industry is so tiring! What is most tiring is the dog-like enthusiasm of the people who buy it EVERY. DAMN. TIME, as if it's gonna change the life of most of them for the better. Sure, some of these are worse and much more useless than others (NFTs), but at the core of all of it is this cult-like awe we as a society have towards figures like the Karpathys, Musks, and Altmans of this world.
How are LLMs gonna help society? How are they gonna help people work, create, and connect with one another? They take away the joy of making art, the joy of writing, of learning how to play a musical instrument and sing, and now they are coming for software engineering. Sure, you might be 1-2% faster, but are you happier, are you smarter (probably not: https://www.mdpi.com/2076-3417/14/10/4115)?
https://x.com/karpathy/status/1935077692258558443
I expect YC to prioritize publishing this talk, so probably the half-life of any of this work is measured in days anyway.
100% of our podcast is published for free, but we still have ~1000 people who choose to support our work with a subscription (it does help pay for editors, equipment, and travel). I always feel bad that we don't have much content for them, so I figured I'd put just the slide compilation up for subscribers. I'm trying to find nice ways to ramp up value for our subs over time, mostly by showing "work in progress" things like this that I had to do anyway to summarize/internalize the talk properly - which, again, is what we published entirely free / no subscription required.
That being said, HN is a negative place, and not what I was trying to go for. Thank you for your work with the slides!
(As a step towards making it a non-negative place.)
Edit: the emoji at the end of the original sentence was not quoted. Funny how a smile makes the difference. Original tweet: https://x.com/karpathy/status/1935077692258558443