UP: It lets us state intent in plain language, specs, or examples. We can ask the model to invent code, tests, docs, diagrams—tasks that previously needed human translation from intention to syntax.
BUT SIDEWAYS: Generation is a probability distribution over tokens. Outputs vary with sampling temperature, seed, context length, and even with identical prompts.
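As a toy illustration of that point, here is a minimal sketch of temperature-scaled sampling over a made-up logit vector (function name and numbers are mine, purely for illustration): the same logits give different tokens under different temperatures and seeds.

```python
import math
import random

def sample(logits, temperature=1.0, seed=None):
    """Sample one token index from logits after temperature scaling."""
    rng = random.Random(seed)
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Inverse-CDF sampling from the categorical distribution.
    r = rng.random()
    cum = 0.0
    for i, p in enumerate(probs):
        cum += p
        if r < cum:
            return i
    return len(probs) - 1
```

With a fixed seed the draw is reproducible; at very low temperature the distribution collapses onto the argmax, which is why "temperature 0" feels deterministic even though generation is still sampling.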
I think the tricky part is that we tend to assume prompts with similar semantic meaning will give the same outputs (as they would with a human), while LLMs can give vastly different outputs from a single spelling mistake, or a "!" instead of a "?". The effect varies greatly per model.
To your second part, I wouldn't make that assumption - I can see how a non-technical person might, but surely programmers wouldn't? I've certainly produced very different output from what I intended in boring old C with a misplaced semicolon, after all!
Trust me, this response would have been totally different if I were in a different mood.
I don't think that's how you should think about these things being non-deterministic though.
Let's call that technical determinism, and then introduce a separate concept, practical determinism.
What I'm calling practical determinism is your ability as the author to predict (determine) the results. Two different prompts that mean the same thing to me will give different results, and my ability to reason about the results from changes to my prompt is fuzzy. I can have a rough idea, I can gain skill in this area, but I can't gain anything like the same precision as I have reasoning about the results of code I author.
I do hope he takes the time to get good with them!
I dunno, sometimes it's helpful to learn about the perspectives of people who've watched something from afar as well, especially if they already have broad knowledge and context that is adjacent to the topic itself, and have lots of people around them deep in the trenches that they've discussed with.
A bit like historians still can provide valuable commentary on wars, even though they (probably) haven't participated in the wars themselves.
It's also a huge barrier to adoption by mainstream businesses, which are used to working to unambiguous business rules. If it's tricky for us developers, it's even more frustrating for end users. Very often they end up just saying, f* it, this is too hard.
I also use LLMs to write code, and for that they are a huge productivity boon. Just remember to test! But I'm noticing that use of LLMs in mainstream business applications lags the hype quite a bit. They are touted as panaceas, but like any IT technology they are tricky to implement. People always underestimate the effort necessary to get a real return, even with deterministic apps. With non-deterministic apps it's an even bigger problem.
Counting tokens is the only reliable defence I've found to this.
It would make sense to me for the chat context to raise an exception. Maybe I should read the docs further…
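A pre-flight check along these lines is one way to do it. This is a rough sketch with a crude chars/4 token estimate; in practice you'd want a real tokenizer (e.g. tiktoken for OpenAI models). The message shape and the `reserve_for_output` parameter are my assumptions for illustration.

```python
def estimate_tokens(text: str) -> int:
    """Very rough estimate (~4 chars/token for English text).
    Swap in a real tokenizer for anything serious."""
    return max(1, len(text) // 4)

def check_context(messages, context_window: int, reserve_for_output: int = 512) -> int:
    """Raise before sending if the prompt likely exceeds the model's window,
    leaving room for the completion. Returns the estimated prompt size."""
    used = sum(estimate_tokens(m["content"]) for m in messages)
    if used + reserve_for_output > context_window:
        raise ValueError(
            f"prompt ~{used} tokens + {reserve_for_output} reserved for output "
            f"exceeds context window of {context_window}"
        )
    return used
```

Raising client-side like this at least gives consistent behaviour regardless of whether the endpoint errors, truncates silently, or signals overflow some other way.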
Not every endpoint works the same way. I'm pretty sure LM Studio's OpenAI-compatible endpoints will silently (from the client's perspective) truncate the context rather than throw an error; it's up to the client to make sure the context fits in those cases.
OpenAI's own endpoints do return an error and refuse the request if you exceed the context length, though. I think I've also seen others use the "finish_reason" attribute to signal that the context length was exceeded, rather than setting an error status code on the response.
Overall, even "OpenAI-compatible" endpoints often aren't 100% faithful reproductions of the OpenAI endpoints, sadly.
What do you do if you want to support multiple models in your LLM gateway? Do you throw an error if a user sets temperature for o3, thus dumping the problem on them? Or just ignore it, but potentially creating confusion because temperature will seem to not work for some models?
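One middle ground is to filter per model and log loudly, with an optional strict mode that rejects instead. This is just a sketch: the capability table is hypothetical (a real gateway would load it from config or provider metadata), though o3 genuinely does reject sampling parameters like temperature.

```python
import logging

# Hypothetical per-model table of unsupported sampling parameters.
UNSUPPORTED = {
    "o3": {"temperature", "top_p"},
}

def prepare_request(model: str, params: dict, strict: bool = False) -> dict:
    """Drop (or, in strict mode, reject) params the target model ignores."""
    bad = UNSUPPORTED.get(model, set()) & params.keys()
    if bad and strict:
        raise ValueError(f"{model} does not support: {sorted(bad)}")
    for key in sorted(bad):
        logging.warning("ignoring unsupported param %r for model %s", key, model)
    return {k: v for k, v in params.items() if k not in bad}
```

Neither option is great, which I think is your point: silent dropping hides the problem, strict rejection pushes it onto every caller. At least a warning leaves a trail when someone wonders why temperature "does nothing".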
Not actually true. Fuzzing and mutation testing have been here for a while.
Otherwise yeah, there are a bunch of non-deterministic technologies, processes and workflows missing, like what Machine Learning folks have been doing for decades - also software, also non-deterministic, but off-topic given the context of the article, as I read it.
This is not the first rodeo of our profession with non-determinism.
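To make the fuzzing point concrete, here's a minimal random fuzz loop - a toy sketch, nothing like a real coverage-guided fuzzer such as AFL. The buggy `parse` target is made up for illustration.

```python
import random

def fuzz(fn, n_cases=1000, seed=0):
    """Throw random byte strings at fn; collect any inputs that raise."""
    rng = random.Random(seed)
    failures = []
    for _ in range(n_cases):
        data = bytes(rng.randrange(256) for _ in range(rng.randrange(1, 33)))
        try:
            fn(data)
        except Exception:
            failures.append(data)
    return failures

def parse(data: bytes) -> int:
    """Deliberately buggy toy parser: chokes on NUL bytes."""
    if b"\x00" in data:
        raise ValueError("unexpected NUL")
    return len(data)
```

The inputs are random, yet the workflow is perfectly respectable engineering - we've been deliberately injecting non-determinism into testing for decades and reining it in with seeds and minimized reproducers.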
Languages are created to support both computers and humans. And to most humans, abstractions such as those presented by, say, Hibernate annotations are as non-deterministic as can be. To the computer it is all the same, but that is increasingly becoming less relevant, given that software is growing and has to be maintained by humans.
So, yes, LLMs are interesting, but not necessarily that much of a game-changer when compared to the mess we are already in.
oytis•3d ago
Even if we assume there is value in it, why should it replace (even if in part) the previous activity of reliably making computers do exactly what we want?
dist-epoch•4h ago
darkwater•3h ago
dist-epoch•1h ago
A contrived example: there are only 100 MB of disk space left, but 1 GB of logs to write. The LLM discards 900 MB of logs and keeps only the most important lines.
Sure, you can nitpick this example, but it's the kind of edge-case handling where LLMs can "do something reasonable" that previously required hard coding and special casing.
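For contrast, the hard-coded special casing might look like this sketch - severity keywords, ranking, and the byte-budget logic are all assumptions I'm making up to illustrate the old-school approach:

```python
def keep_important(lines, budget_bytes):
    """Hard-coded fallback: keep lines by severity rank until the budget runs out."""
    priority = {"ERROR": 0, "WARN": 1, "INFO": 2, "DEBUG": 3}

    def rank(line):
        for level, p in priority.items():
            if level in line:
                return p
        return 4  # unknown severity sorts last

    kept, used = [], 0
    for line in sorted(lines, key=rank):
        size = len(line.encode()) + 1  # +1 for the newline
        if used + size > budget_bytes:
            break
        kept.append(line)
        used += size
    return kept
```

The rigid version is predictable but brittle (multi-line stack traces, unfamiliar log formats); the LLM version degrades more gracefully on inputs nobody anticipated, at the cost of predictability.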
oytis•3h ago
dist-epoch•1h ago
So you trade reliability to get to that extra 20% of hard cases.
pydry•49m ago
When I watch juniors struggle they seem to think that it's because they don't think hard enough, whereas it's usually because they didn't build enough infrastructure that would prevent them from needing to think too hard.
As it happens, when it comes to programming, LLM unreliabilities seem to align quite closely with ours so the same guardrails that protect against human programmers' tendencies to fuck up (mostly tests and types) work pretty well for LLMs too.
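A concrete example of that kind of guardrail: a small typed reference function plus tests that pin down its behaviour, so it doesn't matter whether a human or an LLM (re)writes the body. The function here is entirely made up for illustration.

```python
from typing import List

def moving_average(xs: List[float], window: int) -> List[float]:
    """Reference behaviour the guardrail pins down."""
    if window <= 0:
        raise ValueError("window must be positive")
    return [sum(xs[i:i + window]) / window
            for i in range(len(xs) - window + 1)]

def test_moving_average():
    """Guardrail tests: these must keep passing after any rewrite."""
    assert moving_average([1.0, 2.0, 3.0, 4.0], 2) == [1.5, 2.5, 3.5]
    assert moving_average([5.0], 1) == [5.0]
    try:
        moving_average([1.0], 0)
        assert False, "expected ValueError"
    except ValueError:
        pass
```

Off-by-one window handling and unchecked edge cases are exactly the mistakes both juniors and LLMs make, and the same test pins both down.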
furyofantares•24m ago
Maybe that does add up to solving harder, higher-level real-world problems (business problems) from a practical standpoint; perhaps that's what you mean, rather than technical problems.
Or maybe you're referring to producing software which utilizes LLMs, rather than using LLMs to program software (which is what I think the blog post is about, but we should certainly discuss both.)
kookamamie•1h ago
Insanity•1h ago
(Attaching too much value to the person instead of the argument is more of an ‘argument from authority’)
kookamamie•22m ago