Maybe his incisive polemic, which I greatly enjoy, was in the end little more than pandering to a certain elitist sensibility.
To make manageable programs, you have to trade off execution speed, both on the CPU and in the organization. His rather mathematized prescriptions imply we should hire quarrelsome academics like him to reduce performance and slow down product development [initially…], all in the interest of his rarefied sensibilities of elegance and simplicity.
Sucks to be right when that’s the truth.
NUMA has only gotten more complicated over time. The range of latency differences is more extreme than ever: we've got L1 at nanosecond latency, and at the other end cold tapes that can take a whole day to load. Figuring out which kind of memory/compute to use in a heterogeneous system (CPU/GPU) can also be difficult. Multi-core is likely the most devastating dragon to arrive since this article was written.
Premature optimization might be evil, but it's the only way to efficiently align the software with the memory architecture. E.g., in a Unity application, rewriting from game objects to ECS is basically like starting over.
If you could only focus on one aspect, I would keep the average temperature of L1 in mind constantly. If you can keep it semi-warm, nothing else really matters. There are very few problems that a modern CPU can't chew through ~instantly assuming the working set is in L1 and there is no contention with other threads.
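To make that concrete, here's a minimal C sketch; the sizes are assumptions (e.g. a 32 KiB L1D) and nothing here is measured. The point is just that the same summation loop is dramatically cheaper when the working set stays L1-resident:

    #include <stddef.h>
    #include <stdint.h>

    /* Assumed sizes for illustration only: ~16 KiB fits a typical 32 KiB
     * L1D; 64 MiB does not, and streams from DRAM on every pass. */
    enum { SMALL = 4 * 1024, LARGE = 16 * 1024 * 1024 };

    static int32_t small_buf[SMALL];
    static int32_t large_buf[LARGE];

    static int64_t sum(const int32_t *p, size_t n, int passes)
    {
        int64_t acc = 0;
        for (int r = 0; r < passes; ++r)
            for (size_t i = 0; i < n; ++i)
                acc += p[i];        /* same instruction mix either way */
        return acc;
    }

    /* sum(small_buf, SMALL, 4096) and sum(large_buf, LARGE, 1) touch
     * roughly the same number of elements, but the first stays warm in
     * L1 after the initial pass while the second pays a stream of cache
     * misses every time. */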
This is the same thinking that drives some of us to use SQLite over hosted SQL providers. Thinking in terms of not just the information, but the latency domain of the information, is what can unlock those bananas 1000x+ speedups.
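The arithmetic behind that is simple enough to sketch; the latencies below are rough, assumed orders of magnitude, not benchmarks of any particular provider:

    #include <stdio.h>

    int main(void)
    {
        /* Assumed, back-of-the-envelope latencies per query. */
        const double hosted_rtt_s  = 1e-3; /* ~1 ms network round trip       */
        const double sqlite_call_s = 1e-6; /* ~1 us in-process function call */
        const int    queries       = 500;  /* a hypothetical chatty page     */

        printf("hosted SQL: %.1f ms\n", queries * hosted_rtt_s  * 1e3);
        printf("SQLite    : %.1f ms\n", queries * sqlite_call_s * 1e3);
        return 0;  /* ~500 ms vs ~0.5 ms: the 1000x lives in the latency domain */
    }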
It isn't the same as manually doing the optimizations, but halfway there is better than not at all.
This is easy, as Dijkstra points out. Allocate an array ¾ the size of L1D, and write a subroutine to reverse its contents, but don't use the array otherwise. Call the useless reversal subroutine often enough (at the beginning of every other subroutine and loop body should generally be enough) that the CPU spends most of its time running it. That will ensure that most of your data references are to the array, and most of your instruction references are to the reversal subroutine, both of which are always in cache.
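In the spirit of the joke, a sketch (assuming a 32 KiB L1D, so the array is 24 KiB):

    #include <stddef.h>
    #include <stdint.h>

    /* 3/4 of an assumed 32 KiB L1 data cache. */
    enum { USELESS_BYTES = 24 * 1024 };
    static uint8_t useless[USELESS_BYTES];

    /* The useless reversal subroutine: it touches nothing but its own
     * array, so both its data and its code stay cached forever. */
    static void useless_reverse(void)
    {
        for (size_t i = 0, j = USELESS_BYTES - 1; i < j; ++i, --j) {
            uint8_t tmp = useless[i];
            useless[i]  = useless[j];
            useless[j]  = tmp;
        }
    }

    /* Call useless_reverse() at the top of every other subroutine and
     * loop body, and your cache hit rate will look magnificent. */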
Other things do matter.
I guess there is still something left, here and there, of the concept of a programming language as a tool for top-down shaping and guiding the thinking of its users. Pascal is the classic example; Golang tries to be like that. I get how annoying it can be. I don't know how JS/TypeScript constructs evolve, but I suspect that's more Fortran-style committee planning than trying to "enlighten" people into doing the "right" things. Happy to be corrected on this.
Maybe the hardest point to interpret in hindsight is the claim that in the sixties programming was an overpaid profession, that hardware costs would keep dropping, and that software costs could not stay the same (You cannot expect society to accept this, and therefore we must learn to program an order of magnitude more effectively). Yeah, in some sense, what does paying for software even mean anymore?
But interestingly, the situation now is kind of similar to the very old days: a bunch of mainframe ("cloud") owners paying programmers to program and manage their machines. And maybe the effectiveness really has gone up dramatically: there's relatively little software running compared to the crazy volume of metal machines, even though the programmers at that scale are still paid a lot. It's not like you get a team of 10 guys programming each individual server.
Oh boy does that read VERY true today!
I'm old enough to remember what text editing was like before word processors. I'm old enough to remember trying to reach people before cell phones. I'm old enough to remember trying to find information in a physical library. There are a lot of problems that electronics has solved.
> When these machines were announced and their functional specifications became known, quite a few among us must have become quite miserable; at least I was. It was only reasonable to expect that such machines would flood the computing community, and it was therefore all the more important that their design should be as sound as possible. But the design embodied such serious flaws that I felt that with a single stroke the progress of computing science had been retarded by at least ten years: it was then that I had the blackest week in the whole of my professional life. Perhaps the most saddening thing now is that, even after all those years of frustrating experience, still so many people honestly believe that some law of nature tells us that machines have to be that way. They silence their doubts by observing how many of these machines have been sold, and derive from that observation the false sense of security that, after all, the design cannot have been that bad. But upon closer inspection, that line of defense has the same convincing strength as the argument that cigarette smoking must be healthy because so many people do it.
“The Mythical Man-Month” is not about OS/360 as such, but about project planning and specifically what was learned about project management during the development.
JITs have taken this to an even higher level — people don't just argue that the machine is fast enough to run their convoluted code with countless unnecessary layers, they argue that their code as they've written it won't be run at all: the JIT will reduce it to a simpler form that can be handled efficiently.
But they can't explain why their poor coworkers who have to read and maintain the code don't deserve the same consideration as the machine!
Highly optimized code being convoluted is an extreme case, for rare algorithms or exotic levels of instruction-level efficiency. The first 95% of optimization is simplifying the code, which benefits both the machine and the programmers.
Interestingly, designing a language that requires every loop to carry a variant (a termination measure) proving it terminates is actually possible; Coq, for example, does pretty much exactly this, from what I understand. My understanding is that this means it isn't Turing complete, but I also think Turing completeness isn't quite as necessary, for as many things, as it might otherwise seem.
Dafny implements this at the compiler level (and with a curly-brace syntax!).
Coq uses other methods more tailored towards recursion.
You are right that if every loop must terminate, it is not Turing complete. So some valid programs will not compile.
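For a concrete feel, here is a sketch in Lean 4 (a proof assistant in the same family as Coq; Lean isn't mentioned in the thread, it's just my choice of example). Definitions are only accepted when the checker can see a reason they terminate:

    -- Accepted: structural recursion on a Nat; every call is on a smaller
    -- value, so termination is evident to the checker.
    def sumTo : Nat → Nat
      | 0     => 0
      | n + 1 => (n + 1) + sumTo n

    #eval sumTo 10  -- 55

    -- Rejected: something like `def spin (n : Nat) : Nat := spin n` fails
    -- to compile because no decreasing measure exists, which is exactly
    -- the "some valid programs will not compile" trade-off noted above.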
There are some interesting programs that potentially never terminate (servers, daemons, OSes, games, etc.) to which formal methods can still be applied, for instance to prove that they preserve certain properties, or that they never terminate, or that they terminate only under certain conditions.
I find the theory extremely elegant and pleasurable, but it's obviously not everyone's cup of tea, as shown by its lack of widespread use.
LLMs might create a revival in the coming years, for the following reasons:
1) The cost of formalization goes down.
2) The cost of proving goes down.
3) The cost of programming goes down.
4) Provable code quality becomes a differentiator among a sea of programs.
Absolutely right, with the implication that new capabilities suddenly available to everyone often end up making the playing field more unequal, not less.
> baroque monstrosity
Warnings. Probably beating a dead horse here, but way too many tools, and they keep adding more.
> Nowadays one often encounters the opinion that in the sixties programming has been an overpaid profession, and that in the coming years programmer salaries may be expected to go down. Usually this opinion is expressed in connection with the recession, but it could be a symptom of something different and quite healthy, viz. that perhaps the programmers of the past decade have not done so good a job as they should have done. Society is getting dissatisfied with the performance of programmers and of their products.
> We should recognise the closed subroutines as one of the greatest software inventions; it has survived three generations of computers and it will survive a few more, because it caters for the implementation of one of our basic patterns of abstraction. Regrettably enough, its importance has been underestimated in the design of the third generation computers, in which the great number of explicitly named registers of the arithmetic unit implies a large overhead on the subroutine mechanism.
Presumably what he's saying is that if you have 16 architectural registers instead of four, entering a subroutine involves saving 16 registers, which takes four times as long as saving four registers, which discourages factoring your program into small subroutines. Is there a more reasonable interpretation?
Because I think this interpretation is wrong. If your leaf subroutine requires six registers for its work, and you only have four caller-saved registers, you need to save the values of two callee-saved registers on entry and restore them on exit. And that's true whether your architecture has eight registers, or 16, or 32. Similarly, if a value needs to survive across a subroutine call, you need to use a callee-saved register for it, or store it in memory.
So the cost of invoking a subroutine doesn't depend on the number of architectural registers, I think.
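A small C sketch of that argument, assuming the x86-64 System V ABI purely for concreteness (the machines Dijkstra had in mind obviously differed): what forces a save/restore is a value living across a call, not the size of the register file.

    /* A stand-in external call; per the ABI it may clobber every
     * caller-saved register. */
    long helper(long x) { return x ^ 0x5bd1e995; }

    long survives_a_call(long a, long b)
    {
        /* t must remain live across the call, so the compiler either
         * parks it in a callee-saved register (one push/pop pair in the
         * prologue and epilogue) or spills it to a stack slot.  Neither
         * cost grows just because the ISA exposes more registers. */
        long t = a * 31 + b;
        long h = helper(t);
        return t + h;
    }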
The exception is that, if you don't have enough registers, you need to store variables in memory, and variables in memory don't need to be saved and restored. You could even posit cases where moving variables from registers into memory makes your program faster because you don't have to spend time saving and restoring a register. (Probably not on a single-issue RISC, because you need extra loads and stores, but you could imagine machines whose memory access is fast enough for this.)
But, if that's the case for a particular variable, nothing stops you from using memory for it, perhaps occasionally holding its value in a caller-saved register temporarily. This saves you the cost of saving and restoring the register you could have used for it. It's as if the register doesn't exist.
So I'm skeptical of the thesis that large architectural register files make subroutine calls slower. At most, they fail to speed up subroutine calling by as much as they speed up other parts of your code.
On the other hand, I don't think I know anything relevant that Dijkstra didn't know in 01972. So, am I missing something?
(All this is assuming a constant number of instructions per second, although in fact stack architectures like the MuP21 might be able to hit a higher clock speed than more conventional designs.)
Large register files do slow down preemptive context switching (multithreading), because the operating system must save even registers that the problem-state code is not using, even caller-saved registers. And large sets of callee-saved registers also slow down cooperative context switching.
Apparently the ISO/IEC 1539-1:2023 [1] committee didn't get the memo.
[1] https://www.iso.org/standard/82170.html
To be honest it wasn't that bad back then either.
A good refresher was reading "Modern FORTRAN: Building Efficient Parallel Applications".
https://www.manning.com/books/modern-fortran