The most immediate benefits for me are easier inspection and searching of the code in any text editor, and infinitely nicer version control. But it also lets you run and import the Notebook as if it were a Python script!
While writing my thesis I have also been experimenting with a Spyder-like workflow in VS Code, where you insert "# %%" markers to separate code cells and run them in an IPython console. It had its perks, like the better IntelliSense, and it gave a similar mix of interactivity and a runnable file. Not as good on the markup front, though.
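For readers who haven't seen the format, a minimal sketch of what such a cell-based file looks like (the cell contents here are invented for illustration):

```python
# Each "# %%" marker starts a cell that VS Code (or Spyder) can send to
# an IPython console on its own, while the file remains a plain runnable
# Python script.

# %% Load the data (cell 1)
import statistics

samples = [2.1, 2.5, 1.9, 2.4]

# %% Compute summary statistics (cell 2)
mean = statistics.mean(samples)
print(f"mean = {mean:.2f}")

# %% [markdown]
# Cells tagged [markdown] hold prose, but it is rendered as plain
# comments in the editor, which is the "markup front" weakness
# mentioned above.
```

Since the markers are just comments, the same file runs unchanged with `python script.py`.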
I feel like it needs its own IDE, because now apart from the coding abstractions you also have named snippets.
It happens in some forms of Bank Python, but there's not much of it going on in the public/open-source world. I think because the advantages for a lone developer are small, and it's hard to maintain for an internet-based project since globally distributed databases are still expensive, bad, or both.
Maybe a tool like the one presented here could work as a language server proxy to the underlying language's server. The presence of literate text alone doesn't seem to be the main issue, it's getting the code portions parsed, checked, and annotated with references that matters.
Obviously, the type checking will be a bit more limited for code snippets you haven't finished. But especially for image-based environments, it should handle everything already loaded in the image just fine.
CWEB, which is the one that Knuth prefers, even supports step debugging. Has supported it for decades, at this point.
https://github.com/WillAdams/gcodepreview/blob/main/literati...
which allows me to have an ordinary .tex file:
https://github.com/WillAdams/gcodepreview/blob/main/gcodepre...
which outputs multiple .py and .scad files and generates a .pdf with nice listings-based code blocks, ToC, index, hyperlinks, &c.:
https://github.com/WillAdams/gcodepreview/blob/main/gcodepre...
The notable downsides are that the .sty and .tex files have to be customized for the filenames which one can output, and I haven't been able to get auto-line numbering working between code blocks, so one has to manually manage the counters.
cf leo editor for literate programming in python [0]
Yes, markdown has code blocks, and notebooks have embedded code in documentation since Mathematica in the 1980's. It is possible to get IDE support in such blocks.
But for literate programming, weaving/tangling sources is needed to escape the file structure, particularly when the build system imposes its own logic, and sometimes one needs to navigate into the code. Leo shows how complicated the semantics of weaving can get.
Eclipse as an IDE was great because their editor component made it easy to manage the trick of one editor for many sources, and their markers provided landmarks for cross-source navigation and summaries.
Late 80's, very late ... but the concept of "notebooks" predates Mathematica by at least a decade (it was very common to embed structure in source code files with markup).
It’s interesting that using LLMs is making very explicit that “someone” needs to read the code and understand it. So having good comments and making code readable is great both for AI and humans
1: “Writing documentation for AI: best practices” https://news.ycombinator.com/item?id=44311217
I avoid code comments where I can because English is way less precise than code, it's an extra chore to keep the comments and code in sync, and when the comments and code inevitably get out of sync it's confusing which one is the source of truth. Does literate programming sidestep this somehow? Or have benefits that outweigh this?
I think where it shines, is where it helps you break the code up, without having to break it up in a way that makes sense for the computer. Show an outline, but then drill into a section. The overall function can then be kept as a single unit, and you can sort of punt on sub sections. I tried this just recently in https://taeric.github.io/many_sums.html. I don't know that I succeeded, necessarily. Indeed, I think I probably should have broken things into more sections. That said, I did find that this helped me write the code more than I expected it to. (I also was very surprised at how effective the goto style of thinking was... Much to my chagrin.)
I will have to look again at some of the code I've read this way.
To directly answer the question of if it helped keep the documentation in sync, as it were, that is tough. I think it helps keep the code in a section directly related to the documentation for that section. All too often, the majority of code around something is not related to what you were wanting to do. Even the general use of common code constructs gets in the way of reading what you were doing. Literate programming seems the best way I have seen to give the narrative the ability to say "here is the outline necessary for a function" and then "this particular code is to do ..." Obviously, though, it is no panacea.
Literate programming seems fine for heavily algorithmic stuff when there's a lot of explaining to do compared to the amount of code and the code is linear, but I was more thinking about how it works for common web apps where it's lots of mundane code that criss-crosses between files.
- gcodepreview.py (gcpy) --- the Python functions and variables
- pygcodepreview.scad (pyscad) --- the Python functions wrapped in OpenSCAD
- gcodepreview.scad (gcpscad) --- OpenSCAD modules and variables
as explained in: https://github.com/WillAdams/gcodepreview/blob/main/gcodepre... and it worked quite well (far better than the tiled set of three text editor windows which I was using at first) and I find the ability to sequence the code for the separate files in a single master file very helpful.
I say oddly, as I don't think I've seen it done for common web apps. I suspect that is largely because frameworks have not been a stable foundation to build on in a long time?
I can't help but think the templates of old were a hint at how it would have worked fine? Have a section of the literate code that outlines the general template of a file, and where the old "your code goes here" comments used to denote where you add your logic, there is instead another section that you can discuss on its own. (Anyone else remember those templates? They were common in app builders, if I recall correctly.)
Usually the problem with comments is that there are too few of them.
I've worked in a few code bases where many of the comments could be removed by using better function names, better variables names, and breaking complex conditionals into named subexpressions/variables.
And there was a fair chance comments were misleading or noise e.g. `/* send the record for team A */ teamB.send(...`, `/* if logged in and on home page */ if (!auth.user && router.name === 'home') ...`, `/* connect to database */ db.connect()`. I'd much rather comments were used as a last resort as they're imprecise, can be bandaids for code that's hard to read, and they easily get out of sync with the code because they're not executed/tested.
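The kind of refactor described above might look like this (a hypothetical sketch; the function and variable names are invented):

```python
# Before: the intent lives in a comment that can silently drift out of
# sync with the condition it describes.
def render_before(user, page):
    # if logged in and on home page
    if user is not None and page == "home":
        return "welcome banner"
    return "login prompt"

# After: the intent lives in named subexpressions, so the "comment" is
# code that gets executed and tested.
def render_after(user, page):
    is_logged_in = user is not None
    on_home_page = page == "home"
    if is_logged_in and on_home_page:
        return "welcome banner"
    return "login prompt"
```

The behavior is identical, but in the second version the explanation can't rot independently of the logic.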
A block of comments to explain high-level details of complex/important code, or comments to explain the why or gotchas behind non-obvious code are useful though.
But even then my experience doesn’t match yours. So you have some code. Who decided it would be that way? Do you have a picture of how it should look? Can you share a link to where you got this information? What problem led you to do this non-obvious thing?
This had nothing to do with literate programming. I could as well ask "how often is the English in comments repeating what's already written in code?"
Yeah, it can look a bit repetitive if the code is already clear, but the context of why a thing is being done is still valuable. In the modern era with LLM tools, I'm sure it could be even more powerful.
Is that because of literate programming, or is that because practicing literate programming made you focus more on writing high quality code and docs?
But the specifics of the flow aside, it's the mindset difference that makes it all feel special. The docs are the primary artifact. The code is secondary.
In an era of Copilot-style inline suggestions, taking the time to write a lengthy description effectively feeds the prompt to get a better output.
I can definitely see such a practice improving LLM output.
Meanwhile, there are programmers that think comments are a "code smell".
"Literate programming (LP) offers 2 classical operations:
Tangle: Extract the source code blocks and generate real working code files for further compilation or execution, eventually outside of Emacs.
Weave: Export the whole Org file as literate, human-readable documentation (generally in HTML or LaTeX)."
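As a rough illustration of the tangle operation quoted above (a toy sketch, not org-babel itself — the document and helper are invented, and real org-babel handles far more):

```python
import re

# A toy Org-style document: prose interleaved with named source blocks,
# each declaring the file it should be tangled into.
ORG_DOC = """\
* Greeting
Some prose explaining the code.
#+begin_src python :tangle hello.py
print("hello")
#+end_src
More prose.
#+begin_src python :tangle hello.py
print("world")
#+end_src
"""

def tangle(text):
    """Collect src-block bodies, grouped by their :tangle target file."""
    files = {}
    pattern = re.compile(
        r"#\+begin_src \S+ :tangle (\S+)\n(.*?)#\+end_src",
        re.DOTALL,
    )
    for target, body in pattern.findall(text):
        files.setdefault(target, []).append(body)
    return {name: "".join(parts) for name, parts in files.items()}

# Blocks aimed at the same file are concatenated in document order.
print(tangle(ORG_DOC)["hello.py"])
```

Weaving is the complementary direction: exporting the whole document, prose and code together, as human-readable HTML or LaTeX.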
[1] https://org-babel.readthedocs.io/en/latest/

- https://xenodium.com/ob-swiftui-updates
- https://github.com/xenodium/ob-dall-e-shell
I don't mean that to sound dismissive. This might be the most popular tool out there for all I know, and so well done that it hasn't needed any updates in ages.
All of which shows why there are so many Literate Programming solutions/implementations --- it's a fairly simple problem (though I'm a lame programmer, so had to get help on tex.stackexchange) and it's easy to roll a solution to scratch a particular itch.
One often overlooked cute aspect of LP is how a digression on code you tried, but chose not to incorporate, is first class, with the same highlighting etc. as active code. It isn’t relegated to a monochrome, non-syntax-highlighted, awkwardly indented block comment. I find this very appropriate, and it encourages documenting “tried but failed” experiments, which can be incredibly useful.
Edit: another really cool benefit of lp: the “examples” chapter = tests. You can tangle the examples into a test script and run them in CI. Very satisfying.
I would not say it is a good way. The truth is that code cannot be self-documenting, so documentation is necessary. But literate documentation is not the right way to do it.
First, it is too crafty. Good documentation should be formal and have a definite, standard structure that is repeated across all similar projects. It should also have an encyclopedia-like form: short standalone pieces interlinked into a bigger whole, so you can start anywhere instead of reading from start to end.
Second, it is too programmatic. It has real code. But real code is not good to describe what is important. Try porting METAFONT to something else. METAFONT code is fully documented. In Pascal. And you want to draw it with JavaScript. It would be way more helpful to describe these algorithms without tying them to Pascal or any other specific notation. And then, once they are described in this form, add a document that maps them to a Pascal implementation.
That comment about beginners is a nod to the sibling comment explaining how it is useful for a beginner. I've never found it useful myself, but I can see the value.
"TeX: The Program" is a joy to read.
You could theoretically write a literate program that is nothing but code, if the code is so readable that it doesn’t need explaining. The distinction is that it is “human first” over “computer first”.
Referring to specific literate programs would make this comment easier to believe. Even my very first literate programs avoided this trait, so I really cannot relate.
It sounds like you are describing trivial in-line comments instead of chunks of programs interspersed with explanations.
When I write Literate Programs, it's mostly for future me so that I can remember why a particular approach was taken, or what the significance of two slightly differently named variables is and why they are not interchangeable. If it's for others to use, then user documentation is a specific section of the program code, and possibly a totally separate document (only written after the fact, when the UI and so forth are stable enough that things won't change).
FWIW, I am working to reimplement parts of METAFONT in my current project (need the curves, and want an implementation which will also allow me to write out a .mp file) and I'm finding _METAFONT: The Program_ very helpful.
I have a reimplementation of parts of METAFONT as a hobby project. In fact, it is no longer METAFONT; it is a language based on METAFONT. However, I used the same Hobby's algorithm to generate curves. It understands pens and paths and renders them to an OpenGL texture. The results are not compatible with Knuth's METAFONT, as I use floating-point numbers instead of fixed-point arithmetic. It is still under development and needs some cleaning, but if what you are developing is for personal use or is free software compatible with the GNU Affero GPL, perhaps parts of the code, or even the whole, could be of use.
And nowadays, maybe LLMs could check for inconsistencies between the docs and the code?
My script doesn't try to shuffle code around, which has the advantage that if you are familiar with the general structure of the code, the documentation follows the same structure. (This may not be the best order to explain the theory of the program, as is noted in several introductions to literate programming).
Instead of code blocks, my little shell script has a per-line approach, where each code line is preceded by the name of the file to which it is extracted. This approach allows me to name a variant immediately after the filename, so that I can code alternative lines, and decide at the time of extraction which sets of lines to use. This is also useful for extracting multiple very similar files from a single markdown source. This use of variants has been very effective in supporting alternative implementations, since I can quickly switch between them by the list of variants I give the tool to extract.
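A sketch of that per-line scheme (the prefix syntax and names here are invented to illustrate the idea, not the commenter's actual tool): each line carries its target file plus an optional variant tag, and extraction filters on both.

```python
# Each source line is "filename[.variant]: code". Untagged lines always
# belong to the file; tagged lines are kept only when their variant is
# among the enabled ones, which lets one markdown source yield several
# alternative implementations.
SOURCE = """\
main.py.fast: def compute(): return 2
main.py.slow: def compute(): return 40 + 2
main.py: print(compute())
"""

def extract(source, filename, variants):
    out = []
    for line in source.splitlines():
        prefix, _, code = line.partition(": ")
        if prefix == filename:
            out.append(code)
        elif prefix.startswith(filename + "."):
            variant = prefix[len(filename) + 1:]
            if variant in variants:
                out.append(code)
    return "\n".join(out)

print(extract(SOURCE, "main.py", {"fast"}))
```

Switching implementations is then just a matter of changing the set of enabled variants at extraction time.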
tony_cannistra•7mo ago
[1]: https://en.wikipedia.org/wiki/Noweb
onair4you•7mo ago
dunham•7mo ago
I don't remember why I selected nuweb, other than it worked with any language, but it looks like it was inspired by noweb. I had learned about literate programming from studying TeX.
zimpenfish•7mo ago
(The username being `partingr` suggests it was some time late 92 to mid 95 whilst I was at cs.man.ac.uk)
https://github.com/nrnrnr/noweb/tree/master/contrib/partingr