frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Ferrite – Markdown editor in Rust with native Mermaid diagram rendering

https://github.com/OlaProeis/Ferrite
25•OlaProis•1h ago•6 comments

Finding and fixing Ghostty's largest memory leak

https://mitchellh.com/writing/ghostty-memory-leak-fix
283•thorel•8h ago•60 comments

Show HN: Librario, a book metadata API that aggregates G Books, ISBNDB, and more

62•jamesponddotco•3h ago•22 comments

Show HN: I used Claude Code to discover connections between 100 books

https://trails.pieterma.es/
226•pmaze•10h ago•78 comments

Open Chaos: A self-evolving open-source project

https://www.openchaos.dev/
321•stefanvdw1•11h ago•65 comments

Datadog, thank you for blocking us

https://www.deductive.ai/blogs/datadog-thank-you-for-blocking-us
5•binarylogic•1d ago•8 comments

Show HN: Play poker with LLMs, or watch them play against each other

https://llmholdem.com/
75•projectyang•7h ago•42 comments

Eulogy for Dark Sky, a data visualization masterpiece (2023)

https://nightingaledvs.com/dark-sky-weather-data-viz/
381•skadamat•14h ago•162 comments

AI is a business model stress test

https://dri.es/ai-is-a-business-model-stress-test
180•amarsahinovic•10h ago•206 comments

Code and Let Live

https://fly.io/blog/code-and-let-live/
240•usrme•1d ago•78 comments

Show HN: mcpc – Universal command-line client for Model Context Protocol (MCP)

https://github.com/apify/mcp-cli
21•jancurn•4d ago•3 comments

1970 Paris, cut into a grid and photographed

https://paris1970.jeantho.eu/index.html
34•panic•1w ago•7 comments

A Year of Work on the Arch Linux Package Management (ALPM) Project

https://devblog.archlinux.page/2026/a-year-of-work-on-the-alpm-project/
4•susam•1h ago•0 comments

Private equity firms acquired more than 500 autism centers in past decade: study

https://www.brown.edu/news/2026-01-07/private-equity-autism-centers
209•hhs•3h ago•134 comments

Code Is Clay

https://campedersen.com/code-is-clay
37•ecto•7h ago•19 comments

Bob Weir has died

https://www.rollingstone.com/music/music-news/bob-weir-grateful-dead-dead-obituary-1234810106/
52•asix66•2h ago•7 comments

Kodbox: Open-source cloud desktop with multi-storage fusion and web IDE

https://github.com/kalcaddle/kodbox
6•indigodaddy•1h ago•0 comments

Show HN: GlyphLang – An AI-first programming language

21•goose0004•3h ago•13 comments

ChatGPT Health is a marketplace, guess who is the product?

https://consciousdigital.org/chatgpt-health-is-a-marketplace-guess-who-is-the-product/
246•yoaviram•2d ago•239 comments

ASCII-Driven Development

https://medium.com/@calufa/ascii-driven-development-850f66661351
103•_hfqa•2d ago•69 comments

Rats caught on camera hunting flying bats

https://scienceclock.com/rats-caught-on-camera-hunting-flying-bats-for-the-first-time/
78•akg130522•8h ago•9 comments

MetaFun: Compile Haskell-like code to C++ template metaprograms

https://gergo.erdi.hu/projects/metafun/
3•PaulHoule•1w ago•0 comments

Brands upset Buy For Me is featuring their products on Amazon without permission

https://www.modernretail.co/technology/brands-are-upset-that-buy-for-me-is-featuring-their-produc...
26•spenvo•4d ago•7 comments

I replaced Windows with Linux and everything's going great

https://www.theverge.com/tech/858910/linux-diary-gaming-desktop
576•rorylawless•11h ago•487 comments

UpCodes (YC S17) is hiring PMs, SWEs to automate construction compliance

https://up.codes/careers?utm_source=HN
1•Old_Thrashbarg•10h ago

Org Mode Syntax Is One of the Most Reasonable Markup Languages for Text (2017)

https://karl-voit.at/2017/09/23/orgmode-as-markup-only/
249•adityaathalye•17h ago•177 comments

New information extracted from Snowden PDFs through metadata version analysis

https://libroot.org/posts/going-through-snowden-documents-part-4/
286•libroot•15h ago•118 comments

Extracting books from production language models (2026)

https://arxiv.org/abs/2601.02671
39•logicprog•6h ago•8 comments

The eight ways that all the elements in the Universe are made (2021)

https://bigthink.com/starts-with-a-bang/8-ways-elements-made/
54•zdw•5d ago•30 comments

How wolves became dogs

https://www.economist.com/christmas-specials/2025/12/18/how-wolves-became-dogs
106•mooreds•5d ago•92 comments
Open in hackernews

A Tiny Boltzmann Machine

https://eoinmurray.info/boltzmann-machine
262•anomancer•8mo ago

Comments

vanderZwan•8mo ago
Lovely explanation!

Just FYI: mouse-scrolling is much too sensitive for some reason (I'm assuming it swipes just fine in mobile contexts, have not checked that). The result is that it jumped from first to last "page" and back whenever I tried scrolling. Luckily keyboard input worked so I could still read the whole thing.

djulo•8mo ago
that's soooo coool
nonrandomstring•8mo ago
This takes me back. 1990, building Boltzman machines and Perceptrons from arrays of void pointers to "neurons" in plain C. What did we use "AI" for back then? To guess the next note in a MIDI melody, and to recognise the shape of a scored note, minim, crotchet, quaver on a 5 x 9 dot grid. 85% accuracy was "good enough" then.
bwestergard•8mo ago
Did the output sound musical?
nonrandomstring•8mo ago
For small values of "music"? Really, no. But tbh, neither have more advanced "AI" composition experiments I've encountered over the years, Markov models, linear predictive coding, genetic/evolutionary algs, rule based systems, and now modern diffusion and transormers... they all lack the "spirit of jazz" [0]

[0] https://i.pinimg.com/originals/e4/84/79/e484792971cc77ddff8f...

gopalv•8mo ago
> recognise the shape of a scored note, minim, crotchet, quaver on a 5 x 9 dot grid

Reading music off a lined page sounds like a fun project, particularly to do it from scratch like 3Blue1Brown's number NN example[1].

Mix with something like Chuck[2] and you can write a completely clientside application with today's tech.

[1] - https://www.3blue1brown.com/lessons/neural-networks

[2] - https://chuck.stanford.edu/

nonrandomstring•7mo ago
Thanks for these links. You're right, I think computer-vision "sight reading" is now a fairly done deal. Very impressive progress in the past 30 years.
bbstats•8mo ago
anyone got an archived link?
tambourine_man•8mo ago
Typo

“They can be used for generating new data that…”

munchler•8mo ago
Another typo (or thinko) in the very first sentence:

"Here we introduce introduction to Boltzmann machines"

croemer•8mo ago
More typos (LLMs are really good at finding these):

"Press the "Run Simulation" button to start traininng the RBM." ("traininng" -> "training")

"...we want to derivce the contrastive divergence algorithm..." ("derivce" -> "derive")

"A visisble layer..." ("visisble" -> "visible")

anomancer•7mo ago
Author here, there was so many typos in first draft that hit hn, all fixed now
nayuki•8mo ago
Oh, this is a neat demo. I took Geoff Hinton's neural networks course in university 15 years ago and he did spend a couple of lectures explaining Boltzmann machines.

> A Restricted Boltzmann Machine is a special case where the visible and hidden neurons are not connected to each other.

This wording is wrong; it implies that visible neurons are not connected to hidden neurons.

The correct wording is: visible neurons are not connected to each other and hidden neurons are not connected to each other.

Alternatively: visible and hidden neurons do not have internal connections within their own type.

CamperBob2•8mo ago
Alternatively: visible and hidden neurons do not have internal connections within their own type.

I'm a bit unclear on how that isn't just an MLP. What's different about a Boltzmann machine?

Edit: never mind, I didn't realize I needed to scroll up to get to the introductory overview.

What 0xTJ's [flagged][dead] comment says about it being undesirable to hijack or otherwise attempt to reinvent scrolling is spot on.

nayuki•8mo ago
> I'm a bit unclear on how that isn't just a multi-layer perceptron. What's different about a Boltzmann machine?

In a Boltzmann machine, you alternate back and forth between using visible units to activate hidden units, and then use hidden units to activate visible units.

> What 0xTJ's [flagged][dead] comment says about it being undesirable to hijack or otherwise attempt to reinvent scrolling is spot on.

The page should be considered a slideshow that is paged discretely and not scrollable continuously. And there should definitely be no scrolling inertia.

sitkack•8mo ago
Fun article on David Ackley https://news.unm.edu/news/24-nobel-prize-in-physics-cited-gr...

Do check out his T2 Tile Project.

AstroJetson•8mo ago
The key takeaways are that there are lots of people involved with making these breakthroughs.

The value of grad students is often overlooked, they contribute so much and then later on advance the research even more.

Why does America look on research as a waste, when it has move everything so far?

macintux•8mo ago
It's more accurate to say that businesspeople consider research a waste in our quarter-by-quarter investment climate, since it generally doesn't lead to immediate gains.

And our current leadership considers research a threat, since science rarely supports conspiracy theorists or historical revisionism.

sfifs•8mo ago
More charitably, America's current government has an unusually large concentration of business people. Interestingly, they were elected as a vote for change by a population of non business people who were tired of the economic marginalization they suffered when their government consisted largely of non business people not once but twice. It will be interesting to see how this plays out.
richk449•7mo ago
Why do you say that America looks in research as a waste? We spend higher percentage of gdp on R&D than just about any other country in the world:

https://en.wikipedia.org/wiki/List_of_sovereign_states_by_re...

itissid•8mo ago
IIUC, we need gibbs sampling(to compute the weight updates) instead of using the gradient based forward and backward passes with today's NNetworks that we are used to. Any one understand why that is so?
ebolyen•8mo ago
Not an expert, but I have a bit of formal training on Bayesian stuff which handles similar problems.

Usually Gibbs is used when there's no directly straight-forward gradient (or when you are interested in reproducing the distribution itself, rather than a point estimate), but you do have some marginal/conditional likelihoods which are simple to sample from.

Since each visible node depends on each hidden node and each hidden node effects all visible nodes, the gradient ends up being very messy, so its much simpler to use Gibbs sampling to adjust based on marginal likelihoods.

oac•8mo ago
I might be mistaken, but I think this is partly because of the undirected structure of RBMs, so you can't build a computational graph in the same way as with feed-forward networks.
alimw•7mo ago
By "undirected structure" I assume you refer to the presence of cycles in the graph? I was taught to call such networks "recurrent" but it seems that that term has evolved to mean something slightly different. Anyway yeah, because of the cycles Gibbs sampling is key to the network's operation. One still employs gradient descent during training, but the procedure to calculate the gradient itself involves Gibbs sampling.

Edit: Actually was talking about the General Boltzmann Machine. For the Restricted Boltzmann Machine an approximation has been assumed which obviates the need for full Gibbs sampling during training. Then (quoting the article, emphasis mine) "after training, it can sample new data from the learned distribution using Gibbs sampling."

btickell•7mo ago
Thought I'd weigh in here as well, I believe Gibbs sampling is being used as a way to approximate the expectation over the model distribution. This value is required to compute the gradient of the log likelihood but integrating the distribution is intractable.

This is done in a similar way as you may use MCMC to draw a representative sample from a VAE. In the deep learning formulation of a neural network the gradient is estimated over batches of the dataset rather than over an explicitly modeled probability distribution.

pawanjswal•8mo ago
Love how this breaks down Boltzmann Machines—finally makes this 'energy-based model' stuff click!
BigParm•8mo ago
That font with a bit of margin looks fantastic on my phone specifically. Really nailing the minimalist look. What font is that?
mac9•8mo ago
"font-family: ui-sans-serif, system-ui, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol", "Noto Color Emoji";"

from the css so odds are it's whatever your browser or OS's default sans font is, in my case it's SF Pro which is an Apple font though it may vary if you use a non Apple device.

nickvec•8mo ago
> Here we introduce introduction to Boltzmann machines and present a Tiny Restricted Boltzmann Machine that runs in the browser.

nit: should "introduction" be omitted?

antidumbass•8mo ago
The section after the interactive diagrams has no left padding and thus runs off the screen on iOS.
rollulus•8mo ago
Now the real question: is it you enjoying that nice page or is it a Boltzmann Brain?

https://en.m.wikipedia.org/wiki/Boltzmann_brain

alganet•8mo ago
It doesn't matter.

It's Decartes demon all over again. Problem solved centuries ago. You can skin it however you want, it's the same problem.

taneq•8mo ago
We can’t really discuss Descartes without first explaining the horse.
alganet•8mo ago
I don't think there is anything to discuss about Decartes.
taneq•7mo ago
https://www.reddit.com/r/Jokes/comments/8bmuk8/a_horse_walks...

;)

nickvec•8mo ago
Great site! Would be cool to be able to adjust the speed at which the simulation runs as well.
thingamarobert•8mo ago
This is very well made, and so nostalgic to me! My whole PhD between 2012-16 was based on RBMs and I learned so much about generative ML through these models. Research has come so far and one doesn't hear much about them these days but they were really at the heart of the "AI Spring" back then.
tomrod•8mo ago
Great read!

One nit, a misspelling in the Appendix: derivce -> derive

oac•8mo ago
Nice and clean explanation!

It brings up a lot of memories! Shameless plug: I made a visualization of an RBM being trained years ago: https://www.youtube.com/watch?v=lKAy_NONg3g

dr_dshiv•8mo ago
My understanding is that the Harmonium (Smolensky) was the first restricted Boltzmann machine, but maximized “harmony” instead of minimizing “energy.” When Smolensky, Hinton and Rummelhart collaborated, they instead called it “goodness of fit.”

The harmonium paper [1] is a really nice read. Hinton obviously became the superstar and Smolensky wrote long books about linguistics.

Anyone know more about this history?

[1] https://stanford.edu/~jlmcc/papers/PDP/Volume%201/Chap6_PDP8...

Nevermark•8mo ago
I mistook the title for "A Tiny Boltzmann Brain"! [0]

My own natural mind immediately solved the conundrum. Surely this was a case where a very small model was given randomly generated weights and then tested to see if it actually did something useful!

After all, the smaller the model, the more likely simple random generation can produce something interesting, relative to its size.

I stand corrected, but not discouraged!

I propose a new class of model, the "Unbiased-Architecture Instant Boltzmann Model" (UA-IBM).

One day we will have quantum computers large enough to simply set up the whole dataset as a classical constraint on a model defined with N serialized values, representing all the parameters and architecture settings. Then let a quantum system with N qubits take one inference step over all the classical samples, with all possible parameters and architectures in quantum superposition, and then reduce the result to return the best (or near best) model's parametesr and architecture in classical form.

Anyone have a few qubits laying around that want to give this a shot? (The irony that everything is quantum and yet so slippery we can hardly put any of it to work yet.

(Sci-fi story premise: the totally possible case of an alien species that evolved one-off quantum sensor, which evolved into a whole quantum sensory system, then a nervous system, and subsequently full quantum intelligence out of the gate. What kind of society and technological trajectory would they have? Hopefully they are in close orbit around a black hole, so the impact of their explosive progress has not threatened us yet. And then one day, they escape their gravity well, and ...)

[0] https://en.wikipedia.org/wiki/Boltzmann_brain

immibis•8mo ago
That isn't how quantum computers work.
Nevermark•8mo ago
My understanding, which is far from certain, is that problems like this (try a large combination of solutions, cooperate between superpositions to identify the best, then set up the quantum outputs so when they are classically sampled, the best answer is the most likely sample obtained) are solved in four stages:

1. N^2 potential solutions encoded in superposition across N qubuts.

2. Each superposition goes through the quantum version of a normal inference pass, using each superpositions weights to iteratively process all the classical data, then calculates performance. (This is all done without any classical sampling.)

3. Cross-superposition communication that results in agreement on the desired version. This is the weakest part of my knowledge. I know that is an operation type that exists, but I don't know how circuits implement it. But the number of bits being operated on is small, just a performance measure. (This is also done without any classical sampling.)

4. Then the output is sampled to get classical values. This requires N * log2(N) circuit complexity, where N is the number of bits defining the chosen solution, i.e. parameters and architectural settings. This can be a lot of hardware, obviously, perhaps more than the rest of the hardware, given N will be very large.

Don't take anything I say here for granted. I have designed parts of such a circuit, using ideal quantum gates in theory, but not all of it. I am not an expert, but I believe that every step here is well understood by others.

The downside relative to other approaches: It does a complete search of the entire solutions space directly, so the result comes from reducing N^2 superpositions across N qubits, where for interesting models the N can be very large (billions of parameters x parameter bit width), to get N qubits. No efficiencies are obtained from gradient information or any other heuristic. This is the brute force method. So an upper bound of hardware requirements.

Another disadvantage, is since the solution was not obtained by following a gradient, the best solution might not be robust. Meaning, it might be a kind of vertex between robust solutions that manages to do best on the designed data, but small changes of input samples from the design data might not generalize well. This is less likely to be a problem if smoothing/regularization information is taken into account account for determining each superpositions performance.

ithkuil•8mo ago
Poor quantum beings. They don't have access to a computation model that exceeds the speeds of their own thoughts and they are forever doomed to be waiting a long time for computations to happen
anomancer•7mo ago
Author here! Thanks for all the comments, didn't expect this to hit the front page.

Cleaning up the abundance of typos, margin, and scroll issues now, thanks for pointing them out.

anomancer•7mo ago
Have cleaned up the typos, and it should look much better on mobile now