The von Neumann bottleneck is impeding AI computing?

https://research.ibm.com/blog/why-von-neumann-architecture-is-impeding-the-power-of-ai-computing

24•Nezteb•1h ago

Comments

mwkaufma•1h ago

The old saw from corporations that want to sell you a locked-down alternative to general-purpose computing -- now for "AI"

bahmboo•49m ago

Huh, I did not get that from the article. The main takeaway for me was doing ALU operations in memory resulting in massive energy savings. There is still a von Neumann architecture running the show.

nyrikki•1h ago

Nit,

ARM processors primarily use a modified Harvard architecture, including the Raspberry Pi Pico.

NooneAtAll3•1h ago

This isn't about the Harvard/von Neumann split/no-split between i-cache and d-cache.

I think this post is more about... compute in memory? if I got it right?

danudey•2m ago

Sort of? It's about locality of data; this has often been a bottleneck, which is why we have CPU caches to keep data extremely close to the CPU cores with practically zero latency and throughput limitations compared to fetching from main memory. Unfortunately now we're shuffling terabytes of data through our algorithms and the CPU spends a huge amount of its time waiting for the next batch of data to come in through the pipe.

This is, IIRC, part of why Apple's M-series chips are as performant as they are: they not only have a unified memory architecture which eliminates the need to copy data from CPU main memory to GPU or NPU main memory to operate on it (and then copy the result back) but the RAM being on the package means that it's slightly "more local" and the memory channels can be optimized for the system they're going to be connected to.
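
To put rough numbers on the "waiting for data" point, here is a minimal roofline-style sketch in Python. The hardware figures (100 TFLOP/s peak compute, 1 TB/s memory bandwidth) and the function names are made up for illustration; they are not from the article or any specific chip. It assumes each weight is read from memory exactly once and ignores traffic for the input and output vectors.

```python
# Roofline-style estimate for a matrix-vector multiply, the core
# operation when an LLM generates one token at a time.
# All hardware numbers below are hypothetical.

def arithmetic_intensity_matvec(rows: int, cols: int, bytes_per_weight: int) -> float:
    """FLOPs performed per byte of weights fetched from memory."""
    flops = 2 * rows * cols                       # one multiply and one add per weight
    bytes_moved = rows * cols * bytes_per_weight  # assumption: each weight read once
    return flops / bytes_moved

def attainable_flops(peak_flops: float, mem_bandwidth: float, intensity: float) -> float:
    """Roofline model: throughput is capped by compute or by memory traffic."""
    return min(peak_flops, mem_bandwidth * intensity)

PEAK_FLOPS = 100e12    # hypothetical accelerator: 100 TFLOP/s
MEM_BANDWIDTH = 1e12   # hypothetical memory system: 1 TB/s

# 4096 x 4096 layer with 16-bit (2-byte) weights
intensity = arithmetic_intensity_matvec(4096, 4096, 2)
perf = attainable_flops(PEAK_FLOPS, MEM_BANDWIDTH, intensity)
utilization = perf / PEAK_FLOPS

print(f"intensity:   {intensity:.1f} FLOP/byte")
print(f"attainable:  {perf / 1e12:.1f} TFLOP/s")
print(f"utilization: {utilization:.0%}")
```

Under these assumed numbers the arithmetic units sit idle about 99% of the time, which is the gap that caches, on-package RAM, and compute-in-memory designs all try to close from different directions.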

bobmcnamara•54m ago

Nit: the RP2040 is von Neumann. There's only one AHB port on the M0.

Edit: see also ARM7TDMI, Cortex-M0/M0+/M1, and probably a few others. All the big stuff is modified Harvard or, very rarely, pure Harvard.

Animats•1h ago

Actual result: "This new process promises to increase the number of optical fibers that can be connected at the edge of a chip, a measure known as beachfront density, by six times."

Faster interconnects are always nice, but this is more like routine improvement.

bahmboo•51m ago

"In recent inference tests run on a 3-billion-parameter LLM developed from IBM’s Granite-8B-Code-Base model, NorthPole was 47 times faster than the next most energy-efficient GPU and was 73 times more energy efficient than the next lowest latency GPU."

It's also fascinating that they are experimenting with analog memory because it pairs so well with model weights

lomase•1h ago

Why don't they use AI to create a new architecture?

observationist•59m ago

https://github.com/GAIR-NLP/ASI-Arch

This is being done, with great results so far. As models get better, architecture search, creation, and refinement improve, driving a reinforcement loop. At some point in the near future the big labs will likely start seeing significant returns from methods like this, translating into better and faster AI for consumers.

lomase•50m ago

I skimmed the repo and only found slop. Can you point out where I can find those new architectures you talk about?

jedberg•51m ago

Do you want SkyNet? That's how you get SkyNet.

amelius•45m ago

No, that's how you get SlopNet.

owyn•28m ago

If you follow the press release rabbit hole a few clicks, there's an article in Science describing the architecture in more detail:

https://www.science.org/doi/full/10.1126/science.adh1174

Also they've been working on this for 10+ years so it's not exactly new news.

stego-tech•24m ago

IBM initially leads with the more salient point (current architecture designs are hindering frontier computing concepts), then just kinda…relents into iterative improvement.

Which is fine! I am all for iterative improvements, it’s how we got to where we are today. I just wish more folks would start openly admitting that our current architecture designs are broadly based off “low hanging fruit” of early electronics and microprocessors, followed by a century of iterative improvements. With the easy improvements already done and universally integrated, we’re stuck at a crossroads:

* Improve our existing technologies iteratively and hope we break through some barrier to achieve rapid scaling again

* Accept that we cannot achieve new civilizational uplifts with existing technologies, and invest more capital into frontier R&D (quantum processing, new compute substrates, etc)

I feel like our current addiction to the AI CAPEX bubble is a desperate Hail Mary to validate our current tech as the only way forward, when in fact we haven’t really sufficiently explored alternatives in the modern era. I could very well be wrong, but that’s the read I get from the hardware side of things and watching us backslide into the 90s era of custom chips to achieve basic efficiency gains again.
