Isn't that in essence very similar to Quantization-Aware Training (QAT)?
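(For reference: QAT simulates low precision in the forward pass so the weights adapt to it during training. A minimal sketch of that idea in PyTorch; the function name and bit-width are illustrative, not from the post.)

```python
import torch

def fake_quantize(w: torch.Tensor, num_bits: int = 8) -> torch.Tensor:
    # Simulate low-precision weights in the forward pass while keeping
    # full-precision gradients (straight-through estimator).
    qmax = 2 ** (num_bits - 1) - 1
    scale = w.abs().max() / qmax
    w_q = torch.clamp(torch.round(w / scale), -qmax - 1, qmax) * scale
    # Forward computes with w_q; backward treats the rounding as identity.
    return w + (w_q - w).detach()
```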
The supposedly dynamic/temporal nature of the model doesn't seem to survive GPU execution: it collapses into a single static computation, equivalent to just applying a pre-calculated sparsity mask.
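A toy sketch of that collapse (hypothetical code, not the post's actual kernels, and assuming the per-step gating carries no recurrent state): if the gate depends only on the current input, the step-by-step "temporal" loop and a single precomputed-mask matmul give identical results.

```python
import torch

def stepwise_forward(x_seq, weight, threshold=0.0):
    # "Temporal" version: recompute an activity gate at every time step.
    outputs = []
    for x in x_seq:                        # x_seq: (T, N), one step at a time
        gate = (x > threshold).float()     # which units are "active" this step
        outputs.append((x * gate) @ weight.T)
    return torch.stack(outputs)

def masked_forward(x_seq, weight, threshold=0.0):
    # Static version: the gates depend only on the inputs, so the whole
    # mask can be precomputed and the time loop collapses into one matmul.
    mask = (x_seq > threshold).float()     # (T, N) sparsity mask, in one shot
    return (x_seq * mask) @ weight.T

x_seq = torch.randn(16, 64)
weight = torch.randn(32, 64)
assert torch.allclose(stepwise_forward(x_seq, weight), masked_forward(x_seq, weight))
```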
Perhaps a bit cynical of me, but it feels like wrapping standard sparse computation and operator fusion in complicated biological jargon...
There is something interesting in this post, namely that it's based on non-Nvidia GPUs, in this case MetaX [2]. I don't know how competitive MetaX are today, but I would not bet against China in the longer term.
[1] https://cointelegraph.com/news/neuromorphic-computing-breakt...
[2] https://en.wikipedia.org/wiki/MetaX
[3] K. S. Kendler, A history of metaphorical brain talk in psychiatry. https://www.nature.com/articles/s41380-025-03053-6
https://en.wikipedia.org/wiki/MetaX
China has GPU manufacturers that nobody in the West has ever heard of.