frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Blazing Matrix Products

https://panadestein.github.io/blog/posts/mp.html
34•Bogdanp•8h ago

Comments

imurray•2h ago
This post is for those interested high-performance matrix multiplication in BQN (an APL-like array language).

The main thing I got out of it was the footnotes, in particular: https://en.algorithmica.org/hpc/algorithms/matmul/ is a really nice post on fast matrix multiplication, and is a chapter of what looks like a nice online book.

bee_rider•1h ago
Does BQN just natively vectorize the code? I was surprised not to see anything about that.
icen•1h ago
Yes; the CBQN interpreter has a number of specialised vectorised codepaths. It picks a good one for the arrays at runtime.
mlochbaum•34m ago
The relevant operations for matrix multiply are leading-axis extension, shown near the end of [0], and Insert +˝ shown in [1]. Both for floats; the leading-axis operation is × but it's the same speed as + with floating-point SIMD. We don't handle these all that well, with needless copying in × and a lot of per-row overhead in +˝, but of course it's way better than scalar evaluation.

[0] https://mlochbaum.github.io/bencharray/pages/arith.html

[1] https://mlochbaum.github.io/bencharray/pages/fold.html

mlochbaum•20m ago
And the reason +˝ is fairly fast for long rows, despite that page claiming no optimizations, is that ˝ is defined to split its argument into cells, e.g. rows of a matrix, and apply + with those as arguments. So + is able to apply its ordinary vectorization, while it can't in some other situations where it's applied element-wise. This still doesn't make great use of cache and I do have some special code working for floats that does much better with a tiling pattern, but I wanted to improve +˝ for integers along with it and haven't finished those (widening on overflow is complicated).

Show HN: I'm an airline pilot – I built interactive graphs/globes of my flights

https://jameshard.ing/pilot
261•jamesharding•1h ago•57 comments

Qwen VLo: From "Understanding" the World to "Depicting" It

https://qwenlm.github.io/blog/qwen-vlo/
8•lnyan•9m ago•0 comments

The Power and Beauty of Incrementalism

https://supernuclear.substack.com/p/the-power-and-beauty-of-incrementalism
29•surprisetalk•2h ago•8 comments

I Switched from Flutter and Rust to Rust and Egui

https://jdiaz97.github.io/greenblog/posts/flutter_to_egui/
141•jdiaz97•3d ago•68 comments

Parameterized types in C using the new tag compatibility rule

https://nullprogram.com/blog/2025/06/26/
81•ingve•9h ago•24 comments

PJ5 TTL CPU

https://pj5cpu.wordpress.com/
40•doener•7h ago•1 comments

Show HN: Zenta – Mindfulness for Terminal Users

https://github.com/e6a5/zenta
84•ihiep•6h ago•16 comments

Moonbase Alpha: That time NASA made a meme video game

https://www.spacebar.news/moonbase-alpha-nasa-video-game/
52•todsacerdoti•3d ago•12 comments

AlphaGenome: AI for better understanding the genome

https://deepmind.google/discover/blog/alphagenome-ai-for-better-understanding-the-genome/
476•i_love_limes•1d ago•156 comments

Launch HN: Issen (YC F24) – Personal AI language tutor

291•mariano54•1d ago•251 comments

XSLT – Native, zero-config build system for the Web

https://github.com/pacocoursey/xslt
312•_kush•9h ago•227 comments

Sailing the fjords like the Vikings yields unexpected insights

https://arstechnica.com/science/2025/06/this-archaeologist-built-a-replica-boat-to-sail-like-the-vikings/
92•pseudolus•3d ago•23 comments

Show HN: Sink – Sync any directory with any device on your local network

https://github.com/sirbread/sink
45•sirbread•8h ago•46 comments

Whitesmiths C compiler: One of the earliest commercial C compilers available

https://github.com/hansake/Whitesmiths-C-compiler
16•todsacerdoti•4d ago•5 comments

Calculating the Fibonacci numbers on GPU

https://veitner.bearblog.dev/calculating-the-fibonacci-numbers-on-gpu/
26•rbanffy•3d ago•13 comments

A Lisp adventure on the calm waters of the dead C (2021)

https://mihaiolteanu.me/language-abstractions
45•caned•3d ago•8 comments

Alternative Layout System

https://alternativelayoutsystem.com/scripts/#same-sizer
302•smartmic•19h ago•39 comments

The Effect of Noise on Sleep

https://www.empirical.health/blog/effect-of-noise-on-sleep/
38•brandonb•1h ago•18 comments

The time is right for a DOM templating API

https://justinfagnani.com/2025/06/26/the-time-is-right-for-a-dom-templating-api/
171•mdhb•19h ago•156 comments

Blazing Matrix Products

https://panadestein.github.io/blog/posts/mp.html
34•Bogdanp•8h ago•5 comments

Why is the Rust compiler so slow?

https://sharnoff.io/blog/why-rust-compiler-slow
234•Bogdanp•19h ago•277 comments

Starcloud can’t put a data centre in space at $8.2M in one Starship

https://angadh.com/space-data-centers-1
132•angadh•18h ago•211 comments

A lumberjack created more than 200 sculptures in Wisconsin's Northwoods

https://www.smithsonianmag.com/travel/when-a-lumberjacks-imagination-ran-wild-he-created-more-than-200-sculptures-in-wisconsins-northwoods-180986840/
72•noleary•12h ago•32 comments

VA Tech scientists are building a better fog harp

https://arstechnica.com/science/2025/06/these-va-tech-scientists-are-building-a-better-fog-harp/
28•PaulHoule•3d ago•12 comments

How much slower is random access, really?

https://samestep.com/blog/random-access/
96•sestep•3d ago•47 comments

Kea 3.0, our first LTS version

https://www.isc.org/blogs/kea-3-0/
105•conductor•18h ago•39 comments

Show HN: PILF, The ultimate solution to catastrophic oblivion on AI models

https://github.com/dmf-archive/PILF
4•NetRunnerSu•3h ago•0 comments

My Lights Run on Bash – Tomasz Kramkowski

https://kramkow.ski/article/2025/06/27/my_lights_run_on_bash.html
6•todsacerdoti•3h ago•0 comments

Snow - Classic Macintosh emulator

https://snowemu.com/
260•ColinWright•1d ago•86 comments

Show HN: Magnitude – Open-source AI browser automation framework

https://github.com/magnitudedev/magnitude
113•anerli•20h ago•38 comments