frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: RunMat – runtime with auto CPU/GPU routing for dense math

https://github.com/runmat-org/runmat
9•nallana•1h ago
Hi, I’m Nabeel. In August I released RunMat as an open-source runtime for MATLAB code that was already much faster than GNU Octave on the workloads I tried. https://news.ycombinator.com/item?id=44972919

Since then, I’ve taken it further with RunMat Accelerate: the runtime now automatically fuses operations and routes work between CPU and GPU. You write MATLAB-style code, and RunMat runs your computation across CPUs and GPUs for speed. No CUDA, no kernel code.

Under the hood, it builds a graph of your array math, fuses long chains into a few kernels, keeps data on the GPU when that helps, and falls back to CPU JIT / BLAS for small cases.

On an Apple M2 Max (32 GB), here are some current benchmarks (median of several runs):

* 5M-path Monte Carlo * RunMat ≈ 0.61 s * PyTorch ≈ 1.70 s * NumPy ≈ 79.9 s → ~2.8× faster than PyTorch and ~130× faster than NumPy on this test.

* 64 × 4K image preprocessing pipeline (mean/std, normalize, gain/bias, gamma, MSE) * RunMat ≈ 0.68 s * PyTorch ≈ 1.20 s * NumPy ≈ 7.0 s → ~1.8× faster than PyTorch and ~10× faster than NumPy.

* 1B-point elementwise chain (sin / exp / cos / tanh mix) * RunMat ≈ 0.14 s * PyTorch ≈ 20.8 s * NumPy ≈ 11.9 s → ~140× faster than PyTorch and ~80× faster than NumPy.

If you want more detail on how the fusion and CPU/GPU routing work, I wrote up a longer post here: https://runmat.org/blog/runmat-accel-intro-blog

You can run the same benchmarks yourself from the GitHub repo in the main HN link. Feedback, bug reports, and “here’s where it breaks or is slow” examples are very welcome.

Comments

constantcrying•17m ago
Writing a (somewhat?) Matlab compatible interpreter and runtime, which targets GPU and CPU simultaneously, is certainly impressive.

But, who is this for? Matlab users? Python users? Julia users? Do you have an aim with this project or is it just for fun?

salvesefu•2m ago
From the Website: "If you write math in MATLAB and hit performance walls on CPU, RunMat is built for you."

Learning Music with Strudel

https://terryds.notion.site/Learning-Music-with-Strudel-2ac98431b24180deb890cc7de667ea92
144•terryds•6d ago•30 comments

Mistral 3 family of models released

https://mistral.ai/news/mistral-3
266•pember•1h ago•78 comments

Nixtml: Static website and blog generator written in Nix

https://github.com/arnarg/nixtml
38•todsacerdoti•1h ago•6 comments

Addressing the adding situation

https://xania.org/202512/02-adding-integers
197•messe•5h ago•59 comments

Advent of Compiler Optimisations 2025

https://xania.org/202511/advent-of-compiler-optimisation
244•vismit2000•7h ago•33 comments

Show HN: Marmot – Single-binary data catalog (no Kafka, no Elasticsearch)

https://github.com/marmotdata/marmot
42•charlie-haley•1h ago•7 comments

YesNotice

https://infinitedigits.co/docs/software/yesnotice/
44•surprisetalk•1w ago•24 comments

A series of vignettes from my childhood and early career

https://www.jasonscheirer.com/weblog/vignettes/
87•absqueued•4h ago•51 comments

Python Data Science Handbook

https://jakevdp.github.io/PythonDataScienceHandbook/
84•cl3misch•4h ago•18 comments

Peter Thiel's Apocalyptic Worldview Is a Dangerous Fantasy

https://jacobin.com/2025/11/peter-thiel-palantir-apocalypse-antichrist
81•robtherobber•30m ago•27 comments

Apple Releases Open Weights Video Model

https://starflow-v.github.io
342•vessenes•11h ago•108 comments

What will enter the public domain in 2026?

https://publicdomainreview.org/features/entering-the-public-domain/2026/
395•herbertl•13h ago•252 comments

YouTube increases FreeBASIC performance (2019)

https://freebasic.net/forum/viewtopic.php?t=27927
120•giancarlostoro•2d ago•23 comments

I Designed and Printed a Custom Nose Guard to Help My Dog with DLE

https://snoutcover.com/billie-story
13•ragswag•2d ago•1 comments

Comparing AWS Lambda ARM64 vs. x86_64 Performance Across Runtimes in Late 2025

https://chrisebert.net/comparing-aws-lambda-arm64-vs-x86_64-performance-across-multiple-runtimes-...
91•hasanhaja•7h ago•41 comments

DeepSeek-v3.2: Pushing the frontier of open large language models [pdf]

https://huggingface.co/deepseek-ai/DeepSeek-V3.2/resolve/main/assets/paper.pdf
906•pretext•1d ago•432 comments

India orders smartphone makers to preload state-owned cyber safety app

https://www.reuters.com/sustainability/boards-policy-regulation/india-orders-mobile-phones-preloa...
828•jmsflknr•1d ago•609 comments

Beej's Guide to Learning Computer Science

https://beej.us/guide/bglcs/
266•amruthreddi•2d ago•96 comments

Zig's new plan for asynchronous programs

https://lwn.net/SubscriberLink/1046084/4c048ee008e1c70e/
69•messe•2h ago•54 comments

Fallout 2's Chris Avellone describes his game design philosophy

https://arstechnica.com/gaming/2025/12/fallout-2-designer-chris-avellone-recalls-his-first-forays...
19•LaSombra•57m ago•3 comments

An LED panel that shows the aviation around you

https://github.com/AxisNimble/TheFlightWall_OSS
58•yzydserd•5d ago•11 comments

How Brian Eno Created Ambient 1: Music for Airports (2019)

https://reverbmachine.com/blog/deconstructing-brian-eno-music-for-airports/
138•dijksterhuis•9h ago•74 comments

Proximity to coworkers increases long-run development, lowers short-term output (2023)

https://pallais.scholars.harvard.edu/publications/power-proximity-coworkers-training-tomorrow-or-...
107•delichon•2h ago•74 comments

Show HN: RunMat – runtime with auto CPU/GPU routing for dense math

https://github.com/runmat-org/runmat
9•nallana•1h ago•2 comments

Lazier Binary Decision Diagrams for set-theoretic types

https://elixir-lang.org/blog/2025/12/02/lazier-bdds-for-set-theoretic-types/
21•tvda•4h ago•2 comments

Rootless Pings in Rust

https://bou.ke/blog/rust-ping/
95•bouk•9h ago•68 comments

Tom Stoppard has died

https://www.bbc.com/news/articles/c74xe49q7vlo
149•mstep•2d ago•46 comments

Reverse math shows why hard problems are hard

https://www.quantamagazine.org/reverse-mathematics-illuminates-why-hard-problems-are-hard-20251201/
147•gsf_emergency_6•14h ago•30 comments

After Windows Update, Password icon invisible, click where it used to be

https://support.microsoft.com/en-us/topic/august-29-2025-kb5064081-os-build-26100-5074-preview-3f...
143•zdw•14h ago•148 comments

Codex, Opus, Gemini try to build Counter Strike

https://www.instantdb.com/essays/agents_building_counterstrike
269•stopachka•3d ago•107 comments