frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

All in on MatMul? Don’t Put All Your Tensors in One Basket!

https://www.sigarch.org/dont-put-all-your-tensors-in-one-basket-hardware-lottery/
16•matt_d•5d ago

Comments

deliciousturkey•1h ago
When GPUs started being used for deep learning (after AlexNet), GPUs were not at all matmul machines. They were machines that excel in most kinds of heavily parallel workloads. And this holds to this day, with the exception of the tensor core, which is an additional hardware block designed to accelerate this specific task.

Matrix multiplication didn't "win" because HW was designed for it. It won because matrix multiplication is a fundamental part of linear algebra and is very effective in deep learning (most kinds of functions you might want to write for deep learning can be expressed as a matmul). Acceleration of it became later. Additionally, matrix multiplication is a good fit for physics, as you can design the HW so that data movement is minimized, and most of the chip area and power are spent in actual computation, and not moving data around.

Fundamentally speaking, you also want to make your algorithm compatible with real-world physics. The need for heavy parallelism is required by the fact that you cannot physically make a fast chip that processes dependent operations. It's just not possible to propagate signals through transistors fast enough to make it possible. Even CPUs, even if they present a non-parallel programming environment, have to rely on expensive tricks like speculative out-of-order execution to make "sequential" code parallel to make it fast.

In general though, I personally would wish that chips would be made with taking programmability in mind. A fixed-function matrix multiplier might be slightly more efficient than a parallel computing chip with smaller matrix multipliers. But it would be significantly more programmable, and you can design much more interesting (and potentially more efficient) algorithms for it.

Don’t Look Up: Sensitive internal links in the clear on GEO satellites [pdf]

https://satcom.sysnet.ucsd.edu/docs/dontlookup_ccs25_fullpaper.pdf
303•dweekly•8h ago•75 comments

NanoChat – The best ChatGPT that $100 can buy

https://github.com/karpathy/nanochat
1193•huseyinkeles•19h ago•226 comments

Why Study Programming Languages

https://people.csail.mit.edu/rachit/post/why-study-programming-languages/
61•bhasi•4h ago•37 comments

Dutch government takes control of Chinese-owned chipmaker Nexperia

https://www.cnbc.com/2025/10/13/dutch-government-takes-control-of-chinese-owned-chipmaker-nexperi...
486•piskov•1d ago•423 comments

Why the push for Agentic when models can barely follow a simple instruction?

https://forum.cursor.com/t/why-the-push-for-agentic-when-models-can-barely-follow-a-single-simple...
138•fork-bomber•3h ago•124 comments

Palisades Fire suspect's ChatGPT history to be used as evidence

https://www.rollingstone.com/culture/culture-news/chatgpt-palisades-fire-suspect-1235443216/
119•quuxplusone•5d ago•87 comments

Copy-and-Patch: A Copy-and-Patch Tutorial

https://transactional.blog/copy-and-patch/tutorial
45•todsacerdoti•5h ago•6 comments

No science, no startups: The innovation engine we're switching off

https://steveblank.com/2025/10/13/no-science-no-startups-the-unseen-engine-were-switching-off/
495•chmaynard•21h ago•345 comments

Ultrasound is ushering a new era of surgery-free cancer treatment

https://www.bbc.com/future/article/20251007-how-ultrasound-is-ushering-a-new-era-of-surgery-free-...
23•1659447091•6d ago•5 comments

Sony PlayStation 2 fixing frenzy

https://retrohax.net/sony-playstation-2-fixing-frenzy/
130•ibobev•11h ago•57 comments

America is getting an AI gold rush instead of a factory boom

https://www.washingtonpost.com/business/2025/10/13/manufacturing-artificial-intelligence/
224•voxleone•19h ago•263 comments

First device based on 'optical thermodynamics' can route light without switches

https://phys.org/news/2025-10-device-based-optical-thermodynamics-route.html
143•rbanffy•5d ago•18 comments

Show HN: SQLite Online – 11 years of solo development, 11K daily users

https://sqliteonline.com/
394•sqliteonline•21h ago•126 comments

Modern iOS Security Features – A Deep Dive into SPTM, TXM, and Exclaves

https://arxiv.org/abs/2510.09272
175•todsacerdoti•16h ago•8 comments

Smartphones and being present

https://herman.bearblog.dev/being-present/
276•articsputnik•20h ago•179 comments

JIT: So you want to be faster than an interpreter on modern CPUs

https://www.pinaraf.info/2025/10/jit-so-you-want-to-be-faster-than-an-interpreter-on-modern-cpus/
133•pinaraf•1d ago•27 comments

LLMs are getting better at character-level text manipulation

https://blog.burkert.me/posts/llm_evolution_character_manipulation/
95•curioussquirrel•14h ago•62 comments

DDoS Botnet Aisuru Blankets US ISPs in Record DDoS

https://krebsonsecurity.com/2025/10/ddos-botnet-aisuru-blankets-us-isps-in-record-ddos/
125•JumpCrisscross•11h ago•95 comments

NVIDIA DGX Spark In-Depth Review: A New Standard for Local AI Inference

https://lmsys.org/blog/2025-10-13-nvidia-dgx-spark/
39•yvbbrjdr•9h ago•33 comments

vali, a C library for Varlink

https://emersion.fr/blog/2025/announcing-vali/
33•GalaxySnail•3d ago•10 comments

Strudel REPL – a music live coding environment living in the browser

https://strudel.cc
167•birdculture•15h ago•31 comments

New York Times, AP, Newsmax and others say they won't sign new Pentagon rules

https://apnews.com/article/pentagon-press-access-defense-department-rules-95878bce05096912887701e...
209•baobun•7h ago•70 comments

KDE celebrates the 29th birthday and kicks off the yearly fundraiser

https://kde.org/fundraisers/yearend2025/
6•jrepinc•34m ago•0 comments

Why did containers happen?

https://buttondown.com/justincormack/archive/ignore-previous-directions-8-devopsdays/
129•todsacerdoti•22h ago•152 comments

Passt – Plug a Simple Socket Transport

https://passt.top/passt/about/
23•zdw•1w ago•3 comments

America's future could hinge on whether AI slightly disappoints

https://www.noahpinion.blog/p/americas-future-could-hinge-on-whether
138•jxmorris12•17h ago•137 comments

JSON River – Parse JSON incrementally as it streams in

https://github.com/rictic/jsonriver
194•rickcarlino•5d ago•81 comments

Abstraction, not syntax

https://ruudvanasseldonk.com/2025/abstraction-not-syntax
94•unripe_syntax•1d ago•51 comments

Software update bricks some Jeep 4xe hybrids over the weekend

https://arstechnica.com/cars/2025/10/software-update-bricks-some-jeep-4xe-hybrids-over-the-weekend/
394•gloxkiqcza•20h ago•269 comments

Nanochat

https://simonwillison.net/2025/Oct/13/nanochat/
26•bilsbie•9h ago•4 comments