frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: LocalGPT – A local-first AI assistant in Rust with persistent memory

https://github.com/localgpt-app/localgpt
152•yi_wang•5h ago•48 comments

Haskell for all: Beyond agentic coding

https://haskellforall.com/2026/02/beyond-agentic-coding
73•RebelPotato•5h ago•18 comments

SectorC: A C Compiler in 512 bytes (2023)

https://xorvoid.com/sectorc.html
267•valyala•13h ago•51 comments

Total surface area required to fuel the world with solar (2009)

https://landartgenerator.org/blagi/archives/127
30•robtherobber•4d ago•28 comments

Software factories and the agentic moment

https://factory.strongdm.ai/
207•mellosouls•15h ago•355 comments

Speed up responses with fast mode

https://code.claude.com/docs/en/fast-mode
170•surprisetalk•12h ago•163 comments

LLMs as the new high level language

https://federicopereiro.com/llm-high/
75•swah•4d ago•130 comments

Brookhaven Lab's RHIC concludes 25-year run with final collisions

https://www.hpcwire.com/off-the-wire/brookhaven-labs-rhic-concludes-25-year-run-with-final-collis...
76•gnufx•11h ago•59 comments

Hoot: Scheme on WebAssembly

https://www.spritely.institute/hoot/
183•AlexeyBrin•18h ago•35 comments

Stories from 25 Years of Software Development

https://susam.net/twenty-five-years-of-computing.html
176•vinhnx•16h ago•17 comments

Why there is no official statement from Substack about the data leak

https://techcrunch.com/2026/02/05/substack-confirms-data-breach-affecting-email-addresses-and-pho...
30•witnessme•2h ago•7 comments

Vocal Guide – belt sing without killing yourself

https://jesperordrup.github.io/vocal-guide/
328•jesperordrup•23h ago•98 comments

The Architecture of Open Source Applications (Volume 1) Berkeley DB

https://aosabook.org/en/v1/bdb.html
8•grep_it•5d ago•0 comments

First Proof

https://arxiv.org/abs/2602.05192
138•samasblack•15h ago•81 comments

Wood Gas Vehicles: Firewood in the Fuel Tank (2010)

https://solar.lowtechmagazine.com/2010/01/wood-gas-vehicles-firewood-in-the-fuel-tank/
35•Rygian•2d ago•9 comments

Show HN: I saw this cool navigation reveal, so I made a simple HTML+CSS version

https://github.com/Momciloo/fun-with-clip-path
86•momciloo•13h ago•17 comments

Vouch

https://twitter.com/mitchellh/status/2020252149117313349
77•chwtutha•3h ago•20 comments

Al Lowe on model trains, funny deaths and working with Disney

https://spillhistorie.no/2026/02/06/interview-with-sierra-veteran-al-lowe/
109•thelok•15h ago•24 comments

Start all of your commands with a comma (2009)

https://rhodesmill.org/brandon/2009/commands-with-comma/
593•theblazehen•3d ago•212 comments

Show HN: A luma dependent chroma compression algorithm (image compression)

https://www.bitsnbites.eu/a-spatial-domain-variable-block-size-luma-dependent-chroma-compression-...
41•mbitsnbites•3d ago•5 comments

FDA intends to take action against non-FDA-approved GLP-1 drugs

https://www.fda.gov/news-events/press-announcements/fda-intends-take-action-against-non-fda-appro...
114•randycupertino•8h ago•241 comments

The AI boom is causing shortages everywhere else

https://www.washingtonpost.com/technology/2026/02/07/ai-spending-economy-shortages/
314•1vuio0pswjnm7•19h ago•502 comments

Learning from context is harder than we thought

https://hy.tencent.com/research/100025?langVersion=en
235•limoce•4d ago•125 comments

OpenCiv3: Open-source, cross-platform reimagining of Civilization III

https://openciv3.org/
907•klaussilveira•1d ago•277 comments

Where did all the starships go?

https://www.datawrapper.de/blog/science-fiction-decline
160•speckx•4d ago•244 comments

Selection rather than prediction

https://voratiq.com/blog/selection-rather-than-prediction/
36•languid-photic•4d ago•17 comments

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

https://github.com/valdanylchuk/breezydemo
304•isitcontent•1d ago•39 comments

An Update on Heroku

https://www.heroku.com/blog/an-update-on-heroku/
498•lstoll•1d ago•332 comments

Sheldon Brown's Bicycle Technical Info

https://www.sheldonbrown.com/
447•ostacke•1d ago•114 comments

Monty: A minimal, secure Python interpreter written in Rust for use by AI

https://github.com/pydantic/monty
314•dmpetrov•1d ago•158 comments
Open in hackernews

Developing a BLAS Library for the AMD AI Engine [pdf]

https://uni.tlaan.nl/thesis/msc_thesis_tristan_laan_aieblas.pdf
46•teleforce•1mo ago

Comments

kouteiheika•1mo ago
So it's called an "AI Engine", but its performance is worse than just running the same thing on CPU? Doesn't it make it essentially useless for anything AI related? What's the point of this hardware then? Better power efficiency for tiny models? Surely someone must be using it for something?
shetaye•1mo ago
The CPU baseline seems to be the beefy host CPU. The AIE is presumably faster than what you could do with the FPGA (DPS, LUT, etc.) alone.
heavyset_go•1mo ago
The point is offloading ML workloads to hardware that is energy efficient, not necessarily "fast" hardware.

You want to minimize the real and energy costs at the expense of time.

Assuming NPUs don't get pulled from consumer hardware altogether, theoretically the time/efficiency trade-off gap will become smaller and smaller as time goes on.

titanix88•1mo ago
Looks like the author have not used software pipelining compiler directives with the kernel loops. AMD AIE architecture has 5 cycle load/store latency and 7 cycle FP unit latency. With software pipelining, they could have 5-10x speed up for long loops.
nl•1mo ago
Note that this is BLAS on the AMD/Xilinx VCK5000 FPGA: https://www.amd.com/en/products/adaptive-socs-and-fpgas/eval...
heavyset_go•1mo ago
How does this line compare to the Ryzen AI branded Xilinx FPGAs in newer mobile AMD APUs?
wmf•1mo ago
The Ryzen AI NPU is from Xilinx but it's not an FPGA BTW.
heavyset_go•1mo ago
I thought the XDNA line was related to Xilinx's Versal (or Alveo, I forget) lines that use FPGA fabric?

Or maybe I'm misinterpreting press releases, as evidently Notebookcheck.net lied to me years ago :(

[1] https://www.notebookcheck.net/AMD-details-4-nm-Zen-4-Ryzen-7...

wtallis•1mo ago
It's an IP block that Xilinx can provide for use on their FPGAs, but as implemented on the Ryzen parts it's synthesized into a hard IP block, not an FPGA block plus bitstream.
fooblaster•1mo ago
This architecture is likely going to be a dead end for AMD. It has been in the wild for several years, yet still has no open programming model, multiple compiler stacks with poor software support. I find it likely that AMD drops this architecture and unifies their ML support around their GPGPU hardware.
fooblaster•1mo ago
Looks like they have made some progress on a native model in recent months: https://github.com/amd/IRON/tree/devel
imtringued•1mo ago
I know this is a master thesis but I'm kind of disappointed by this. The AMD AI Engine is a GEMM and Flash Attention workhorse. Those are the primary workloads and the non-sparse versions of those workloads map 1:1 to the AI Engine. We don't see that in this master thesis.

Like, I'm sitting here on the sidelines and thinking that someone is going to do implement this stuff before I even get a chance, which is why I never mention the blatantly obvious communication pattern that is breathing down your neck that the AI Engines are begging you to implement. Doing Flash Attention is slightly more difficult, but not meaningfully so.

If you are using broadcasting to spread your A and B matrices, you're doing it wrong. You can do the thing that others do inside their processor "outside". Once you understand that, you will start to realize that this is actually the best possible architecture for dense GEMM and dense FlashAttention.