Running a One Trillion-Parameter LLM Locally on AMD Ryzen AI Max+ Cluster

https://www.amd.com/en/developer/resources/technical-articles/2026/how-to-run-a-one-trillion-parameter-llm-locally-an-amd.html
29•mindcrime•2h ago

Comments

verdverm•1h ago
The setup was around $10k, but maybe more now with memory/SSD prices.

This is a good list. I like my Beelink a lot; my Minisforum likes to turn itself off every couple of weeks, not sure why yet.

https://www.techradar.com/pro/there-are-15-amd-ryzen-ai-max-...

---

Performance is pretty bad (<10 tok/s) and context is quite limited. Still good to see progress.

Prompt size (tokens) | TFT (s), Flash Attention disabled | TFT (s), Flash Attention enabled
4096 | 53.7 | 39.7
8192 | Out of memory (OOM) | 90.5
16384 | Out of memory (OOM) | 239.1
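
As a rough sanity check on the table above, the Flash Attention-enabled column implies a prompt-processing rate that falls as the prompt grows. This is only a sketch: it assumes the whole time-to-first-token is spent on prefill, and the helper name `prefill_rate` is made up for illustration.

```python
# TTFT values in seconds with Flash Attention enabled, copied from the table above.
ttft_seconds = {4096: 39.7, 8192: 90.5, 16384: 239.1}


def prefill_rate(prompt_tokens: int, ttft: float) -> float:
    """Approximate prompt-processing speed in tokens per second.

    Assumption: all of the time-to-first-token is prefill time.
    """
    return prompt_tokens / ttft


rates = {n: prefill_rate(n, t) for n, t in ttft_seconds.items()}

for n, r in rates.items():
    print(f"{n:>6} tokens: {r:6.1f} tok/s prefill")
```

Under that assumption the rate works out to roughly 103, 91, and 69 tok/s for the three prompt sizes, so each doubling of the prompt more than doubles the wait.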

ibeckermayer•1h ago
Cool that it's possible, but the performance characteristics are basically unusable. For an 8192-token prompt they report a ~1.5-minute time-to-first-token and then 8.30 tk/s from there. For context, ChatGPT is typically <<1 s TTFT and ~50 tk/s.
elcritch•1h ago
That’s pretty awesome!

Though only 5 Gb Ethernet? Can’t they do USB-C / Thunderbolt 40 Gb/s connections like Macs?

burnt-resistor•1h ago
Framework has gone all in on the Apple-style consumerization route of unrepairability and non-upgradeability, with a nonstandard machine, soldered-on RAM, and no meaningful PCIe slots. There's only the superficial appearance of longevity and future-proofing when it's really yet another silo. There's no way to add IB, FC, or 100/400 GbE NICs to these machines. 5 GbE is a joke. Non-ECC RAM is a joke.
tills13•1h ago
I set up Ollama today and can barely run a 3B-parameter model before the lag makes it unbearable.

How much is one of these gonna run me?

Microgpt

http://karpathy.github.io/2026/02/12/microgpt/
225•tambourine_man•2h ago•25 comments

We do not think Anthropic should be designated as a supply chain risk

https://twitter.com/OpenAI/status/2027846016423321831
341•golfer•6h ago•154 comments

The Windows 95 user interface: A case study in usability engineering (1996)

https://dl.acm.org/doi/fullHtml/10.1145/238386.238611
167•ksec•5h ago•102 comments

Obsidian Sync now has a headless client

https://help.obsidian.md/sync/headless
414•adilmoujahid•11h ago•146 comments

The happiest I've ever been

https://ben-mini.com/2026/the-happiest-ive-ever-been
364•bewal416•2d ago•175 comments

Show HN: Xmloxide – an agent made rust replacement for libxml2

https://github.com/jonwiggins/xmloxide
37•jawiggins•4h ago•24 comments

H-Bomb: A Frank Lloyd Wright Typographic Mystery

https://www.inconspicuous.info/p/h-bomb-a-frank-lloyd-wright-typographic
31•mrngm•2d ago•9 comments

Block the “Upgrade to Tahoe” Alerts

https://robservatory.com/block-the-upgrade-to-tahoe-alerts-and-system-settings-indicator/
161•todsacerdoti•8h ago•72 comments

Woxi: Wolfram Mathematica Reimplementation in Rust

https://github.com/ad-si/Woxi
261•adamnemecek•3d ago•108 comments

Addressing Antigravity Bans and Reinstating Access

https://github.com/google-gemini/gemini-cli/discussions/20632
213•RyanShook•14h ago•174 comments

SpacetimeDB ThreeJS Support

https://discourse.threejs.org/t/spacetimedb-threejs-support-and-free-tier/90052
6•ryker2000•3d ago•3 comments

Verified Spec-Driven Development (VSDD)

https://gist.github.com/dollspace-gay/d8d3bc3ecf4188df049d7a4726bb2a00
156•todsacerdoti•10h ago•81 comments

Deterministic Programming with LLMs

https://www.mcherm.com/deterministic-programming-with-llms.html
29•todsacerdoti•3d ago•13 comments

Qwen3.5 122B and 35B models offer Sonnet 4.5 performance on local computers

https://venturebeat.com/technology/alibabas-new-open-source-qwen3-5-medium-models-offer-sonnet-4-...
259•lostmsu•7h ago•172 comments

Building a Minimal Transformer for 10-digit Addition

https://alexlitzenberger.com/blog/post.html?post=/building_a_minimal_transformer_for_10_digit_add...
42•kelseyfrog•5h ago•7 comments

Show HN: Now I Get It – Translate scientific papers into interactive webpages

https://nowigetit.us
196•jbdamask•14h ago•99 comments

Werner Herzog Between Fact and Fiction

https://www.thenation.com/article/culture/werner-herzog-future-truth/
70•Hooke•1d ago•14 comments

Samsung Galaxy update removes Android recovery menu tools, including sideloading

https://9to5google.com/2026/02/27/samsung-galaxy-update-android-recovery-menu-removed/
46•pabs3•1h ago•5 comments

New evidence that Cantor plagiarized Dedekind?

https://www.quantamagazine.org/the-man-who-stole-infinity-20260225/
113•rbanffy•3d ago•70 comments

MCP server that reduces Claude Code context consumption by 98%

https://mksg.lu/blog/context-mode
263•mksglu•17h ago•62 comments

Microsoft announces new "mini PCs" for Windows 365

https://www.neowin.net/news/microsoft-announces-new-mini-pcs-for-windows-365/
9•mikece•2d ago•6 comments

Our Agreement with the Department of War

https://openai.com/index/our-agreement-with-the-department-of-war
239•surprisetalk•7h ago•199 comments

The whole thing was a scam

https://garymarcus.substack.com/p/the-whole-thing-was-scam
660•guilamu•11h ago•189 comments

The archivist preserving decaying floppy disks

https://www.popsci.com/technology/floppy-disk-archivist-project/
54•Brajeshwar•3d ago•5 comments

747s and Coding Agents

https://carlkolon.com/2026/02/27/engineering-747-coding-agents/
136•cckolon•1d ago•59 comments

The Eternal Promise: A History of Attempts to Eliminate Programmers

https://www.ivanturkovic.com/2026/01/22/history-software-simplification-cobol-ai-hype/
248•dinvlad•3d ago•167 comments

Ghosts'n Goblins – “Worse danger is ahead”

https://superchartisland.com/ghostsn-goblins/
67•elvis70•3d ago•25 comments

Unsloth Dynamic 2.0 GGUFs

https://unsloth.ai/docs/basics/unsloth-dynamic-2.0-ggufs
206•tosh•19h ago•54 comments

From Noise to Image – interactive guide to diffusion

https://lighthousesoftware.co.uk/projects/from-noise-to-image/
114•simedw•2d ago•15 comments