frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Benchmark Framework Desktop Mainboard and 4-node cluster

https://github.com/geerlingguy/ollama-benchmark/issues/21
87•geerlingguy•2h ago

Comments

jeffbee•1h ago
I had been hoping that these would be a bit faster than the 9950X because of the different memory architecture, but it appears that due to the lower power design point the AI Max+ 395 loses across the board, by large margins. So I guess these really are niche products for ML users only, and people with generic workloads that want more than the 9950X offers are shopping for a Threadripper.
dijit•1h ago
Sounds about right.

I’m struggling to justify the cost of a Threadripper (let alone pro!) for a AAA game studio though.

I wonder who can justify these machines. High frequency trading? data science? shouldn’t that be done on servers?

jeffbee•37m ago
Yeah I don't get it either. To get marginally more resources than the 9950X you have to make a significant leap in price to a $1500+ CPU on a $1000 motherboard.
kadoban•32m ago
Threadripper very rarely seems to make any sense. The only times it seems like you want it are for huge memory support/bandwidth and/or a huge number of pcie slots. But it's not cheap or supported enough compared to epyc to really make sense to me any time I've been specing out a system along those lines.
rtkwe•47m ago
It also seems like the tools aren't there to fully utilize them. Unless I misunderstood he was running off CPU only for all the test so there's still the iGPU and NPU performance that's not been utilized in these tests.
geerlingguy•39m ago
No, only a couple initial tests with Ollama used CPU. I ran most tests on Vulkan / iGPU, and some on ROCm (read further down the thread).

I found it difficult to install ROCm on Fedora 42 but after upgrading to Rawhide it was easy, so I re-tested everything with ROCm vs Vulkan.

Ollama, for some silly reason, doesn't support Vulkan even though I've used a fork many times to get full GPU acceleration with it on Pi, Ampere, and even this AMD system... (moral of the story just stick with llama.cpp).

edwinjones•14m ago
Sadly, the reason they give is subjectively terrible:

https://x.com/ollama/status/1952783981000446029

No experimental flag option, no "you can use the fork that works fine but we don't have capacity to support this" just a hard "no, we think it's unreliable". I guess they just want you to drop them and use llama.cpp.

mhitza•22m ago
I've ran a comparison benchmark for the smaller models https://gist.github.com/mhitza/f5a8eeb298feb239de10f9f60f841...

Comparing it against the RTX 4000 SFF Ada (20GB) which is around $1.2k (if you believe the original price on the nvidia website https://marketplace.nvidia.com/en-us/enterprise/laptops-work...). Which I have access to on a Hetzner GEX44.

I'm going to ballpark it between 2.5-3x faster than the desktop. Except for the tg128 test, where the difference is "minimal" (but I didn't do the math).

GPT-5

https://openai.com/gpt-5/
997•rd•3h ago•1122 comments

Historical Tech Tree

https://www.historicaltechtree.com/
59•louisfd94•1h ago•18 comments

GPT-5: Key characteristics, pricing and system card

https://simonwillison.net/2025/Aug/7/gpt-5/
256•Philpax•2h ago•79 comments

GPT-5 for Developers

https://openai.com/index/introducing-gpt-5-for-developers
248•6thbit•3h ago•117 comments

Benchmark Framework Desktop Mainboard and 4-node cluster

https://github.com/geerlingguy/ollama-benchmark/issues/21
87•geerlingguy•2h ago•8 comments

Building Bluesky comments for my blog

https://natalie.sh/posts/bluesky-comments/
206•g0xA52A2A•4h ago•88 comments

Encryption made for police and military radios may be easily cracked

https://www.wired.com/story/encryption-made-for-police-and-military-radios-may-be-easily-cracked-researchers-find/
30•mikece•2h ago•13 comments

Show HN: Octofriend, a cute coding agent that can swap between GPT-5 and Claude

https://github.com/synthetic-lab/octofriend
37•reissbaker•1h ago•16 comments

Windows XP Professional

https://win32.run/
206•pentagrama•6h ago•128 comments

DNA tests are uncovering the true prevalence of incest (2024)

https://www.theatlantic.com/health/archive/2024/03/dna-tests-incest/677791/
58•georgecmu•2h ago•34 comments

Infinite Pixels

https://meyerweb.com/eric/thoughts/2025/08/07/infinite-pixels/
200•OuterVale•7h ago•45 comments

How to sell if your user is not the buyer

https://writings.founderlabs.io/p/how-to-sell-if-your-user-is-not-the
108•mooreds•5h ago•56 comments

Foundry (YC F24) is hiring staff-level product engineers

https://www.ycombinator.com/companies/foundry/jobs/jwdYx6v-founding-product-engineer
1•lakabimanil•3h ago

Lightweight LSAT

https://lightweightlsat.com/
35•gregsadetsky•2h ago•19 comments

OpenAI's new open-source model is basically Phi-5

https://www.seangoedecke.com/gpt-oss-is-phi-5/
12•emschwartz•1h ago•1 comments

Open music foundation models for full-song generation

https://map-yue.github.io/
19•selvan•3d ago•3 comments

Gemini CLI GitHub Actions

https://blog.google/technology/developers/introducing-gemini-cli-github-actions/
211•michael-sumner•11h ago•87 comments

Show HN: Browser AI agent platform designed for reliability

https://github.com/nottelabs/notte
25•ogandreakiro•3h ago•7 comments

How AI conquered the US economy: A visual FAQ

https://www.derekthompson.org/p/how-ai-conquered-the-us-economy-a
119•rbanffy•10h ago•117 comments

Laptop Support and Usability (LSU): July 2025 Report

https://github.com/FreeBSDFoundation/proj-laptop/blob/main/monthly-updates/2025-07.md
85•grahamjperrin•6h ago•45 comments

A generic non-invasive neuromotor interface for human-computer interaction

https://www.nature.com/articles/s41586-025-09255-w
17•msephton•3d ago•2 comments

Monte Carlo Crash Course: Quasi-Monte Carlo

https://thenumb.at/QMC/
88•zote•3d ago•9 comments

Jepsen: Capela dda5892

https://jepsen.io/analyses/capela-dda5892
60•aphyr•5h ago•6 comments

Leonardo Chiariglione: “I closed MPEG on 2 June 2020”

https://leonardo.chiariglione.org/
190•eggspurt•10h ago•180 comments

The Sunlight Budget of Earth

https://www.asimov.press/p/sunlight-budget
36•mailyk•4h ago•12 comments

Zero-day flaws in authentication, identity, authorization in HashiCorp Vault

https://cyata.ai/blog/cracking-the-vault-how-we-found-zero-day-flaws-in-authentication-identity-and-authorization-in-hashicorp-vault/
200•nihsy•13h ago•87 comments

Preventing ZIP parser confusion attacks on Python package installers

https://blog.pypi.org/posts/2025-08-07-wheel-archive-confusion-attacks/
36•miketheman•4h ago•8 comments

Arm desktop: emulation

https://marcin.juszkiewicz.com.pl/2025/07/22/arm-desktop-emulation/
74•PaulHoule•8h ago•33 comments

Lithium compound can reverse Alzheimer’s in mice: study

https://hms.harvard.edu/news/could-lithium-explain-treat-alzheimers-disease
108•highfrequency•5h ago•68 comments

Claude Code IDE integration for Emacs

https://github.com/manzaltu/claude-code-ide.el
730•kgwgk•1d ago•246 comments