Nvidia sells tiny new computer that puts big AI on your desktop

https://arstechnica.com/ai/2025/10/nvidia-sells-tiny-new-computer-that-puts-big-ai-on-your-desktop/

24•turbocon•1d ago

Comments

adam_patarino•1d ago

If you would buy this I’d love to know how you’d use it.

antinomicus•19h ago

Though the adage “this is the worst it’ll ever be” is parroted daily by AI cultists, the fact is it’s still yet to be proven that currently available LLMs can be made cost effective. For now every ai company is lighting tens of billions of dollars on fire every year and hoping better algorithms, hardware, and user lock in will ensure profits eventually. If this doesn’t happen, they will design more and more “features” in the LLM to monetize it - shopping, ads, sponsored replies, who knows? It may get really awful. And these companies will have so much of our data and eventually the need to make profits will lead them to sell that data and just generally try to extract as much out of us as they can.

This is why in the long run I believe we all should aspire to do LLM inference locally. But unfortunately we just are not anywhere close to par with the SoTA cloud models available. Something like DGX spark would be a decent step in this direction, but this platform appears to mostly be for prototyping / training models meant to eventually be run on data center nvidia hardware.

Personally I think I will probably spec out an M5 max/ultra Mac Studio once that’s a thing, and start trying to do this more seriously. The tools are getting better every day and “this is the worst it’ll ever be” is much more applicable to locally run models.

BizarroLand•18h ago

I would use it for locally hosted RAG or whatever tech has supplanted it instead of paying API fees. We have ~20TB of documents that occasionally need to be scanned and chatted with and $4,000 one time (+ electricity) is chump change compared to the annual costs we would otherwise be looking at.

turbocon•1d ago

I want to know if this is any different than all of the AMD AI Max PCs with 128gb of unified memory? The spec sheet say "128 GB LPDDR5x", so how is this better?

https://nvdam.widen.net/s/tlzm8smqjx/workstation-datasheet-d...

andsoitis•1d ago

> AMD AI Max PCs with 128gb of unified memory? The spec sheet say "128 GB LPDDR5x", so how is this better?

Framework's AMD AI Max PCs also come with LPDDR5x-8000 memory: https://frame.work/desktop?tab=specs

Numerlor•1d ago

The GPU is significantly faster and it has cuda, though I'm not sure where it'd fit in the market.

At the lower price points you have the AMD machines which are significantly cheaper, even though they're slower and with worse support. Then there's apple's with higher memory bandwidth and even the nvidia agx Thor is faster in GPU compute at the cost of worse CPU and networking, and at the 3-4K price point even a threadripper system becomes viable that can get significantly more memory

BoredPositron•19h ago

CUDA.

mcphage•1d ago

That’s a tiny box that draws 240 watts… what does it use for cooling?

gradientsrneat•23h ago

Interesting, but perhaps not surprising, that the OS is Ubuntu-based, with Nvidia software preinstalled.

BizarroLand•18h ago

Given that it runs on ARM chips and is specifically designed for AI tasks, I would be more surprised to see it running Windows by default

hulitu•18h ago

> Nvidia sells tiny new computer that puts big AI on your desktop

A bit expensive for 128 GB RAM. What can the CPU do ? Can it run flawlessly all svchost.exe instances in Windows 11 ? At this money, does it have a headphones output ?

Self-hosting your code on Gitea

Microsoft: RU, China increasingly using AI to escalate cyberattacks on the US

Conflict-Free Replicated Data Types (CRDTs): Convergence Without Coordination

CDC tormented: HR workers summoned from furlough to lay off themselves, others

She Faked Her Way into Yale. Then Things Unraveled

Fuse-ZSTD: mimic transparent compression on ext4

Conference Cheat Sheet: Strategy+Examples for Positive ROI

Where Are the Aliens? New Study Suggests They're Stuck Like Us

FlashWorld: High-quality 3D Scene Generation within Seconds

DBT Multi-Adapter Utils

Show HN: Chorey – A Type-Safe, Asynchronous Pipeline Framework for Python

Who's Submitting AI-Tainted Filings in Court?

The 2-pager used to raise $3.5M from the investors behind Lovable, n8n, and Miro

Google background is dark even when in light mode

Blind Conductor and Amnesiac Agents-problems no one talks about

3D-printed fuel cells could reshape sustainable aerospace applications

Working with the Amiga's RAM and Rad Disks

Bulk Operations in Boost.Bloom

Western Executives Shaken After Visiting China

PowerShell Universal joins Devolutions: a new chapter in IT automation

Improving the Trustworthiness of JavaScript on the Web

The evolution of 37signals over 25 years

Procedural Generation with Wave Function Collapse

Operational Transparency

Show HN: PAO Trainer – A small app I built to practice PAO memory systems [video]

Phage Therapy

The Human Only Public License

Show HN: I made DressMate, an AI to decide what to wear from your own wardrobe

Practical seed recovery for the PCG pseudo-random number generator

Which Nested Data Format Do LLMs Understand Best? JSON vs. YAML vs. XML vs. MD