Ask HN: Will local models on normal hardware ever compete?

1•locusofself•1h ago
I have a MacBook Air M3 with 24 GB of RAM. The other day, I wanted to try running an LLM locally for the first time. I ran gemma-4-e4b and threw some chats at it.

It reminded me a bit of my very first experiences with ChatGPT. Clearly less capable than something like Opus 4.6, but it made me excited about the possibilities.

I know that fairly capable models can be run by mere mortals who have a fancy GPU.

My real question is, will some combination of hardware and software optimizations get us anywhere close to "state of the art" models running on truly basic hardware?

With all the ridiculous capex being spent on datacenters etc, what if something akin to Moore's Law, or other algorithmic breakthroughs, will get us super capable LLMs that can run on the average machine?

Comments

rvz•1h ago
> With all the ridiculous capex being spent on datacenters etc, what if something akin to Moore's Law, or other algorithmic breakthroughs, will get us super capable LLMs that can run on the average machine?

It is more a software problem and the next breakthrough will come from clever algorithms.

You have just seen TurboQuant create promising efficiency gains and there are many other papers being released that propose further software optimisations that make it possible to run 100B+ LLMs on device.
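
For a rough sense of scale, the memory needed just to hold a model's weights is about (parameter count × bits per weight) / 8 bytes. Here is a back-of-the-envelope sketch, not a benchmark: the function name is hypothetical, 1 GB is taken as 10^9 bytes, and KV cache, activations, and runtime overhead are all ignored.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


RAM_GB = 24  # the MacBook Air described in the original post

# Compare a 100B-parameter model at several quantization widths.
for bits in (16, 8, 4, 2):
    gb = weight_memory_gb(100, bits)
    verdict = "fits" if gb <= RAM_GB else "does not fit"
    print(f"100B params at {bits}-bit: {gb:.0f} GB ({verdict} in {RAM_GB} GB)")
```

Under these assumptions, a 100B model comes to roughly 200 GB at 16-bit, 50 GB at 4-bit, and 25 GB at 2-bit, so quantization alone would not quite squeeze it into 24 GB.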

bigyabai•27m ago
> It is more a software problem and the next breakthrough will come from clever algorithms.

I don't know if I can agree. The hardware side is extremely sub-optimal on raster-focused GPU architectures like Apple Silicon. If I had to bet, the hardware will improve a lot more than the software will over the next 10 years as more vendors adopt GPGPU characteristics.

> You have just seen TurboQuant create promising efficiency gains

TurboQuant looks like a vibe-laundered implementation of EDEN quantization: https://openreview.net/forum?id=tO3ASKZlok

anuramat•50m ago
I'm pretty sure an average machine will always be less capable than a datacenter; the rest depends on your definition of "super capable".

Google's best practices document for designing AI products

https://pair.withgoogle.com/guidebook/
1•dotancohen•2m ago•0 comments

Excited Delirium [audio]

https://thisiscriminal.com/episode-355-excited-delirium-3-6-2026/
1•muddi900•4m ago•0 comments

North Korean IT workers are stealing remote jobs: Americans are helping them

https://fortune.com/2026/04/25/north-korean-it-worker-scheme-american-faciliators/
1•napolux•4m ago•1 comment

New Bible TUI App Releases v1.0.0

https://github.com/DeLsonJabberwo/bible-tui
1•delsonjabberwo•4m ago•0 comments

Agent Harness Engineering

https://addyosmani.com/blog/agent-harness-engineering/
1•kiyanwang•7m ago•0 comments

The Normal Work of Creating Reliability

https://surfingcomplexity.blog/2026/04/26/the-normal-work-of-creating-reliability/
1•azhenley•8m ago•0 comments

Everything that went wrong with Claude

https://clawd.rip/
1•aratahikaru5•13m ago•0 comments

How to hire people who are better than you

https://longform.asmartbear.com/hire-better-than-you/
1•kiyanwang•14m ago•0 comments

Terraform is dead

https://grahamgilbert.com/blog/2026/04/20/terraform-is-dead/
2•milkglass•17m ago•0 comments

Chernobyl disaster (April 26th, 1986)

https://en.wikipedia.org/wiki/Chernobyl_disaster
1•simonebrunozzi•18m ago•0 comments

But what is L0-L2 processing for satellite data?

https://medium.com/@aryachauhan7/but-what-is-l0-l2-processing-for-satellite-data-27f39f5324a1
1•marklit•19m ago•0 comments

Don't Confuse Computer Science with Coding

https://substack.com/home/post/p-194090221
2•AbbeFaria•20m ago•0 comments

AI can cost more than human workers now

https://www.axios.com/2026/04/26/ai-cost-human-workers
9•nreece•28m ago•2 comments

Conversations with Cosmos

https://madsenaim.substack.com/p/coming-soon
1•aimmia•32m ago•0 comments

I Am Doing This: The Origin Story of Project-AI

https://zenodo.org/records/19592336
1•IAmSoThirsty•32m ago•0 comments

Lipovive Review: Effective Formula for Fitness

https://www.morningstar.com/news/accesswire/1138075msn/lipovive-reviews-shocking-2026-report-what...
1•JamesLoynes•34m ago•0 comments

OpenAI boss 'deeply sorry' for not telling police of mass shooter's account

https://www.bbc.com/news/articles/cq6je7e80r7o
3•chistev•38m ago•0 comments

An AI driven WP theming workflow

https://anchor.host/a-custom-wordpress-theme-from-scratch-in-2026-an-ai-driven-workflow/
1•g00m•39m ago•0 comments

Savings Hacks You Must Know

https://www.threads.com/@financial.tips.101/post/DXnfOxRCMEX
1•hennix22•43m ago•0 comments

Claude 4.7 vs. ChatGPT 5.5

https://www.tomsguide.com/ai/7-0-wipeout-i-put-chatgpt-5-5-and-claude-4-7-through-7-impossible-te...
3•ageospatial•44m ago•0 comments

Claude Platform on AWS (Coming Soon)

https://aws.amazon.com/claude-platform/
1•qainsights•45m ago•0 comments

Txtfold – summarize large files for LLMs

https://github.com/kristiandupont/txtfold
1•kristiandupont•49m ago•0 comments

The Silencing Engine

https://kitchencloset.com/realstuff/essays/the_silencing_engine/
1•bcRIPster•53m ago•0 comments

Show HN: ChatForm – Create an AI chat form in 1 minute

https://chatform.000ooo.ooo/
1•fengyiqicoder•1h ago•0 comments

Draft's knowledge graph engine – deterministic codebase understanding for AI

https://www.getdraft.dev/blog/local-graph-engine/
1•mayurpise•1h ago•0 comments

Why the Future Doesn't Need Us

https://web.archive.org/web/20160210081017/http://www.wired.com/2000/04/joy-2/
1•signa11•1h ago•0 comments

The Publishing Mystery That No One Wants to Talk About

https://www.theatlantic.com/books/2026/04/who-really-wrote-autistic-author-woody-brown-novel/686814/
1•samclemens•1h ago•0 comments

AMD's Zen: Coming Back from the Dead

https://clamtech.org/?dest=zen1
1•matt_d•1h ago•1 comment

Coyote vs. Acme (1990)

https://www.newyorker.com/magazine/1990/02/26/coyote-v-acme
2•aaronbrethorst•1h ago•0 comments

Learning About FPGAs in Finance

https://www.semidesignjobs.com/blog/fpgas-in-finance-hft
1•johncole•1h ago•0 comments