What if AI doesn't need more RAM but better math?

https://adlrocha.substack.com/p/adlrocha-what-if-ai-doesnt-need-more
23•adlrocha•1h ago

Comments

Lerc•1h ago
This is one of the basic avenues for advancement.

Compute, bytes of RAM used, bytes in the model, bytes accessed per iteration, bytes of data used for training.

You can trade the balance if you can find another way to do things; extreme quantisation is but one direction to try. KANs were aiming for more compute and fewer parameters. The recent optimisation projects have been pushing at these various properties. Sometimes gains in one come at the cost of another, but that needn't always be the case.

LoganDark•1h ago
We will not see memory demand decrease because this will simply allow AI companies to run more instances. They still want an infinite amount of memory at the moment, no matter how AI improves.
jurgenburgen•22m ago
If models become more efficient, we will move more of the work to local devices instead of using SaaS models. We're still in the mainframe era of LLMs.
DeathArrow•6m ago
I don't think we are there yet. Models running in data centers will still be noticeably better, as the same efficiency gains will let providers build and run better models.

Not many people today would want models comparable to what was SOTA two years ago.

To run models locally and get results as good as those from models running in data centers, we need both efficiency and a wall in AI improvement.

Neither of those two conditions seems likely to become true in the near future.

redrove•12m ago
I disagree. I think a sharp drop in memory requirements of at least an order of magnitude will cause demand to adjust accordingly.

Alpaca: Cross-Border Balancing Capacity Cooperation for aFRR

https://www.entsoe.eu/network_codes/eb/alpaca/
1•doener•4m ago•0 comments

IAMPerformance Issue 001 – Physics-based quantum hardware intelligence report [pdf]

https://iamperformance.online/IAMPerformance_Issue001.pdf
1•hmahaffeyges•5m ago•1 comments

Show HN: WhatToBuy – Describe your situation, get AI-curated shopping carts

1•crackeddude•7m ago•0 comments

Hledger AI Policy

https://hledger.org/AI.html
1•yehoshuapw•7m ago•0 comments

Beasts of the Southern Wild

https://medium.com/luminasticity/on-beasts-of-the-southern-wild-40fc0ea39a2b
1•bryanrasmussen•14m ago•0 comments

Native Jellyfin Client for macOS

https://github.com/CustomIcon/Lume
1•CustomIcon•15m ago•1 comments

New Infinity Stealer malware grabs macOS data via ClickFix lures

https://www.bleepingcomputer.com/news/security/new-infinity-stealer-malware-grabs-macos-data-via-...
1•01-_-•17m ago•0 comments

Overestimation of microplastics potentially caused by scientists' gloves

https://news.umich.edu/nitrile-and-latex-gloves-may-cause-overestimation-of-microplastics-u-m-stu...
1•giuliomagnifico•19m ago•0 comments

Cicada Variant 2026: The New Covid Threat Emerging in Silence

https://comuniq.xyz/post?t=891
1•01-_-•20m ago•0 comments

Scion: Running Concurrent LLM Agents with Isolated Identities and Workspaces

https://googlecloudplatform.github.io/scion/overview/
2•smartius•26m ago•0 comments

ESP32-S31: 320MHz 2C RV32IMAFCP+CLIC, 512KB SRAM, GbE, 802.11ax, 61 GPIO

https://www.espressif.com/en/news/ESP32_S31_Release
1•topspin•26m ago•0 comments

What fork() Actually Copies

https://tech.daniellbastos.com.br/posts/what-fork-actually-copies/
2•thunderbong•31m ago•0 comments

Are LLMs a Dead End? [video]

https://www.youtube.com/watch?v=9IMV80GvBpU
2•pullshark91•34m ago•0 comments

How Developers use AI

https://vibecodingstats.com/
2•krenerd•35m ago•0 comments

Lithuanian Legislation as a Git Repo

https://github.com/Yiin/lt-teises-aktai
1•debesyla•36m ago•0 comments

Should you do a PhD? (2025)

https://neurofrontiers.blog/should-you-do-a-phd/
1•lentoutcry•38m ago•0 comments

DaVinci-MagiHuman: Open-source AI model for realistic video generation

https://firethering.com/davinci-magihuman-open-source-ai-video-model/
1•steveharing1•42m ago•0 comments

Working on Products People Hate

https://www.seangoedecke.com/working-on-products-people-hate/
2•herbertl•42m ago•0 comments

Vibe physics: The AI grad student

https://www.anthropic.com/research/vibe-physics
1•cl3misch•48m ago•0 comments

A minimal React shopping list app structured for Capacitor/iOS packaging

https://github.com/sangress/shopping-list
1•sangress_dev•49m ago•0 comments

From Agent to Domain Intelligence: A Self-Evolving Knowledge Engine

https://simaxiaoqian.substack.com/p/from-agent-to-domain-intelligence
1•qingant•1h ago•1 comments

City Skylines II: Office Evolution and City Stations Available Now

https://www.paradoxinteractive.com/games/cities-skylines-ii/news/office-evolution-and-city-statio...
1•doener•1h ago•0 comments

Debugging and Fixing Interaction to Next Paint (INP)

https://www.remoterocketship.com/advice/how-i-debugged-and-fixed-inp/
1•Lior539•1h ago•0 comments

Iceflake's First Patch for Cities Skylines 2 Is Good [video]

https://www.youtube.com/watch?v=pNL0iYIj0mA
1•doener•1h ago•0 comments

Lat.md: Agent Lattice: a knowledge graph for your codebase, written in Markdown

https://github.com/1st1/lat.md
1•doppp•1h ago•0 comments

Tried a New AI Image Tool for Real-World Design Work (Nano Banana Pro)

https://www.nanobananapro.org
1•nanobananapro•1h ago•0 comments

LinkedIn uses 2.4 GB RAM across two tabs

3•hrncode•1h ago•0 comments

How to use ETag header for optimistic concurrency

https://event-driven.io/en/how_to_use_etag_header_for_optimistic_concurrency/
1•birdculture•1h ago•0 comments

30 Years Ago, Robots Learned to Walk Without Falling

https://spectrum.ieee.org/honda-p2-robot-ieee-milestone
1•vinhnx•1h ago•0 comments

Show HN: Litmus – Run a Parallel Autonomous ML Research Org on Your OpenClaw

https://github.com/kuberwastaken/litmus
1•kuberwastaken•1h ago•0 comments