frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Why no inference directly from flash/SSD?

1•myrmidon•3h ago
My understanding is that current LLMs require a lot of space for pre-computed weights (that are constant at inference-time).

Why is it currently not feasible to just keep those in flash memory (fast PCIe SSD Raid or somesuch), and only use RAM for intermediate values/results?

Even modest success on this front seems very attractive to me, because Flash storage appears much cheaper and easier to scale than GPU memory right now.

Are there any efforts in this direction? Is this a flawed approach for some reason, or am I fundamentally misunderstanding things?

Comments

sunscream89•2h ago
> A typical DRAM has a transfer rate of approximately 2-20GB/s, whereas typical SSDs have a transfer rate of 50MB-200MB/s. So it's one to two orders of magnitude slower.

Hot Chips 2025: Session 1 – CPUs – By George Cozma

https://chipsandcheese.com/p/hot-chips-2025-session-1-cpus
1•rbanffy•1m ago•0 comments

Getting AI Agent Architecture Right with MCP

https://decodingml.substack.com/p/getting-agent-architecture-right
1•rbanffy•1m ago•0 comments

Tyromancy (Telling the future using cheese)

https://en.wikipedia.org/wiki/Tyromancy
1•reaperducer•2m ago•0 comments

Indiana Jones and the Last Crusade Adventure Prototype Recovered for the C64

https://www.gamesthatwerent.com/2025/09/indiana-jones-and-the-last-crusade-adventure-prototype-re...
1•ibobev•2m ago•0 comments

VMware's in court again. Customer relationships rarely go this wrong

https://www.theregister.com/2025/09/08/vmware_in_court_opinion/
1•rntn•3m ago•0 comments

Plot IMDB Series Ratings

https://imdb.derfor.dk/
1•0x000042•4m ago•1 comments

10xDevAi

https://10xdevai.com
1•chaimvaid•6m ago•0 comments

Your Zodiac Sign Is 2k Years Out of Date

https://www.nytimes.com/interactive/2025/upshot/zodiac-signs.html
2•gk1•8m ago•0 comments

Nicholas (Nick) J. Fuentes

https://x.com/NickJFuentes
1•barrister•11m ago•0 comments

Every Commodore Amiga Model Ever Made [video]

https://www.youtube.com/watch?v=JUwpkKVw0Xk
1•rbanffy•12m ago•0 comments

Training to Improve Memory

https://ethz.ch/en/news-and-events/eth-news/news/2025/09/press-release-training-to-improve-memory...
1•geox•15m ago•0 comments

David Baltimore, Nobel-Winning Molecular Biologist, Dies at 87

https://www.nytimes.com/2025/09/07/science/david-baltimore-dead.html
1•mitchbob•17m ago•1 comments

Pre-owned software trial kicks off in UK as Microsoft pushes resale ban

https://www.theregister.com/2025/09/08/microsoft_valuelicensing_latest/
1•beardyw•17m ago•0 comments

Lolgato: Advanced controls for Elgato lights on macOS

https://github.com/raine/Lolgato
1•rane•19m ago•0 comments

Show HN: Search the IndieWeb, one query at a time

https://search.indieblog.page/search
1•splitbrain•20m ago•1 comments

Don't Build an RL Environment Startup

https://benanderson.work/blog/dont-build-rl-env-startup/
1•jxmorris12•20m ago•0 comments

MacBook lid angle sensor sound effects

https://github.com/samhenrigold/LidAngleSensor
1•fanf2•22m ago•0 comments

Show HN: AIHint – Open standard for verifiable website trust metadata

https://github.com/Ai-Hint/aihint-standard
1•aihint•24m ago•1 comments

Show HN: The Daily Word Game Experience

https://wafflegames.net/
1•yangyiming•25m ago•0 comments

TS framework introspectable by AI via GraphQL

https://runner.bluelibs.com/
1•theodordiaconu•27m ago•0 comments

Beyond package management: How Nix refactored my digital life

https://www.jimmyff.co.uk/blog/beyond-package-management-how-nix-refactored-my-digital-life/
1•jimmyff•27m ago•1 comments

Undersea cables cut in Red Sea, disrupting internet access in Asia and Mideast

https://apnews.com/article/red-sea-undersea-cables-cut-internet-disruption-yemen-b79fe7b9764647ac...
1•cobbzilla•29m ago•0 comments

ButterBarTheGr8's Aug 15, 2025 comment in "Unsuitable SSD/NVMe hardware for ZFS"

https://github.com/openzfs/zfs/discussions/14793
1•sipofwater•30m ago•4 comments

Will AI Choke Off the Supply of Knowledge?

https://www.wsj.com/tech/ai/will-ai-choke-off-the-supply-of-knowledge-8a71cbcd
2•throw0101a•34m ago•1 comments

Source Cooperative

https://source.coop/
1•marklit•35m ago•0 comments

Ask HN: What program is running on this 1996 laptop?

1•fcpguru•37m ago•0 comments

Tor VPN Beta (Android)

https://play.google.com/store/apps/details?id=org.torproject.vpn&hl=en_US
2•HelloUsername•38m ago•0 comments

14 Killed in protests in Nepal over social media ban

https://www.tribuneindia.com/news/world/massive-protests-in-nepal-over-social-media-ban/
63•whatsupdog•39m ago•17 comments

Ask HN: Would Windows users want a native multi-model AI client?

1•120-dev•39m ago•0 comments

The Dropshipping Problem: Youth Digital Marketing Gone Wrong

2•haebom•43m ago•0 comments