frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Dual RTX 5060 Ti 16GB vs. RTX 3090 for Local LLMs

https://www.hardware-corner.net/guides/dual-rtx-5060-ti-16gb-vs-rtx-3090-llm/
11•pietrushnic•13h ago

Comments

supermatt•9h ago
What is the difference like with batching?

It seems all these tests only compare a single prompt at a time, which is just going to be throttled by memory bandwidth (faster on 3090) and clock speed (faster on 5060) for the most part.

The 3090 has almost 3x the cores of a 5060, so I’m guessing it will absolutely wipe the floor with the dual 5060 setup for batched inference - which is increasingly essential for agentic workflows and complex tool use.

Havoc•8h ago
One substantial downside is other uses. e.g. I also use my desktop for gaming. And a 3090 beats a 5060 easily on that. By a sizable margin - ~33% on some games

Not sure I'd trade more LLM vram for that.

esafak•7h ago
Reading this gave me flashbacks to the 80s, when tinkerers tried to move utilities into the upper- and extended memory area to free up precious conventional memory, 640KB of which we were told ought to have been "enough for anyone". All this because we were saddled with a 16-bit OS. This is not an LLM problem -- 32GB of memory is peanuts in 2025 -- this is an Intel and AMD problem.
zamadatix•6h ago
As the article highlights the problem is really twofold. You need enough VRAM to load the model at all but there also needs to be enough bandwidth that accessing all of that memory is fast enough to be worthwhile. It'd be "easy" to slap 2 TB of "slow" DDR5 onto a GPU but it wouldn't perform much better than a high core count CPU running LLMs with the same memory.
omneity•6h ago
I am not entirely surprised by the relative equivalence for the sparse model. The combined bandwidth of 2x 5060 Ti ≃ 1x 3090. There are inefficiencies in multi-gpus that are more negligible at smaller dimensions, hence why the dense 32B model performs significantly worse on the dual 5060 setup.

For reference I am getting ~40 output tok/s on a 4090 (450W) with Qwen3 32B and a context window of 4096.

> Ultimately, as the user note aptly put it, the decision largely boils down to how much context you anticipate using regularly.

Hah. (emphasis mine)

Scroll-Driven Camera Animation

https://garden.bradwoods.io/notes/javascript/three-js/scroll-driven-camera-animation
1•surprisetalk•1m ago•0 comments

Animate a mesh across a sphere's surface

https://garden.bradwoods.io/notes/javascript/three-js/animate-a-mesh-on-a-spheres-surface
1•surprisetalk•1m ago•0 comments

The Agentic Systems Series

https://gerred.github.io/building-an-agentic-system/
1•ghuntley•2m ago•0 comments

Catalyzing a Golden Age: A Blueprint for Strategic AI R&D Investment

https://ifp.org/catalyzing-a-golden-age/
1•surprisetalk•2m ago•0 comments

How should we think about AI welfare? (Joe Carlsmith) [video]

https://www.youtube.com/watch?v=N5pinDL1zbI
1•surprisetalk•3m ago•0 comments

The stakes of AI moral status

https://joecarlsmith.com/2025/05/21/the-stakes-of-ai-moral-status/
1•surprisetalk•4m ago•0 comments

Evidence Studio: AI-powered BI-as-Code Platform

https://evidence.dev/blog/evidence-studio
1•hughess•4m ago•0 comments

Boltz-2 for predicting ligand/protein binding affinity

https://www.rxrx.ai/boltz-2
1•slyrus•4m ago•0 comments

Reference Works for Every Subject

https://www.lesswrong.com/posts/HLJMyd4ncE3kvjwhe/the-best-reference-works-for-every-subject
1•surprisetalk•4m ago•0 comments

Japanese researchers develop transparent paper as alternative to plastics

https://japannews.yomiuri.co.jp/science-nature/technology/20250605-259501/
1•anigbrowl•5m ago•0 comments

Holo1: Cost-Efficient Web Agent Powered by Open Weights

https://arxiv.org/abs/2506.02865
2•marc-thibault•11m ago•0 comments

Show HN: Real-Time Trade Alerts from Trump's Truth Social Posts

https://www.tac.ooo
1•hoerzu•12m ago•1 comments

600 Miles from the North Pole on a boat. My Starlink Mini is at 171 mbit/s

https://old.reddit.com/r/Starlink/comments/1l0im21/currently_about_600_miles_from_the_north_pole_on/
4•tosh•14m ago•0 comments

Resistance to Immunity (2019)

https://www.nybooks.com/articles/2019/05/23/anti-vax-resistance-immunity/
1•NaOH•23m ago•1 comments

Adventures in Babysitting Coding Agents

https://changelog.com/friends/96
2•ghuntley•24m ago•0 comments

A glance at the Rust compiler team operations

https://blog.rust-lang.org/inside-rust/2025/06/05/a-glance-at-the-team-compiler-operations/
3•andrewstetsenko•24m ago•0 comments

Professional Decline (The Atlantic)

https://www.theatlantic.com/magazine/archive/2019/07/work-peak-professional-decline/590650/
2•highfrequency•25m ago•0 comments

Dual-Engine Serverless SQL Lakehouse

https://rvernica.github.io/2025/06/ducklake-with-postgres
1•inrev•25m ago•1 comments

Fowl Forward over Wormhole, Locally

https://github.com/meejah/fowl
1•rahimnathwani•28m ago•0 comments

'Proof' Review: Finding Truth in Numbers

https://www.wsj.com/arts-culture/books/proof-review-finding-truth-in-numbers-b9779228
1•Hooke•31m ago•0 comments

Sync engine's best friend: fine-grained rendering

https://www.youtube.com/watch?v=YQT26cnCKqo
1•devagr•32m ago•0 comments

Supreme Court allows DOGE to access social security data

https://www.nbcnews.com/politics/supreme-court/supreme-court-trump-doge-social-security-data-access-elon-musk-rcna206515
11•anigbrowl•33m ago•1 comments

Neutral WordPress package manager launches at the Linux Foundation

https://www.fastcompany.com/91347003/wordpress-veterans-launch-fair-project-to-tackle-security-and-control-concerns
2•ke4qqq•38m ago•0 comments

Roman Elementary Mathematics: The Operations

https://penelope.uchicago.edu/Thayer/E/Journals/CJ/47/2/Roman_Elementary_Mathematics%2A.html
2•rawgabbit•41m ago•0 comments

Examples of linkedSignal() usage in Angular applications

https://medium.com/@eugeniyoz/examples-of-linkedsignal-usage-in-angular-applications-415fcd5e243a
1•EugeneOZ•42m ago•0 comments

New gene therapy can target airway and lungs via nasal spray

https://medicalxpress.com/news/2025-05-gene-therapy-airway-lungs-nasal.html
1•PaulHoule•47m ago•0 comments

Show HN: Asteroid Impact Probability Tool

https://b612foundation.org/asteroid-institute-launch-of-adamimpact-probability-demo-to-analyze-and-visualize-future-impact-risk/
1•dremy•48m ago•0 comments

The Rise of Marketing Speak

https://sustainableviews.substack.com/p/the-rise-of-marketing-speak
2•spyckie2•49m ago•0 comments

Semi-Sync Meetings: Stop Wasting Our Time

https://lukebechtel.com/blog/semi-sync-meetings-stop-wasting-our-time
1•marviel•49m ago•0 comments

Higher Order Continuity for Smooth As-Rigid-as-Possible Shape Modeling

https://jcgt.org/published/0014/01/10/
1•ibobev•50m ago•0 comments