I think it's really interesting to look at how the GPU market is evolving. TensorPool [1] (no affiliation), for example, is a startup aiming to lower GPU inference costs.
There was some research on energy consumption a couple of years back [2], but after a brief search I haven't found anything more recent.
I'd be interested to hear the community's thoughts on energy costs and provisioning spend as usage grows over time.
[1] https://tensorpool.dev/
[2] GPT-4 energy consumption: https://www.sciencedirect.com/science/article/pii/S2542435123003653
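For scale, here's a minimal back-of-the-envelope sketch in Python; all the numbers are assumptions (roughly H100-class board power, average utilization, industrial electricity rate), not measurements:

    # Rough per-GPU electricity cost; every constant below is an assumption.
    board_watts = 700        # ~H100 SXM board power
    utilization = 0.7        # assumed average draw as a fraction of peak
    price_per_kwh = 0.10     # USD, assumed industrial rate

    hours_per_month = 24 * 30
    kwh_per_month = board_watts * utilization * hours_per_month / 1000
    cost_per_month = kwh_per_month * price_per_kwh
    print(f"{kwh_per_month:.0f} kWh/month -> ${cost_per_month:.2f}/GPU/month")
    # ~353 kWh/month -> ~$35/GPU/month, before cooling/PUE overhead

Even doubling that for datacenter overhead, electricity is small next to the amortized hardware cost of the GPU itself, which is part of why provisioning spend dominates these discussions.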
westurner•4h ago
Though, some GPUs include TPU-like hardware; Nvidia's DLSS 3, for example, runs on the Tensor Cores in RTX GPUs.
"A PCIe Coral TPU Finally Works on Raspberry Pi 5" (2023) https://news.ycombinator.com/item?id=38310063
"ARM adds neural accelerators to GPUs" (2025) https://news.ycombinator.com/item?id=44919793
From "The von Neumann bottleneck is impeding AI computing?" (2025) https://news.ycombinator.com/item?id=45398473 :
> How does Cerebras WSE-3 with 44GB of 'L2' on-chip SRAM compare to Google's TPUs, Tesla's TPUs, NorthPole, Groq LPU, Tenstorrent's, and AMD's NPU designs?
Tensor Processing Unit: https://en.wikipedia.org/wiki/Tensor_Processing_Unit
..
- "Ask HN: Are you paying electricity bills for your service?" (2024) https://news.ycombinator.com/item?id=42454547 re: Zero Water datacenters
- "Show HN: LangSpend – Track LLM costs by feature and customer (OpenAI/Anthropic)" (2025-10) https://news.ycombinator.com/item?id=45771618