news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The eighth-generation TPU: An architecture deep dive

https://cloud.google.com/blog/products/compute/tpu-8t-and-tpu-8i-technical-deep-dive

46•meetpateltech•1h ago

Comments

zshn25•47m ago

Splitting TPUs into dedicated training vs inference chips feels like an admission that the bottleneck has shifted from FLOPs to memory bandwidth + latency. Are future gains to come more from memory/system design than raw compute scaling? What’s that saying about Scaling laws?

xnx•40m ago

> Splitting TPUs into dedicated training vs inference chips feels like an admission that the bottleneck has shifted from FLOPs to memory bandwidth + latency.

With the expected scale of inference, it makes cost sense to make dedicated hardware for each task if the workloads are even slightly different. Probably similar to the video decoding chips in TVs not being very cheap/efficient compared to chips capable of encoding video.

sdenton4•12m ago

I think the first two paragraphs of the post are exactly saying that the bottleneck is memory... Long contexts, bigger but less flop-intensive models (moe's).

The funny thing about scaling laws is that as soon as they were known, the whole objective became learning how to break them - bending the curve, at least. They provided an incredibly useful target, but 'law' was a bit too strong a word.

ricardo81•11m ago

dupe https://news.ycombinator.com/item?id=47862497

QuantumNomad_•8m ago

They are different blog posts, written by different people at Google

ttul•3m ago

No matter how smart your large language model is, if you can’t find the energy to power it, it won’t run. I could imagine Google winning merely because their chips are more efficient. Of course, the other labs are capable of making chips, but Google has been doing it for years.

How to Open Source and Not Starve

https://hajo.me/blog/2026/04/22/how-to-open-source-and-not-starve/

1•fxtentacle•1m ago•0 comments

The handmade beauty of Machine Age data visualizations

https://resobscura.substack.com/p/the-handmade-beauty-of-machine-age

1•benbreen•2m ago•0 comments

You lose words on the tip of your tongue (2020)

https://www.bbc.com/future/article/20201125-on-the-tip-of-your-tongue-is-it-a-sign-of-a-bad-memory

1•stephen-hill•2m ago•0 comments

Reverse-engineering a supply chain attack delivered via fake Web3 job interview

https://www.reymom.xyz/blog/security/2026-04-15-supply-chain-attack

1•reymon-dev•2m ago•0 comments

Everything I know about floppy disks (2023)

https://thejpster.org.uk/blog/blog-2023-08-28/

1•stephen-hill•3m ago•0 comments

Build It Yourself (2025)

https://lucumr.pocoo.org/2025/1/24/build-it-yourself/

1•stephen-hill•3m ago•0 comments

AI fact-checker with guardrail classifier and MCP server

https://fact-check-analyzer.vercel.app/

1•amahadeven•4m ago•0 comments

How Skopx Learns Your Business While You Work

https://skopx.com/resources/live-platform-business-context

1•skopx•5m ago•0 comments

Open Benchmark: Text Normalization in Commercial Streaming TTS Models

https://async-vocie-ai-text-to-speech-normalization-benchmark.static.hf.space/index.html

1•baghdasaryana•5m ago•0 comments

Push Notifications Can Betray Your Privacy (and What to Do About It)

https://www.eff.org/deeplinks/2026/04/how-push-notifications-can-betray-your-privacy-and-what-do-...

1•u1hcw9nx•7m ago•0 comments

Don't read the PDF, write the parser

https://adriacidre.com/blog/self-healing-parsers-instead-of-vision/

1•kumulo•8m ago•1 comments

Context Bloat in AI Agents

https://glama.ai/blog/2025-12-16-what-is-context-bloat-in-mcp

1•OmShree0709•8m ago•0 comments

Linus Torvalds on AI code review: Anybody who thinks all AI is slop is in denial

https://lore.kernel.org/intel-gfx/CAHk-=wi_drr4Ls9KtXW1k8L2FUDF0YdnyjvKmPgLXHDFnnRWEg@mail.gmail....

4•victordw•8m ago•1 comments

A record-setting 31.4 Tbps attack caps a year of DDoS assaults

https://blog.cloudflare.com/ddos-threat-report-2025-q4/

1•theorchid•8m ago•0 comments

Tim Cook to Be Replaced by Near-Identical,More Expensive CEO with a Nicer Camera

https://unsourcednews.com/tim-cook-to-be-replaced-by-near-identical-more-expensive-ceo-with-a-nic...

2•01-_-•9m ago•0 comments

Show HN: CatchAll – slowest web search API that outperforms everything on recall

https://platform.newscatcherapi.com/catchall/try

4•artembugara•9m ago•1 comments

TurboOCR: CUDA and TensorRT OCR Server at 270 img/s

https://github.com/aiptimizer/TurboOCR

1•pfdomizer•9m ago•0 comments

Show HN: Ohita – a tool to simplify API key management for AI agents

https://ohita.tech/

1•jusasiiv•9m ago•0 comments

Statutory Copyleft

https://www.thomas-huehn.com/statutory-copyleft/

1•Brajeshwar•10m ago•0 comments

Google puts AI agents at heart of its enterprise money-making push

https://www.reuters.com/business/google-puts-ai-agents-heart-its-enterprise-money-making-push-202...

1•tartoran•10m ago•0 comments

Show HN: Sift – a minimal news app (looking for UI/UX feedback)

https://apps.apple.com/us/app/sift-curated-news/id6761124682

1•Roshan_Roy•10m ago•0 comments

DOJ charges SPLC with fraud for paying white supremacist groups $3M

https://nypost.com/2026/04/21/us-news/doj-charges-southern-poverty-law-center-with-fraud-for-payi...

1•anonymousiam•11m ago•0 comments

Show HN: Stonks-CLI – track your investment portfolio from your terminal

https://github.com/igoropaniuk/stonks-cli

1•friedchocolate•13m ago•0 comments

I spent 20 years building an AI agent engine, and what v6 got right

https://labsai.medium.com/why-i-spent-20-years-building-an-ai-agent-engine-and-what-version-6-fin...

1•ginccc•15m ago•0 comments

UK lawmakers approve lifetime smoking ban for today's under-18s

https://www.reuters.com/business/healthcare-pharmaceuticals/uk-lawmakers-approve-lifetime-smoking...

1•tartoran•16m ago•0 comments

Show HN: API Ingest – Agentic Search in API Docs

https://github.com/mohidbt/api-ingest

1•mohidbutt•17m ago•0 comments

Show HN: An MCP server that fact-checks AI bug diagnoses against AST evidence

https://github.com/EruditeCoder108/unravelai

2•EruditeCoder108•20m ago•0 comments

Prinesh Where R U?

1•triple_t•21m ago•0 comments

Inko 0.20.0: reducing heap allocations by 50%

https://inko-lang.org/news/inko-0-20-0-reducing-heap-allocations-by-50/

1•YorickPeterse•22m ago•0 comments

Probing the Planck scale with quantum computation

https://arxiv.org/abs/2604.06322

1•Tyyps•23m ago•0 comments