TinyTinyTPU: 2×2 systolic-array TPU-style matrix-multiply unit deployed on FPGA

https://github.com/Alanma23/tinytinyTPU-co
41•Xenograph•2h ago
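
For readers unfamiliar with the term in the title, here is a minimal software sketch of how a 2×2 systolic array can multiply two matrices. It is not taken from the linked repository: the output-stationary dataflow, the function name `systolic_matmul_2x2`, and the register layout are all assumptions made for illustration, and real designs (including TPU-style units) may use a different dataflow such as weight-stationary.

```python
def systolic_matmul_2x2(a, b):
    """Simulate a hypothetical output-stationary 2x2 systolic array computing a @ b."""
    n = 2
    acc = [[0] * n for _ in range(n)]    # one accumulator per processing element (PE)
    a_reg = [[0] * n for _ in range(n)]  # value each PE latched from its left neighbour
    b_reg = [[0] * n for _ in range(n)]  # value each PE latched from its upper neighbour

    # With skewed inputs, the last PE sees its final operands on cycle 3n - 3,
    # so 3n - 2 cycles are enough.
    for t in range(3 * n - 2):
        new_a = [[0] * n for _ in range(n)]
        new_b = [[0] * n for _ in range(n)]
        for i in range(n):
            for j in range(n):
                if j == 0:
                    # Left edge: row i of A enters delayed by i cycles.
                    k = t - i
                    a_in = a[i][k] if 0 <= k < n else 0
                else:
                    a_in = a_reg[i][j - 1]
                if i == 0:
                    # Top edge: column j of B enters delayed by j cycles.
                    k = t - j
                    b_in = b[k][j] if 0 <= k < n else 0
                else:
                    b_in = b_reg[i - 1][j]
                acc[i][j] += a_in * b_in  # multiply-accumulate inside the PE
                new_a[i][j] = a_in        # pass A operand to the right next cycle
                new_b[i][j] = b_in        # pass B operand downward next cycle
        a_reg, b_reg = new_a, new_b
    return acc


result = systolic_matmul_2x2([[1, 2], [3, 4]], [[5, 6], [7, 8]])
```

Each PE only ever talks to its immediate neighbours, which is the property that makes this layout attractive in hardware.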

Comments

hinkley•1h ago
I think I could trust AI more if we used it as a heuristic for expensive deterministic processes. Sort of a cross between Bloom filters and speculative execution. Estimate the odds that expensive operation 1 will indicate that expensive operation 2 needs to happen, and then start expensive operation 2 while we determine whether it's actually needed. If it's right 95% of the time, which is the sort of range AI can aspire to, that's skipping the high-latency task chaining 19 times out of 20, which would be pretty good.
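
The arithmetic in this comment can be sketched roughly as follows. The model is deliberately simplified and rests on assumptions not stated in the comment: operation 2 is always eventually needed, a correct prediction lets both operations fully overlap, a wrong prediction falls back to running them back to back, and wasted speculative work costs nothing. The function name `expected_latency` is hypothetical.

```python
def expected_latency(t1, t2, hit_rate):
    """Expected end-to-end latency when operation 2 is started speculatively.

    t1, t2   -- latencies of the two expensive operations (same unit)
    hit_rate -- fraction of cases where the predictor correctly starts operation 2 early
    """
    overlapped = max(t1, t2)  # correct guess: both operations run in parallel
    chained = t1 + t2         # wrong guess: operation 2 starts only after operation 1
    return hit_rate * overlapped + (1.0 - hit_rate) * chained


# Two operations of 100 ms each: always chaining costs 200 ms,
# while a 95% hit rate brings the expected latency to roughly 105 ms.
baseline = expected_latency(100, 100, 0.0)
speculative = expected_latency(100, 100, 0.95)
```

Under these assumptions the benefit scales with how similar the two latencies are; if one operation dominates, overlapping saves comparatively little.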
rjsw•1h ago
There have been comments that some leading AI researchers were switching away from working on language models to do stuff with "real world data".
hnuser123456•44m ago
There are Bayesian neural networks that can apparently track probability rather than just, e.g., randomly selecting one output from the top-k based on probability, but I'm still reading up on them myself. It sounds like they're not normally combined with language models.
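
For context, the top-k selection mentioned here can be sketched in a few lines. This is a generic illustration, not any particular model's decoder: the function name `top_k_sample` and the plain-list `logits` input are assumptions, and it does not attempt to show a Bayesian neural network.

```python
import math
import random


def top_k_sample(logits, k, rng=random):
    """Pick one index from the k highest-scoring entries, weighted by softmax probability."""
    # Indices of the k largest logits.
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    # Softmax restricted to those k entries (shifted by the max for numerical stability).
    m = max(logits[i] for i in top)
    weights = [math.exp(logits[i] - m) for i in top]
    return rng.choices(top, weights=weights, k=1)[0]


rng = random.Random(0)  # seeded so repeated runs draw the same samples
samples = [top_k_sample([2.0, 0.5, 1.5, -1.0], k=2, rng=rng) for _ in range(100)]
```

With `k=2` only indices 0 and 2 can ever be returned; the sampler reports a choice, not a calibrated uncertainty about it.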
aunty_helen•1h ago
I think it’s only a matter of time before we see ASIC vendors making TPU devices. The same thing happened with BTC: there was enough money there to spawn an industry. Nvidia's 70% margins are too hard to ignore. And if playing on the open market seems too rough, there’s always acquisition potential, like what happened to Groq.
NitpickLawyer•56m ago
Aren't high end accelerators already closer to ASICs than to og GPUs, tho?
fooblaster•1h ago
Great! How do you program it?
ph4evers•22m ago
Such a cool project! Next one is to run jaxprs via the driver?
mrinterweb•11m ago
I've been wondering when we will see general-purpose consumer FPGAs, and eventually ASICs, for inference. This reminds me of bitcoin mining. Bitcoin mining started with GPUs. I think I remember a brief FPGA period that transitioned to ASICs. My limited understanding of Google's tensor processing unit chips is that they are effectively a transformer ASIC. That's likely a wild oversimplification of Google's TPU, but Gemini is proof that GPUs are not needed for inference.

I suspect GPU inference will come to an end soon, as it will likely be wildly inefficient compared to purpose-built transformer chips. All those Nvidia GPU-based servers may become obsolete should transformer ASICs become mainstream. GPU bitcoin mining is just an absolute waste of money (cost of electricity) now. I believe the same will be true for GPU-based inference soon. The hundreds of billions of dollars being invested in GPU-based inference seem like an extremely risky bet that transformer ASICs won't happen, although Google has already widely deployed its own TPUs.

babl-yc•8m ago
This is cool. I'm observing a trend of "build a tiny version from the ground up to understand it" à la Karpathy's micrograd/minGPT. Seems like one of the best ways to learn.

"Inspector Dangerfuck", ANSI art comic from 1994

https://breakintochat.com/blog/2025/12/31/ansi-art-and-webcomics-part-3-eerie-and-inspector-dange...
1•Kirkman14•55s ago•1 comment

TimescaleDB to ClickHouse replication: Use cases, features, and how we built it

https://clickhouse.com/blog/timescale-to-clickhouse-clickpipe-cdc
1•saisrirampur•1m ago•0 comments

Exposure to Multiple Fine Particulate Matter Components and Incident Depression

https://jamanetwork.com/journals/jamanetworkopen/fullarticle/2843119
1•wjb3•1m ago•1 comment

Qsp: A simple S-Expression parser for Rust TokenStreams

https://github.com/KnorrFG/qsp
1•PaulHoule•5m ago•0 comments

Grok says safeguard lapses led to images of 'minors in minimal clothing' on X

https://www.reuters.com/legal/litigation/grok-says-safeguard-lapses-led-images-minors-minimal-clo...
1•erhuve•6m ago•0 comments

Replace Your Standup with a Todo List

https://www.skeptrune.com/posts/todolist-standup/
1•skeptrune•8m ago•0 comments

Malleable Systems Collective – Collective digest – 2025

https://malleable.systems/blog/2025/12/27/collective-digest-2025/
1•gjvc•8m ago•0 comments

Nearly Half of Americans Read Zero Books in 2025

https://dailycitizen.focusonthefamily.com/nearly-half-of-americans-read-zero-books-in-2025/
1•bookofjoe•9m ago•1 comment

Fixing a Buffer Overflow in Unix v4 Like It's 1973

https://sigma-star.at/blog/2025/12/unix-v4-buffer-overflow/
3•gdgghhhhh•10m ago•0 comments

Oldest known cremation in Africa – mystery about Stone Age hunter-gatherers

https://theconversation.com/oldest-known-cremation-in-africa-poses-9-500-year-old-mystery-about-s...
1•olvy0•11m ago•0 comments

Fujiwhara Effect

https://en.wikipedia.org/wiki/Fujiwhara_effect
1•wjb3•12m ago•0 comments

A different way to think about Python API Clients

https://paulwrites.software/articles/python-api-clients
1•paulhallett•15m ago•0 comments

2025 Starlink Progress Report

https://starlink.com/progress
1•0xedb•20m ago•0 comments

Weaponized (teeny tiny) black holes

https://joshchamot.substack.com/p/weaponized-teeny-tiny-black-holes
4•petethomas•23m ago•1 comment

Selling theoretical frameworks to enterprises (€50K-€300K licensing)

1•Boiindil•23m ago•1 comment

Ask HN: Seeking 3rd‑Party Permission (Smoke Tests) – Legal and Ethical Guidance

1•ohitsujiza•24m ago•0 comments

PowRSS: Discover the Indie Web

https://powrss.com/
2•subdavis•26m ago•0 comments

Show HN: A free, no-signup invoice generator for one-off invoices

https://the-invoice.app/
1•block_hacks•26m ago•1 comment

Ex-Samsung engineer accused of giving 10nm DRAM process data to China's CXMT

https://www.tomshardware.com/pc-components/dram/samsung-engineer-accused-of-leaking-10nm-dram-pro...
4•walterbell•31m ago•0 comments

Show HN: I built a Netflix-style link-in-bio because link lists felt dead

https://www.linklynx.bio/
1•rafaelvalle03•33m ago•0 comments

Chinese memory maker CXMT prepares $4.2B USD IPO as DRAM demand skyrockets

https://www.tomshardware.com/pc-components/dram/chinese-memory-maker-cxmt-prepares-to-file-for-ip...
7•walterbell•35m ago•0 comments

Alaska Wolf Found with Record Amount of Mercury, a Sign of Growing Contamination

https://e360.yale.edu/digest/alaska-mercury-wildlife
3•speckx•37m ago•0 comments

Building AI agents with just bash and a filesystem in TypeScript

https://turso.tech/blog/agentfs-just-bash
1•penberg•38m ago•0 comments

Show HN: Local-first computational notebook for everyday life

https://inkblots.app
1•paulrusso•39m ago•0 comments

2026 Predictions for Art, Science, and Tech from my 2 yo podcast

https://www.youtube.com/watch?v=uvXuWshY0ps
1•andrewjneumann•40m ago•2 comments

Show HN: Sk` – manage AI agent skills across Claude, codex, opencode, et al.

https://github.com/803/skills-supply
1•alizainf•40m ago•0 comments

Star Wars Racer Revenge game is key to jailbreaking PlayStation 5

https://www.tomshardware.com/video-games/playstation/forgotten-star-wars-racer-revenge-game-is-ke...
2•canucker2016•42m ago•0 comments

Show HN: ExpiryGuard – track expiring certs and API keys

https://github.com/sanjayselvaraj/expiryguard
1•sanjayselvaraj•42m ago•0 comments

Reasons to Love the Field of Programming Languages

https://danilafe.com/blog/i_love_programming_languages/
2•birdculture•44m ago•0 comments

MSI teases new PSU with 'instant protection' against melting RTX 5090 cables

https://www.tomshardware.com/pc-components/power-supplies/msi-teases-new-power-supplies-with-inst...
2•canucker2016•46m ago•1 comment