frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Dav2d

https://jbkempf.com/blog/2026/dav2d/
237•captain_bender•3h ago•66 comments

Cloudflare Turnstile requiring fingerprintable WebGL

https://hacktivis.me/articles/cloudflare-turnstile-webgl-fingerprinting
51•HypnoticOcelot•48m ago•15 comments

The Website Specification

https://specification.website/
311•k1m•7h ago•130 comments

London's Free Roof Terraces

https://diamondgeezer.blogspot.com/2026/05/londons-free-roof-terraces.html
174•zeristor•7h ago•82 comments

I Put a Datacenter GPU in My Gaming PC for £200

https://blog.tymscar.com/posts/v100localllm/
66•birdculture•1h ago•34 comments

Domain expertise has always been the real moat

https://www.brethorsting.com/blog/2026/05/domain-expertise-has-always-been-the-real-moat/
733•aaronbrethorst•18h ago•431 comments

Security Envelope Pattern collection – S.E.C.R.E.T

https://secret-archive.org/
48•ColinWright•2d ago•5 comments

The people who actually want AI to replace humanity

https://www.vox.com/future-perfect/489976/ai-successionism-transhumanism-posthumanism
27•plastic-enjoyer•41m ago•16 comments

Shantell Sans (2023)

https://shantellsans.com/process
328•aleda145•16h ago•40 comments

One year of Roto, a compiled scripting language for Rust

https://blog.nlnetlabs.nl/one-year-of-roto-the-compiled-scripting-language-for-rust/
79•Hasnep•2d ago•17 comments

What it's like to have your insulin pump die while you're on vacation

https://blog.lauramichet.com/what-its-like-to-have-the-machine-that-keeps-you-alive-die-while-you...
39•speckx•2d ago•36 comments

The AV2 Video Standard Has Released (Final v1.0 Specification)

https://av2.aomedia.org
279•ksec•17h ago•123 comments

A Gentle Introduction to Lattice-Based Cryptography [pdf]

https://cryptography101.ca/wp-content/uploads/lattice-based-cryptography.pdf
124•jayhoon•2d ago•8 comments

I found a seashell in the middle of the desert

https://github.com/Hawzen/I-found-a-seashell-in-the-middle-of-the-desert#i-found-a-seashell-in-th...
369•Hawzen•2d ago•99 comments

Avian Visitors

https://theodore.net/projects/AvianVisitors/
83•fdb•8h ago•7 comments

Show HN: Breathe CLI – Paced resonance breathing in the macOS terminal

https://github.com/marekkowalczyk/breathe-cli
76•marekkowalczyk•18h ago•10 comments

Inkstravaganza

https://www.inkandswitch.com/newsletter/dispatch-015/
10•surprisetalk•3d ago•2 comments

You Weren't Meant to Have a Boss (2008)

https://paulgraham.com/boss.html
22•downbad_•2h ago•1 comments

Show HN: Atomic Editor – Obsidian-style live preview for CodeMirror 6

https://kenforthewin.github.io/atomic-editor/
5•kenforthewin•2h ago•0 comments

Telli (YC F24) is hiring in engineering, design, and GTM [Berlin, on-site]

https://hi.telli.com/join-us
1•sebselassie•8h ago

A pictorial introduction to differential geometry (2017)

https://arxiv.org/abs/1709.08492
84•ricudis•9h ago•2 comments

Racket v9.2

https://blog.racket-lang.org/2026/05/racket-v9-2.html
204•spdegabrielle•3d ago•19 comments

Associative learning turns DEET from aversive to appetitive in Aedes aegypti

https://journals.biologists.com/jeb/article/229/10/jeb251935/371741/Associative-learning-switches...
54•croes•2d ago•24 comments

Accenture to acquire Ookla

https://newsroom.accenture.com/news/2026/accenture-to-acquire-ookla-to-strengthen-network-intelli...
308•Garbage•22h ago•155 comments

Mechanical Pencil: An illustrated celebration of the engineering around us

https://mechanical-pencil.com/
136•Muhammad523•14h ago•17 comments

Microsoft Office 2019 and 2021 for Mac view-only conversion

https://consumerrights.wiki/w/Microsoft_Office_2019_and_2021_for_Mac_view-only_conversion_(2026)
937•antipurist•15h ago•331 comments

Openrsync: An implementation of rsync, by the OpenBSD team

https://github.com/kristapsdz/openrsync
447•sph•1d ago•167 comments

Voxel Space (2017)

https://s-macke.github.io/VoxelSpace/
294•davikr•1d ago•63 comments

Zig ELF Linker Improvements Devlog

https://ziglang.org/devlog/2026/#2026-05-30
219•kristoff_it•21h ago•88 comments

Pandoc Templates

https://pandoc-templates.org/
423•ankitg12•1d ago•55 comments
Open in hackernews

I Put a Datacenter GPU in My Gaming PC for £200

https://blog.tymscar.com/posts/v100localllm/
66•birdculture•1h ago

Comments

lucamark•39m ago
Congrats! Most people won’t want to debug drivers, kernels, ACPI, adapters, and fan headers. But for those who do, the capability-per-pound is absurd.
lelanthran•38m ago
> The compute is still real. The VRAM is still real. And the memory bandwidth is where it gets genuinely surprising.

Because humans write exactly like this /s

postalrat•32m ago
Where do you think llms learned to write that way?
alehlopeh•18m ago
Marketing content.
jlund-molfese•15m ago
You can also look at past posts by the same author (before LLM usage proliferated) if you’re curious.

The project is still very cool, but it’s a little less enjoyable to read when everything sounds the same. It would be just as annoying for people to manually write in a corporate/marketing style, because humanity is what makes the small web interesting.

https://blog.tymscar.com/posts/privategithubcicd/

lelanthran•14m ago
> Where do you think llms learned to write that way?

Not from individual human content, that's for sure - maybe MLM marketing copy? Sleazy 4AM ads?

I mean, every time this response comes up, I keep asking the person to point at something written prior to 2022 that gets 80%+ on the LLM detectors, and yet no one can find anything.

Maybe you, postalrat, can find something written in this style that was published prior to 2022.

hattmall•2m ago
It's a function of the LLM "thought process"! It's not really modeled after human speech. It is in short segments but not long form, same reason you see the same rather odd nuances in LLM generated code.

If they way you thought was to run a bunch of if statements, generate content, then feed that content back to get a "score" of what seems the most plausible, run the if statements again, and adjust / merge responses, then you would write similarly. The recognizable cadence of LLM generated content is pretty clearly the result of a lot of if statements being fused together.

tgv•8m ago
Because their custom training data contains an emphasis on such verbiage. It doesn't come from the God-knows-how-many TB of web content the model is pre-trained on. There, such phrasing is only a drop in the sea. But the "yes, you're right" phrases, the em dash, etc., come from the later stage, for which content is created according to some (probably overprecise) guidelines.
bossyTeacher•23m ago
X is Y. Z is Y. And Alpha is genuinely Beta.

Classic LLM writing style.

driverdan•16m ago
There's interesting stuff in this writeup but it sure seems like most of it was written by an LLM.
matja•35m ago
The AMD MI250X GPUs are also interesting - 128GB of HBM2E at 3TB/s, sometimes you see them second-hand for under $1k, the catch obviously is that it needs an OAM socket. Never seen an easy way to hook them up to a regular mainboard.
Teknomadix•19m ago
These are interesting, and offer beefy through put. No point in adapting to a PCI lane thought, stuck behind the slot-bus bottleneck.
knollimar•34m ago
A little bit of local copium but neat read.

Isn't a rasbpi with 16gb of RAM $300 now?

matja•23m ago
The latest Raspberry Pi 5 has one 32-bit channel (2x 16-bit subchannels) of LPDDR4X-4267 SDRAM giving 17.1GB/s of bandwidth, 52x less than this GPU. Never mind lacking the CUDA and Tensor cores, so the FP16 performance is 102x less (307 GFLOPS vs 31.4 TFLOPS). So for £200, there's absolutely no comparison for this specific use-case.
thejj100100•20m ago
I don't understand what point you're trying to make here? Are you talking about the price of RAM?
jmyeet•34m ago
Some context:

- In 2017, the v100 was a ~$10,000 GPU. I believe there was a PCI-e version but this is probably so cheap because SXM2 is going to be harder to use;

- A 5090 has 1800GB/s of internal memory bandwidth (compared to 900GB/s in the 9 year old GPU). Of course a 5090 is substantially more expensive;

- A 5090 has ~21k CUDA cores vs ~5k;

- The current $10k NVidia GPU is the RTX 6000 Pro w/ 96GB of VRAM. It has slightly more CUDA cores but it otherwise pretty much just a 5090. This is unsurprising. NVidia uses VRAM for market segmentation.

Consider this: in 5-10 years, the trillions spent on AI data centers will likewise be sold for scrap most likely. That's how short the runway is for OpenAI and Anthropic to recover that investment.

Anyway, I'm kind of impressed the author managed to get this all to work. I don't think it even would've occurred to me that someone had made an SXM2 adapter, particularly because it's not even used anymore. Like props to whoever did that.

b112•25m ago
I bet 3 years, but otherwise agree.
echelon•18m ago
> Consider this: in 5-10 years, the trillions spent on AI data centers will likewise be sold for scrap most likely. That's how short the runway is for OpenAI and Anthropic to recover that investment.

Even more interesting: it'll devalue all of SaaS and the entire US tech sector.

We might have just shot our most valuable non-AI tech products in the foot.

mondainx•33m ago
Great write-up, I've often considered these DC cards for a project and now you've convinced me to pick one up; you describe the price of the unit against what one spends on tokens and that does it for me.
casey2•31m ago
Some resell group is going to have to make this easier. The shear amount of these cards otherwise heading towards the landfill is staggering. That is if Big Tech don't destroy them to prevent model weights from leaking.
eric__cartman•26m ago
How would destroying the GPUs prevent the model weights from leaking? By the time you get your hands on them the memory is powered off for a long enough time that a cold-boot style attack is impossible.
Alifatisk•18m ago
> The shear amount of these cards otherwise heading towards the landfill is staggering.

The thought of throwing away working cards sounds so bizarre to me. I can't believe companies would dispose them into the landfill like that, it is at least worth giving away for refuse.

wookmaster•3m ago
There’s a long history of corporations doing evil things to ensure their business model succeeds
Teknomadix•22m ago
Tesla V100 SXM2 16GB is NOT DGX class as the author writes. It's HGX class. The V100 comes in two classes, SXM2 and SXM4, the latter coming with a Max of 80gb on board memory. Typically these are installed 8×A100 80GB SXM4 on an HGX riser, and what that gives you is NVSwitch fabric and 640GB of pooled HBM2e (on package stacked memory /w ~2 TB/s of memory bandwidth). 2u standard rack footprint too.
omarqureshi•21m ago
Could probably avoid the crazy fan with a waterblock - I've seen a whole kit, v100 + PCIE adapter + block for £235. Yes, you'll have to pay for pump, radiators and radiator fans, but that should really quieten it down
recursivegirth•19m ago
> The compute is still real. The VRAM is still real. And the memory bandwidth is where it gets genuinely surprising.

Had to stop there. Annoying. I can't stand AI use for writing. It makes any otherwise great article feel so disingenuous.

m0rde•17m ago
What a difficult world you must live in these days
peddling-brink•13m ago
While I don’t disagree with their sentiment, I’m far more annoyed with it than the AI writing.
m0rde•8m ago
Yeah. I get that many HN comments are just complaints (heck mine was too and just as negative and shaming). But how bad of a day must you be having to try to shame someone about how they choose to write up an experience they thought was neat. Whatever, free speech and all that. Hope OC's day gets better.
qingcharles•7m ago
Every single HN post has the same comment now.
rafram•5m ago
Only because so many of the articles posted on HN now are AI-written, and badly, too. A lot of tech people are so impressed with LLMs’ capabilities in code that they fail to recognize how bad they are at writing enjoyable prose. And it feels like a chore to write out a whole blog post by hand when the machine could do it for you! But the result we get is so, so much worse and more annoying.
mickeyp•17m ago
Impressive work. But the problem is not the 30 tok/s which is fine for agentic coding and chat.

It's prefill; slow prefill kills agentic workloads dead.

If you have 100,000 tokens at ~150tok/s per the OP, you're looking at:

    You have: 100000 / (150/s)

    You want: hms

     11 min + 6.6666667 sec
Which is quite a wait indeed.
Aurornis•8m ago
Most people won’t be dumping 100K tokens into it at once, but I agree that all of the prefill time that adds up during a session becomes a lot to account for.

This is also a problem for all of the Mac local LLMs. Macs are a great way to get a lot of high bandwidth memory, but their compute is very far behind current gen dedicated GPUs. Some of the expensive Mac Studio setups allow you to run very large models with usable tokens/s, but you can be waiting a long time for it to get to the point of generating those tokens.

whoamii•3m ago
The real question: did your local LLM write this post?