CUDA-like programming of Cerebras WSE

https://github.com/greg1232/cerebras-py-sim
1•gdiamos•2h ago

Comments

gdiamos•2h ago
This weekend I was reading this paper (https://arxiv.org/html/2405.07898v1) on programming the Cerebras wafer scale engine. Data movement is the expensive part of computing, and some algorithms, like stencils, only require nearest-neighbor data movement per cycle. Cerebras wafers have very low-energy transfers between neighboring processing elements on the same wafer, so the authors came up with a language called Tungsten that focuses on this exchange primitive in the kernel programming model.
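To make the nearest-neighbor point concrete, here is a minimal sketch in plain Python of a 3-point averaging stencil on a 1D row of processing elements. It is not Tungsten and not Cerebras code; the function name `stencil_step` and the choice to have edge elements reuse their own value are my assumptions for illustration only.

```python
def stencil_step(values):
    """One step of a 3-point averaging stencil on a 1D row of PEs.

    Hypothetical illustration: each list entry stands for the single value
    held by one processing element (PE).
    """
    n = len(values)
    # Exchange phase: every PE receives the value of its left and right
    # neighbor. Edge PEs have no neighbor on one side, so (as an assumption)
    # they reuse their own value there.
    from_left = [values[i - 1] if i > 0 else values[i] for i in range(n)]
    from_right = [values[i + 1] if i < n - 1 else values[i] for i in range(n)]
    # Compute phase: each PE only touches local and just-received data.
    return [(from_left[i] + values[i] + from_right[i]) / 3.0 for i in range(n)]
```

No PE ever reads data from farther than one hop away, which is why this kind of algorithm fits a mesh so well.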

I thought the challenge of programming hundreds of thousands of cores using a mesh would be interesting, so I wrote a simulator, a simple compiler, and a few simple kernels for the wafer scale engine using publicly available documents.

I'm used to CUDA, so I asked: "How would you map something like CUDA onto a machine like this?" I use something like malloc to allocate global memory, memcpy to move data between host and device memory, and a queue of thread block launches. This time, though, threads within the same block communicate using nearest-neighbor send/recv instructions instead of through shared memory on a streaming multiprocessor. This is inspired by the stencils in Tungsten.

The whole program is made up of a bulk synchronous kernel of many thread blocks.
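As a rough sketch of that mapping (not the actual API of the linked repo, which I am not reproducing here), the host side could look something like the toy Python below. Every name in it (`MeshDevice`, `malloc`, `memcpy_to_device`, `memcpy_to_host`, `launch`, `synchronize`, `neighbor_sum_kernel`, `run_example`) is hypothetical, and the send/recv between neighbors is modeled as a simple list shift rather than real mesh traffic.

```python
from collections import deque


class MeshDevice:
    """Toy device (hypothetical): global memory plus a queue of block launches."""

    def __init__(self):
        self.global_memory = {}      # handle -> list of floats
        self.next_handle = 0
        self.launch_queue = deque()  # pending thread block launches

    def malloc(self, count):
        """Allocate `count` floats of global memory and return a handle."""
        handle = self.next_handle
        self.next_handle += 1
        self.global_memory[handle] = [0.0] * count
        return handle

    def memcpy_to_device(self, handle, host_data):
        """Copy host data into a device buffer of the same length."""
        buffer = self.global_memory[handle]
        if len(host_data) != len(buffer):
            raise ValueError("size mismatch between host data and device buffer")
        buffer[:] = list(host_data)

    def memcpy_to_host(self, handle):
        """Return a host-side copy of a device buffer."""
        return list(self.global_memory[handle])

    def launch(self, kernel, grid_size, block_size, *args):
        """Enqueue one launch per thread block; nothing runs yet."""
        for block_id in range(grid_size):
            self.launch_queue.append((kernel, block_id, block_size, args))

    def synchronize(self):
        """Drain the queue, running each thread block to completion."""
        while self.launch_queue:
            kernel, block_id, block_size, args = self.launch_queue.popleft()
            kernel(self, block_id, block_size, *args)


def neighbor_sum_kernel(device, block_id, block_size, src, dst):
    """Each thread adds its own element to the one received from its left neighbor."""
    base = block_id * block_size
    data = device.global_memory[src]
    out = device.global_memory[dst]
    # Load phase: thread t keeps one element in its PE-local storage.
    local = [data[base + t] for t in range(block_size)]
    # Exchange phase: thread t sends to thread t + 1 in the same block.
    # Thread 0 has no left neighbor inside the block and receives 0.0.
    received = [0.0] + local[:-1]
    # Compute phase: combine local and received values, write to global memory.
    for t in range(block_size):
        out[base + t] = local[t] + received[t]


def run_example():
    device = MeshDevice()
    src = device.malloc(6)
    dst = device.malloc(6)
    device.memcpy_to_device(src, [1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
    device.launch(neighbor_sum_kernel, 2, 3, src, dst)  # 2 blocks of 3 threads
    device.synchronize()
    return device.memcpy_to_host(dst)
```

The block size here is just a launch parameter, so the same kernel runs unchanged with one block of 6 threads or 6 blocks of 1 thread; only the block boundaries, where the exchange stops, move.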

I think it is interesting because CUDA has some hard limits on thread block sizes, but this mesh perspective lets you grow or shrink the blocks significantly.

Note that some information about Cerebras wafer scale engines, like the ISA, is not public (as far as I know). In this code, I just guessed what it could be.

So this should not be taken as a faithful or accurate simulation of the wafer scale engine. It is more like a point in the design space that is similar in that it includes a wafer-sized mesh of processing elements.

When Tailwinds Vanish

https://foundersfund.com/2023/06/when-tailwinds-vanish/
1•ronfriedhaber•59s ago•0 comments

Returning to Zig

https://gracefulliberty.com/articles/return-to-zig/
1•kristoff_it•1m ago•0 comments

Why Reddit blocked unauthenticated JSON in 2026

https://medium.com/@tonywangcn/why-reddit-blocked-unauthenticated-json-in-2026-and-how-to-still-g...
2•tonywangcn•3m ago•0 comments

You probably don't need private PKI for internal infrastructure

https://www.certkit.io/blog/private-pki-internal-infrastructure
1•plopilop•4m ago•0 comments

Mid-tier US knives vs. cheap imports

https://www.paragon-knives.com/
1•bgzlsxaz•5m ago•0 comments

T1A Brings Its Full Data Stack to Dais 2026: 5 Products And Subscription Giveaway

https://lakesentry.io/
1•tsyliya•9m ago•0 comments

Show HN: Fastembed-rs – Rust library for generating vector embeddings, reranking

https://github.com/Anush008/fastembed-rs
1•thoughtfullyso•13m ago•0 comments

Browser game about movie guessing

https://frameguesser.vercel.app/
1•mmschreiber•19m ago•0 comments

Nike Launches Sneaker Line with Russian Designer Who Backed Crimea Annexation

https://united24media.com/world/nike-launches-sneaker-line-with-russian-designer-who-backed-crime...
5•fodmap•20m ago•0 comments

Graphtatui: In terminal graph explorer made with ratatui

https://github.com/Sok205/graphtatui
2•sok205•22m ago•0 comments

Yield Curves and Volatility Surfaces Are Built in Modern Finance

https://medium.com/@DolphinDB_Inc/the-hidden-foundation-of-pricing-and-risk-how-ficc-curves-and-s...
2•Polly_Liu•23m ago•0 comments

CPMpy: Constraint Programming and Modeling library in Python, based on NumPy

https://github.com/CPMpy/cpmpy
1•tosh•24m ago•0 comments

Why Digital Twins Need Low-Latency Data Processing

https://medium.com/@DolphinDB_Inc/real-time-decision-making-how-ai-and-low-latency-computing-are-...
2•CrazyTomato•25m ago•0 comments

Githipedia – The Wiki for GitHub

https://github.com/Vendetaaaa/Githipedia
1•Vendeta•25m ago•1 comments

Show HN: I wrote a C++ ray tracer from scratch without AI

https://github.com/themartiano/luz
3•martiano•26m ago•1 comments

Ask HN: How do you handle browser tab overload?

1•formit34•26m ago•1 comments

Coversubstack-Zagreus=Whiterabbit.flexe

https://substack.com/@rootedinthought/note/c-276218913
1•dcmexpunksolar•30m ago•0 comments

LibAgar – Cross-platform GUI written in C

https://libagar.org/
2•0x0203•32m ago•0 comments

What are you looking for when reviewing LLM generated code?

2•gnunicorn•33m ago•1 comments

Ask HN: Developers, are you being forced into prompt-only engineering?

3•zerr•34m ago•0 comments

Fear about young adults' maturity is just a way of trying to control them (2023)

https://www.newstatesman.com/comment/2023/06/fear-about-young-adults-maturity-is-just-a-way-of-tr...
1•frereubu•34m ago•1 comments

Show HN: ELDC – Natural language identification, faster than FastText and CLD2

https://github.com/nitotm/eldc
2•nitotm•36m ago•0 comments

Anthropic flies staff to D.C. to clean up White House fight

https://www.axios.com/2026/06/14/anthropic-white-house-mythos-fable
5•dstala•37m ago•1 comments

Trailblazing investigative reporter Roger Cook dies

https://news.sky.com/story/trailblazing-investigative-reporter-roger-cook-dies-13554262
1•austinallegro•39m ago•0 comments

Searching for Guy Debord (2003)

https://brooklynrail.org/2003/10/express/searching-for-guy-debord/
1•robtherobber•40m ago•0 comments

Ask HN: Did you try Claude's "Fable 5" model before it was pulled?

3•aniokono•40m ago•1 comments

How to Think about Parallel Programming: Not (2010)

https://github.com/matthiasn/talk-transcripts/blob/master/Steele_Guy/ParallelProg.md
1•tosh•40m ago•0 comments

How Contaminants in drinking water are regulated by the EPA and states

https://www.ewg.org/news-insights/news/2025/03/how-contaminants-drinking-water-are-regulated-epa-...
2•num42•41m ago•0 comments

Complexity Models

https://en.algorithmica.org/hpc/complexity/
2•tosh•42m ago•0 comments

Tell HN: Claude is completely unusable for biology

4•Protostome•42m ago•0 comments