frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

GPT-OSS 120B Runs at 3000 tokens/sec on Cerebras

https://www.cerebras.ai/blog/openai-gpt-oss-120b-runs-fastest-on-cerebras
20•samspenc•9h ago

Comments

freak42•2h ago
I absolutely hate it, when a website says "try this" and after you went through the trouble of weiting something comes up with a sign up link first. Makes me leave instantly to never come back.
schappim•1h ago
I was doing a demo to my colleagues and had the above.
Alifatisk•1h ago
Same with groq.com, there is a "try this", and after you enter the prompt, it asks you to sign in. Closed the page.
traceroute66•1h ago
Headline at the top of the Cerebras page linked to by the OP "Cerebras Raises $1.1B Series G at $8.1B Valuation".

If you're going after the AI money gravy train then you need to wave the "we have $n registered users" carrot on your PPT slides for the investors because registered user == monetization opportunity.

I'm not defending it. I hate being forced to register for shit when I just want to try it or use the free tier.

But it is what it is.

Saline9515•1h ago
Well if they give it out for free (aka they pay for it), asking you to register is a reasonable ask. It's not a public service funded by taxpayers.
freak42•1h ago
Yes they can ask, but do it at the beginning not the end of the process, this is a dark pattern and fucking annoying.
magackame•30m ago
Anyone remember those online psychological tests where you spend an hour on one and in the end you need to pay up to get the result?)))
traceroute66•15m ago
> do it at the beginning not the end

Exactly this.

If you present me with a form and a submit button then I expect the input to go through and a result to be presented.

If you don't want to present me with results before login, then put the form behind the wall too.

Simple.

traceroute66•17m ago
> Well if they give it out for free (aka they pay for it), asking you to register is a reasonable ask

They have other options... rate limiting, serving (more) quantized to non-registered etc. etc.

anonym29•1h ago
This is like declaring that a Ferrari dealership offering you a free test drive in a million dollar art exhibit on wheels is evil for asking for your phone number before handing you the keys.

If this was some beat-to-hell, high-mileage used economy car, sure, that would be a pain in the ass, and not worth it. But it's a mistake to place Cerebras into that mental bucket.

You don't even need to use real information to create an account. Just grab a temp-mail disposable address and sign up as fred flintstone or mickey mouse.

If you're a heavy LLM inference user (i.e. if you've ever paid for a $200/mo sub from any of the big AI labs), I can damn near guarantee you will not regret trying out Cerebras.

freak42•20m ago
You didn't get my point at all.
rpdillon•6m ago
Would your expectations be more aligned if it's said "free trial"? That might create an expectation of a sign up where "try this" might not.
moralestapia•22m ago
Off topic but related.

A week ago I went to a launch party for a product that's supposed to "revolutionize design" (a web app w/ an OAI prompt).

No demo, only like two pictures of the actual product. Founder spent like half an hour giving a speech about the future, etc...

"All of you here will get access to it in a couple weeks."

Couple weeks go by ... I "get access". It's a .dmg, 1) What, I open it, it's not even an app, it's an installer ..., I install it, the app opens up and it's a giant red button that takes you to a website to create an account ...

These guys are completely lost.

petesergeant•1h ago
It’s an absolute beast. I run it via OpenRouter, where I have Groq and Cerebras as the providers. Cheap enough as to be almost free, strong performance, and lightning fast.
jsheard•1h ago
Cheap enough for now, but of all the companies selling inference at a loss, Cerebras and Groq are probably losing the most per token. Their hardware is ungodly expensive and its reliance on huge amounts of SRAM bottlenecks how much cheaper it can get, since SRAM density is improving at a snails pace at this point.
petesergeant•34m ago
Not doubting you but anything to back that up? Happy enough to burn VC money until someone shows up who can run it without losing money, either way.
rajman187•17m ago
They’ve filed a S1 [1] last year when attempting to go public. It showed something like a $60M+ loss for the first 6 months of 2024. The IPO didn’t happen because the CEO’s past included some financial missteps and the banks didn’t want to deal with this. At the time the majority of their revenue came from a single source in Abu Dhabi, as well

[1] https://www.sec.gov/Archives/edgar/data/2021728/000162828024...

rpdillon•3m ago
You're pointing out a bunch of high capex costs (hardware, SRAM), but then concluding that their opEx is greater than their revenue on a per unit basis. Are they really losing money on every token? It seems that using hardware acceleration would decrease inference costs and they could make it up on unit economics over time.

But I'm just reasoning from first principles. I don't have any specific data about them.

KronisLV•1h ago
The Cerebras GML-4.6 post might also be of (some?/more?) interest to the people here, since it's more useful for programming: https://news.ycombinator.com/item?id=45852751

I don't think that this is a dupe or anything and 3000 t/s is really cool, the other post just has more discussion of Cerebras and people's experiences with using GLM 4.6 for software development.

sunpazed•20m ago
This is really impressive. At these speeds, it’s possible to run agents with multi-tool turns within seconds. Consider it a feature rich, “non-deterministic API” for your platform or business.

Kagi Small Web

https://kagi.com/smallweb
28•stranges•48m ago•3 comments

My friends and I accidentally faked the Ryzen 7 9700X3D leaks

https://old.reddit.com/r/pcmasterrace/comments/1orc6jl/my_friends_and_i_accidentally_faked_the_ry...
86•djrockstar1•1h ago•5 comments

Why is Zig so cool?

https://nilostolte.github.io/tech/articles/ZigCool.html
374•vitalnodo•14h ago•280 comments

Valdi – A cross-platform UI framework that delivers native performance

https://github.com/Snapchat/Valdi
328•yehiaabdelm•12h ago•124 comments

Making Democracy Work: Fixing and Simplifying Egalitarian Paxos

https://arxiv.org/abs/2511.02743
60•otrack•5h ago•13 comments

Friendly attributes pattern in Ruby

https://brunosutic.com/blog/ruby-friendly-attributes-pattern
38•brunosutic•5d ago•19 comments

Becoming a compiler engineer

https://rona.substack.com/p/becoming-a-compiler-engineer
246•lalitkale•15h ago•118 comments

Cekura (YC F24) Is Hiring

1•atarus•1h ago

Reverse engineering a neural network's clever solution to binary addition (2023)

https://cprimozic.net/blog/reverse-engineering-a-small-neural-network/
15•Ameo•4d ago•1 comments

Myna: Monospace typeface designed for symbol-heavy programming languages

https://github.com/sayyadirfanali/Myna
292•birdculture•18h ago•138 comments

Immutable Software Deploys Using ZFS Jails on FreeBSD

https://conradresearch.com/articles/immutable-software-deploy-zfs-jails
101•vermaden•12h ago•32 comments

The Initial Ideal Customer Profile Worksheet

https://www.reifyworks.com/writing/2023-01-30-iicp
58•mrbbk•4d ago•4 comments

How did I get here?

https://how-did-i-get-here.net/
240•zachlatta•17h ago•45 comments

Why I love OCaml (2023)

https://mccd.space/posts/ocaml-the-worlds-best/
340•art-w•22h ago•246 comments

Ruby already solved my problem

https://newsletter.masilotti.com/p/ruby-already-solved-my-problem
230•joemasilotti•18h ago•96 comments

Local First Htmx

https://elijahm.com/posts/local_first_htmx/
61•srid•10h ago•30 comments

Mullvad: Shutting down our search proxy Leta

https://mullvad.net/en/blog/shutting-down-our-search-proxy-leta
125•holysoles•12h ago•81 comments

Running a 68060 CPU in Quadra 650

https://github.com/ZigZagJoe/Macintosh-Q650-68060
53•zdw•11h ago•22 comments

YouTube Removes Windows 11 Bypass Tutorials, Claims 'Risk of Physical Harm'

https://news.itsfoss.com/youtube-removes-windows-11-bypass-tutorials/
683•WaitWaitWha•16h ago•261 comments

Cerebras Code now supports GLM 4.6 at 1000 tokens/sec

https://www.cerebras.ai/code
103•nathabonfim59•13h ago•70 comments

Apple's "notarisation" – blocking software freedom of developers and users

https://fsfe.org/news/2025/news-20251105-01.en.html
143•DavideNL•7h ago•92 comments

A rats to riches story: Larry the Downing Street cat finds place in TV spotlight

https://www.theguardian.com/politics/2025/nov/08/a-rats-to-riches-story-larry-the-downing-street-...
21•zeristor•3h ago•5 comments

Venn Diagram for 7 Sets

https://moebio.com/research/sevensets/
139•bramadityaw•4d ago•36 comments

Dark mode by local sunlight (2021)

https://www.ctnicholas.dev/articles/dark-mode-by-sunlight
5•gaws•4d ago•3 comments

Angel Investors, a Field Guide

https://www.jeanyang.com/posts/angel-investors-a-field-guide/
151•azhenley•20h ago•38 comments

Can you save on LLM tokens using images instead of text?

https://pagewatch.ai/blog/post/llm-text-as-image-tokens/
30•lpellis•6d ago•10 comments

FSF40 Hackathon

https://www.fsf.org/events/fsf40-hackathon
92•salutis•5d ago•2 comments

Copy button added to Stack Overflow

https://meta.stackexchange.com/questions/414573/results-of-the-october-2025-community-asks-sprint...
12•exploraz•1h ago•12 comments

Ribir: Non-intrusive GUI framework for Rust/WASM

https://github.com/RibirX/Ribir
67•adamnemecek•16h ago•13 comments

VLC's Jean-Baptiste Kempf Receives the European SFS Award 2025

https://fsfe.org/news/2025/news-20251107-01.en.html
374•kirschner•16h ago•73 comments