AbuseIPDB

https://www.abuseipdb.com/
2•palmfacehn•5m ago•0 comments

Project Ire autonomously identifies malware at scale

https://www.microsoft.com/en-us/research/blog/project-ire-autonomously-identifies-malware-at-scale/
1•PaulHoule•6m ago•0 comments

Amtrak's New Acela Trains Are Here. They're Moving Slower Than the Old Ones

https://www.wsj.com/us-news/amtraks-new-acela-trains-are-here-theyre-moving-slower-than-the-old-o...
2•JumpCrisscross•6m ago•0 comments

Essential Coding Theory [pdf]

https://cse.buffalo.edu/faculty/atri/courses/coding-theory/book/web-coding-book.pdf
2•ibobev•8m ago•0 comments

PgDog adds support for Rust plugins

https://pgdog.dev/blog/plugins-are-back
1•levkk•8m ago•0 comments

America Educates the Best and Brightest–Then Shows Them the Door

https://reason.com/2025/08/28/educating-the-worlds-best-and-brightest-then-showing-them-the-door/
2•pseudolus•9m ago•0 comments

FTC chair accuses Google of treating GOP's emails as spam

https://www.theregister.com/2025/08/29/gmail_republican_email_spam/
3•terminalbraid•9m ago•0 comments

A Brief, Incomplete and Mostly Subjective History of Chinese Internet Censorship

https://danglingpointer.fun/posts/GFWHistory
1•arrowsmith•9m ago•0 comments

Mainstream Websites that Provide Onion Services

https://github.com/alecmuffett/real-world-onion-sites
1•keepamovin•10m ago•0 comments

Expert Analysis and 2030 Price Forecast for GSAT Stock

https://dashboard-finance.com/stock/gsat/prediction
1•tchantchov•11m ago•1 comments

Show HN: WASM Quest, an open source game by Tortured Metaphor

https://github.com/Tortured-Metaphor/WASM-Quest
2•DavidCanHelp•13m ago•0 comments

Austrian regulator sides with noyb in data access case against YouTube

https://www.neowin.net/news/austrian-regulator-sides-with-noyb-in-data-access-case-against-youtube/
1•bundie•13m ago•0 comments

Compiling SvelteKit to an Executable

https://github.com/Hugo-Dz/exe
1•HugoDz•14m ago•0 comments

Illusion of Explanatory Depth

https://en.wikipedia.org/wiki/Illusion_of_explanatory_depth
1•teleforce•14m ago•0 comments

Why n8n gives AI features away for free

https://getlago.substack.com/p/why-n8n-gives-away-free-ai
2•FinnLobsien•15m ago•0 comments

Skynet: Control robots and drones with LLMs via MCP using Bash

https://github.com/hybridgroup/skynet
2•deadprogram•16m ago•0 comments

An Analog Solution for Mindful Living

https://www.theatlantic.com/newsletters/archive/2025/08/linda-gregg-mindful-poetry/684036/
1•FinnLobsien•16m ago•0 comments

Effective short intervals containing primes

https://arxiv.org/abs/2508.18786
1•bikenaga•17m ago•0 comments

Accusing Someone of "Support[Ing] Neo-Nazi Causes" May Be Libelous

https://reason.com/volokh/2025/08/28/accusing-someone-of-supporting-neo-nazi-causes-may-be-a-fact...
2•pcaharrier•18m ago•0 comments

Doubling CO2 to 840 ppm will increase the food supply by 40%

https://co2coalition.org/publications/lindzen-happer-statement-to-national-academies-of-sciences-...
2•bilsbie•18m ago•2 comments

Jam – Zero Hallucination Big Data Storage Engine

https://cithorum.ca/
1•cithorum•19m ago•2 comments

SQLite Is Edge Scale

https://www.fermyon.com/blog/sqlite-is-edge-scale
2•juanviera23•20m ago•0 comments

This class is primarily for Python support (hence the "Retarded" prefix).

https://github.com/xbmc/xbmc/blob/6d26d3ace1537e23249386eaeddbc6f04c251cb0/xbmc/interfaces/legacy...
1•lr0•21m ago•0 comments

Love Is Freedom

https://stephango.com/love
1•Brajeshwar•23m ago•0 comments

Show HN: Ec2instances.info alerts for AWS pricing changes

1•StratusBen•25m ago•0 comments

Why auroras are so much brighter and more easily visible recently

https://www.newscientist.com/article/2479260-why-auroras-are-so-much-brighter-and-more-easily-vis...
1•Brajeshwar•26m ago•0 comments

Show HN: Manipulate NumPy arrays in Python using Uiua

https://github.com/bergkvist/uiuapy
1•bergkvist•26m ago•0 comments

AI Is a Hype-Fueled Dumpster Fire [YouTube]

https://www.youtube.com/watch?v=0bF_AQvHs1M
2•OhMeadhbh•26m ago•1 comments

Engineers send quantum signals with standard Internet Protocol

https://phys.org/news/2025-08-quantum-standard-internet-protocol.html
1•Brajeshwar•26m ago•0 comments

Countering Chinese State-Sponsored Actors Compromise of Networks Worldwide

https://www.cisa.gov/news-events/cybersecurity-advisories/aa25-239a
2•fidotron•27m ago•0 comments

Deploying DeepSeek on 96 H100 GPUs

https://lmsys.org/blog/2025-05-05-large-scale-ep/
60•GabrielBianconi•1h ago

Comments

34679•1h ago
"By deploying this implementation locally, it translates to a cost of $0.20/1M output tokens"

Is that just the cost of electricity, or does it include the cost of the GPUs spread out over their predicted lifetime?

dragonslayer56•1h ago
"Our implementation, shown in the figure above, runs on 12 nodes in the Atlas Cloud, each equipped with 8 H100 GPUs."

Maybe the cost of renting?

34679•52m ago
I'm confused because I wouldn't consider a cloud implementation to be local.
randomjoe2•22m ago
To many people, "local" no longer means "on metal".
monsieurbanana•11m ago
I missed that train
mwcz•10m ago
"On metal" is muddied too. I've heard people refer to web apps running in an OCI container as being "bare metal" deployment, as opposed to AWS or whatever hosting platform.

That's silly, but the idea that "local" is not the opposite of remote is even sillier.

ffsm8•5m ago
You can run an OCI container on bare metal though. It doesn't stop running on bare metal just because it runs in kernel namespaces, i.e. in a Docker container.

Lots of people were advocating for running their k8s on bare metal servers to maximize the performance of their containers.

Now whether that applies to your conversation... I've no clue, too little context ( 。 ŏ ﹏ ŏ )

DSingularity•19m ago
I guess local for him is independent/private.
ollybee•23m ago
H100s can be $2 an hour, so $192 an hour for the full cluster. They report 22k tokens per second, so ~80 million an hour; that's $16 an hour at $0.20 per million. Maybe a bit more for input tokens, but it seems a long way off.
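
The arithmetic in the comment above can be checked with a short sketch. It takes the commenter's figures at face value ($2 per H100-hour, 96 GPUs, 22k output tokens per second); the variable and function names are illustrative, and the thread doesn't say whether the throughput figure is per node or for the whole cluster, so treat the result as a rough estimate rather than a measurement.

```python
# Back-of-the-envelope check using only the figures quoted in the thread.
GPU_COUNT = 96                  # 12 nodes x 8 H100s
GPU_HOURLY_USD = 2.00           # assumed rental price per H100-hour (commenter's figure)
TOKENS_PER_SECOND = 22_000      # output throughput as quoted in the comment
QUOTED_USD_PER_MILLION = 0.20   # cost per 1M output tokens quoted from the article


def cluster_cost_per_hour(gpu_count: int, gpu_hourly_usd: float) -> float:
    """Hourly rental cost of the whole cluster in USD."""
    return gpu_count * gpu_hourly_usd


def millions_of_tokens_per_hour(tokens_per_second: float) -> float:
    """Convert a tokens-per-second rate into millions of tokens per hour."""
    return tokens_per_second * 3600 / 1_000_000


rental_per_hour = cluster_cost_per_hour(GPU_COUNT, GPU_HOURLY_USD)
output_millions = millions_of_tokens_per_hour(TOKENS_PER_SECOND)

# Value of one hour of output at the quoted price per million tokens.
value_at_quoted_price = output_millions * QUOTED_USD_PER_MILLION

# Rental cost per million output tokens implied by these assumptions.
implied_usd_per_million = rental_per_hour / output_millions

print(f"rental cost:        ${rental_per_hour:.2f}/hr")
print(f"output volume:      {output_millions:.1f}M tokens/hr")
print(f"at quoted price:    ${value_at_quoted_price:.2f}/hr")
print(f"implied rental:     ${implied_usd_per_million:.2f}/M output tokens")
```

Under these assumptions the rental cost works out to roughly $2.42 per million output tokens, about 12 times the quoted $0.20, which is the gap the comment points at.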
abdellah123•54m ago
Wow, please edit the title to include "open source"!
Blahah•39m ago
Why? Open source isn't in the original title
SV_BubbleTime•31m ago
Also, I feel “open source” is being used to cover for “open weights”, which is not the same thing.
caminanteblanco•38m ago
There was some tangentially related discussion in this post: https://news.ycombinator.com/item?id=45050415, but this cost analysis answers so many questions, and gives me a better idea of how large a margin on inference many of these providers could be taking. Plus I'm sure that Google or OpenAI can get more favorable data center rates than the average Joe Schmoe.

A node of 8 H100s will run you $31.40/hr on AWS, so for all 96 you're looking at $376.80/hr. With 188 million input tokens/hr and 80 million output tokens/hr, that comes out to around $2/million input tokens, and $4.70/million output tokens.

This is actually a lot more than DeepSeek R1's rates of $0.10-$0.60/million input and $2/million output, but I'm sure major providers are not paying AWS p5 on-demand pricing.
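
The AWS estimate above can be reproduced with a few lines. This is a sketch using only the numbers in the comment ($31.40 per 8-GPU node-hour, 12 nodes, 188 million input and 80 million output tokens per hour); the names are illustrative. Note that, like the comment, it divides the full cluster cost by each token type separately, so the two per-million figures are not additive.

```python
# Reproduce the per-million-token costs from the comment's AWS figures.
NODE_HOURLY_USD = 31.40        # commenter's price for one 8x H100 node on AWS
NODE_COUNT = 12                # 96 GPUs / 8 GPUs per node
INPUT_MILLIONS_PER_HOUR = 188  # input tokens per hour, in millions (from the comment)
OUTPUT_MILLIONS_PER_HOUR = 80  # output tokens per hour, in millions (from the comment)


def usd_per_million(cost_per_hour: float, millions_per_hour: float) -> float:
    """Hourly cost divided by hourly token volume (in millions)."""
    return cost_per_hour / millions_per_hour


aws_cluster_per_hour = NODE_HOURLY_USD * NODE_COUNT

# Full cluster cost attributed to each token type in turn, as in the comment.
input_usd_per_million = usd_per_million(aws_cluster_per_hour, INPUT_MILLIONS_PER_HOUR)
output_usd_per_million = usd_per_million(aws_cluster_per_hour, OUTPUT_MILLIONS_PER_HOUR)

print(f"cluster: ${aws_cluster_per_hour:.2f}/hr")
print(f"input:   ${input_usd_per_million:.2f}/M tokens")
print(f"output:  ${output_usd_per_million:.2f}/M tokens")
```

The results match the comment: about $376.80 per hour for the cluster, roughly $2.00 per million input tokens and $4.71 per million output tokens.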

s46dxc5r7tv8•30m ago
Separation of the prefill and decoding layers with SGLang is quite nifty! Normally 8xH100 would barely be able to hold the 4-bit quantization of the model without even considering the KV cache. One prefill node for 3 decode nodes is also fascinating. Nice writeup.
arnaudsm•8m ago
Interestingly, this is 10x cheaper than the cheapest provider on OpenRouter: https://openrouter.ai/deepseek/deepseek-r1?sort=price

Inference is more profitable than I thought.