frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

DeepSeek open-sources inference optimizations with 60–85% faster generation [pdf]

https://github.com/deepseek-ai/DeepSpec/blob/main/DSpark_paper.pdf
273•aurenvale•2h ago•60 comments

Fintech Engineering Handbook

https://w.pitula.me/fintech-engineering-handbook/
39•signa11•50m ago•5 comments

Previewing GPT‑5.6 Sol: a next-generation model

https://openai.com/index/previewing-gpt-5-6-sol/
1025•minimaxir•18h ago•642 comments

Linux on Older Hardware: The Complete Revival Guide

https://www.fosslinux.com/158206/linux-on-older-hardware-revival-guide.htm
74•tapanjk•2d ago•26 comments

Long Wave radio era set to end with switch-off

https://www.economist.com/britain/2026/06/25/the-bbc-switches-off-its-oldest-service
46•edward•1d ago•55 comments

Beer CSS – Build material design in record time

https://www.beercss.com
23•Seb-C•2h ago•2 comments

WordStar: A Writer's Word Processor (1996)

https://www.sfwriter.com/wordstar.htm
97•droidjj•7h ago•45 comments

Why does kinetic energy increase quadratically, not linearly, with speed? (2011)

https://physics.stackexchange.com/questions/535/why-does-kinetic-energy-increase-quadratically-no...
253•ProxyTracer•12h ago•122 comments

The US Army Issued Ocarinas to Soldiers in World War II

https://www.flutetunes.com/articles/my-flute-goes-to-war/
12•tomcam•2d ago•5 comments

Faster KNN search in Manticore: 2-pass HNSW, batched distances, and AVX-512

https://medium.com/@s_nikolaev/faster-knn-search-in-manticore-2-pass-hnsw-batched-distances-and-a...
12•snikolaev•1d ago•1 comments

OpenTTD 16.0-Beta1

https://www.openttd.org/news/2026/06/25/openttd-16-0-beta1
177•untilted•6h ago•32 comments

U.S. allows Anthropic to release Mythos AI to ‘trusted’ US organizations

https://www.semafor.com/article/06/27/2026/us-releases-powerful-anthropic-model-mythos-to-some-us...
458•bobrenjc93•12h ago•556 comments

AI in mathematics is forcing big questions

https://spectrum.ieee.org/ai-in-mathematics
135•rbanffy•12h ago•102 comments

MicroVMs: Run isolated sandboxes with full lifecycle control

https://aws.amazon.com/blogs/aws/run-isolated-sandboxes-with-full-lifecycle-control-aws-lambda-in...
337•justincormack•4d ago•188 comments

Fusion Programming Language

https://fusion-lang.org/
81•efrecon•2d ago•37 comments

Jest/Vitest interactive course (runs in the browser)

https://howtotestfrontend.com/courses/jest-vitest-fundamentals
9•howToTestFE•2d ago•5 comments

Hellishly Slow Level 13 Deflate Compression

https://kirill.korins.ky/articles/hellishly-slow-level-13-deflate-compression/
64•zX41ZdbW•4d ago•19 comments

IBM MCGA Gate Array Reverse Engineering

https://github.com/schlae/IBM_MCGA
37•userbinator•6h ago•6 comments

U.S. government will decide who gets to use GPT-5.6

https://www.washingtonpost.com/technology/2026/06/26/openai-says-us-government-will-vet-users-its...
1052•alain94040•16h ago•1109 comments

Anatomy of a Failed (Nation-State?) Attack

https://grack.com/blog/2026/06/25/dissecting-a-failed-nation-state-attack/
66•signa11•8h ago•11 comments

Show HN: Hacker News on a train station-style flip board

https://popflame.quickish.space/hn-flipboard/
78•PaybackTony•10h ago•18 comments

Ultrasound imaging of the brain

https://alephneuro.com/blog/ultrasound-brain
287•rossant•23h ago•114 comments

Om

https://daringfireball.net/2026/06/om
394•throw0101a•11h ago•19 comments

The gap between open weights LLMs and closed source LLMs

https://blog.doubleword.ai/frontier-os-llm
221•kkm•14h ago•180 comments

We can still stop California's 3D printer surveillance scheme

https://www.eff.org/deeplinks/2026/06/we-can-still-stop-californias-3d-printer-surveillance-scheme
409•hn_acker•14h ago•138 comments

Foreign funds help make housing unaffordable: research

https://news.mccombs.utexas.edu/research/foreign-funds-help-make-housing-unaffordable/
84•hhs•11h ago•26 comments

SCC Technical Assistance Program

https://nerocam.com/scc_tap.asp
20•luu•3d ago•1 comments

A C++ implementation of a fast hash map and hash set using hopscotch hashing

https://github.com/Tessil/hopscotch-map
94•gjvc•14h ago•16 comments

Show HN: DBOSify – Drop-in Temporal replacement built on Postgres

https://github.com/dbos-inc/dbosify-py
66•KraftyOne•2d ago•9 comments

What Is a Nomogram and Why Would It Interest Me?

https://lefakkomies.github.io/pynomo-doc/introduction/introduction.html#what-is-a-nomogram-and-wh...
126•Eridanus2•17h ago•20 comments
Open in hackernews

DeepSeek open-sources inference optimizations with 60–85% faster generation [pdf]

https://github.com/deepseek-ai/DeepSpec/blob/main/DSpark_paper.pdf
270•aurenvale•2h ago

Comments

Havoc•1h ago
Nice.

Guessing the timing isn't accidental. Demonstrated openness vs harsh regulation

ricardobeat•1h ago
Presumably this has been in production for a while, and is one of the reasons they were able to dramatically lower prices a month ago?
_0ffh•50m ago
Lookahead Sparse Attention should be playing a big role as well, as it dramatically slashes memory consumption.
Jackobrien•1h ago
I see a world soon where there’s an extremely wide variety of small models for speculative decoding, unique to use cases, companies, and even individuals.
nicce•1h ago
Hopefully that is the case and hardware does not get impossible to get.
pydry•54m ago
yes, heavily constrained by sophisticated guardrails.

this is definitely where things are going. the enormous "eat the world" models have extreme diminishing returns by comparison.

preetham_rangu•1h ago
do they use their OCR, or someone else?
piterrro•1h ago
I’ve been using DeepSeek v4 pro for a month now in Kilo Code and its great. Fast, reliable, large context window and cheap as… Did 1,5B tokens this month and cost me 40usd (majority cached, but still).
spiderfarmer•1h ago
Is there a way to see how many tokes one does with claude code (pro)?
cptchaos•58m ago
https://ccusage.com/
bpavuk•42m ago
the casino has no clocks, as one HN user put it some time ago.

I second ccusage, it's nice

edg5000•9m ago
It's in the JSONs in ~/.claude, but last 30 days only I think. You can have the model analyze history. So for correct history you'd need to run history analysis on a cron job or something. Kinda hacky.
rvz•1h ago
This is just one of many papers DeepSeek have released to be able to serve models at extremely cheap prices, unlike the others taking on >$100B+ of debt in building data centers for the same thing.

> As with V4-Flash, we treat this point as an indication that DSpark sustains useful throughput under an interactivity target that the baseline cannot efficiently support. At matched system capacities, DSpark delivers 57% to 78% faster per-user generation.

Reminds me of the flawed solution in scaling servers in 2017 that use memory-intensive technologies by adding even more servers to solve the problem. (It just increases costs.)

Rather than doing that, think about which critical parts of your app can be written in a more performant technology.

Fast forward to 2026, now you can see who is just throwing more money at the problem to create even more problems where as DeepSeek is giving us optimized solutions.

I know exactly who I would pay attention to, and it is absolutely not Anthropic.

2838383838•1h ago
Must be wonderful to be on the board of OpenAi et al & their PE investors whilst China keeps blowing up these mines under their feet lmao. Luckily Korean pension funds will buy all the trash as usual but goddamn you gotta start moving quick or you are gonna need some serious AGI to show you how to offload those bonds
ForHackernews•50m ago
"We will build the machine-god and pray for it to pay for itself."
FridgeSeal•36m ago
Every day, the rate of “could post a picture of 40k tech priests and have it taken unironically” goes up, and it’s starting to get concerning.
ozgrakkurt•22m ago
Don’t worry they will sell all the hardware and data they acquired with their grift
kamranjon•56m ago
DeepSeek continues to not only push the boundaries but also publish these incredible papers explaining how they achieved their gains - something the American labs no longer do unfortunately. Chinese labs are doing the most interesting work in AI right now.
herodoturtle•53m ago
Publishing by necessity I wonder? American labs on the cutting edge pioneering the way forward, so Deepseek open sourcing what they’ve got is to help even the playing field.

Hopefully the experts here can offer insight. The above is just my hunch and I’m not a specialist in this field.

jonplackett•51m ago
Wouldn’t that just help the American labs anyway though? Or do they assume they’ve actually already figured this stuff out and kept it secret?
_0ffh•41m ago
I'm afraid I'm even balking at the word "pioneering" in context with US frontier labs. They are probably doing a few new things, right, but they are not blazing any trails for others to follow along, the Chinese are.
epolanski•23m ago
Chinese papers and techniques have been very influential and copied by US labs.

Multi-head Latent Attention (MLA), Multi-Token prediction, MoE architecture are some of the most famous examples.

tomalaci
pokot0•36m ago
I am wondering if this is why they can offer their pro model at ~1/4th of the price compared to the other providers offering the same model, and if other providers will be able to do the same in a short timeframe.
vidarh•29m ago
It'd presumably help a lot, but also when you use their endpoint they get more training data.
nicce•26m ago
This applies to every provider. OpenAI seems to be the worst hoarder.
pokot0•4m ago
actually you can buy inference on third party providers that serve deepseek v4 pro with zero data retention (ZDR).
epolanski•17m ago
US labs do it too.
sschueller•4m ago
I have been heavily using DeepSeek V4 Pro at Max for a month now and I would say it is 100x cheaper. If I pay for Claude I will hit that limit so fast I am always waiting 5 hours. Using the frontier models at Kilo I go through dollars while doing the same thing via DeepSeek it is pennies.
danielabinav160•20m ago
Would love to see these numbers reproduced on consumer GPUs, not just A100s.
tommica•17m ago
Maybe somaday an 8gb videocard can be used for coding...
kamranjon•2m ago
The hugging face models are already up and seem to be the original models with the speculative decoding module built in which is very cool:

Flash: https://huggingface.co/deepseek-ai/DeepSeek-V4-Flash-DSpark

Pro: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro-DSpark

Excited to see if this makes it into DwarfStar for local inference, have been using the flash model extensively since the 2-bit quants were made available.

•
50m ago
Probably because American AI companies are on the hook for quite a lot of investment money. I think they are trying to find the magical moat to justify their valuation.

Revealing optimizations similar to these would pretty much reduce their competitive position.

lwansbrough•43m ago
Chinese labs are also still behind, so they’re incentivized to collaborate and have no reason to do it in private.

I suspect their tune will change if they ever take the lead..

colordrops•41m ago
So the marketplace is working.
abc123abc123•32m ago
This is the way! Open source models will benefit, and once open source models reach the state of "good enough" the hyped up US AI companies will fear, since the availability of free, good enough, AI models will set the ceiling for how much they can charge. Then the bubble will pop.
tw1984•39m ago
> Chinese labs are also still behind, so they’re incentivized to collaborate and have no reason to do it in private.

US labs in Google, Meta and SpaceX are not leading, none of them managed to build something on par with GLM 5.2.

Care to explain to me why they still don't collaborate and still choose to do it in private?

lwansbrough•37m ago
No idea I don’t work there.
budsniffer952•35m ago
Wait, are you claiming that these companies haven't contributed to the ecosystem via research and open source?
vidarh•35m ago
I'm not sure I'd put Google in that list, but either way: Because they think they have enough capital that they can catch up and don't need the reputational boost of this.
CuriouslyC•28m ago
As good as Gemini's visual intelligence is, it's a terrible agent.
7speter•9m ago
Google at least still releases open source models to the public.
oefrha•35m ago
Which is a good thing. Self-serving motives are more reliable than altruistic ones.
nubg•27m ago
Very interesting take
broodbucket•21m ago
Look at how far OpenAI has drifted from their original mission. Everything comes back to greed, so it's ideal for the world if selfish motives happen to coincide with what's good for the world, like advancements in open models
roenxi•14m ago
It's a standard take since it is how markets tend to work. They aren't powered by altruism, it is a big system for turning greed into good results. We don't have all this stuff because people suddenly woke up one morning and decided to be nice.
amelius•16m ago
You mean more predictable, not more reliable.
rrvsh•11m ago
Could you explain? (asking in good faith)
budsniffer952•37m ago
Do you think that DeepSeek are building their models for free, or something? They aren't "on the hook" for anything?

What's with all the China glazing about this stuff? They release some open-source work and people act like they are suddenly the beacon of freedom and transparency.

abc123abc123•31m ago
This is incorrect binary thinking. Them releasing open source can be good, but that does not commit you to think that china or chinese companies are saints. There are many shades of grey here and one does not exclude the other (nor include it).
7speter•6m ago
I’m think its in our best interests to lever these american ai companies to exhibit at least some degree of freedom and transparency anyway we can…
cromka•25m ago
I seriously am far from fear mongering and doomsday mentality, but I just can't see how OpenAI and Anthropic can have a successful IPO if the quality gap between the free and paid continues to narrow like that...
cyanydeez•24m ago
fascism. it works be corporate fascism.
2838383838•14m ago
this place might as well be fucking reddit nowadays
rvz•44m ago
Exactly. They did not have to open up their research up and this is what happens when smart researchers are forced to squeeze performance gains out of existing hardware.

They don't have TPUs or access to the latest Vera Rubin GPUs either to get performance gains for free. All of the optimizations Deepseek have done are in software and it goes down to the PTX assembly level.

Compared to Anthropic who are celebrating in fixing a flickering issue in a terminal app which took months to fix.

vidarh•30m ago
> Compared to Anthropic who are celebrating in fixing a flickering issue in a terminal app which took months to fix.

It's funny, because if you ran Claude Code on a slow terminal, the cause of the flicker was obvious: They kept dumping the entire history of the chat back into the terminal in a number of situations, and relied on the terminal to them end up in the correct state.

yorwba•22m ago
Anthropic almost certainly also has optimized software down to the assembly level, considering this take-home interview challenge they published: https://github.com/anthropics/original_performance_takehome/... which is all about instruction-level performance optimizations. That they don't prioritize UI fixes just means they consider other things more important.
epolanski•24m ago
R1 was very influential on US models development.
jmyeet•19m ago
Chinese companies (and labs) operate in conjunction with the CCP so whatever they're doing, it's because it's Chinese state policy.

What became clear when DeepSeek came onto the scene was that China was seeking to commoditize LLMs. They consider it an issue of national security not to be beholden to US tech companies when it comes to AI. And I, for one, fully endorse this policy.

Another data point on this is the black market for Claude tokens in China [1]. The chat logs themselves are a commodity to train models.

I believe that OpenAI in particular is a bet on a trillion dollar pot of gold that doesn't exist. Google, Microsoft, Amazon and Meta will all be fine. Anthropic is in a far better position than OpenAI (IMHO) but if DeepSeek or some other Chinese open weight model gets as good at coding, they're in real trouble too.

[1]: https://news.ycombinator.com/item?id=48667495