frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Show HN: Lumen–free Real-time LLM token and cost monitor

https://github.com/DataGrout/lumen
1•Datagrout•54s ago•0 comments

Oracle's AI spending blows past estimates, raising worries over growing debt

https://www.reuters.com/technology/oracle-beats-fourth-quarter-revenue-estimates-2026-06-10/
2•1vuio0pswjnm7•1m ago•0 comments

ZenNotes – Open-source local Markdown app with system-wide Vim motions

https://github.com/ZenNotes/zennotes
1•daesorin•2m ago•0 comments

Hacker Tube – Timeless tech videos (popular on Hacker News)

https://www.orangecrumbs.com/v
2•oyster143•3m ago•0 comments

Why Thermodynamics Rules Future Orbital Data Centers

https://spectrum.ieee.org/orbital-data-centers-heat
2•rbanffy•4m ago•0 comments

Google DeepMind is worried about when agents start to interact

https://www.technologyreview.com/2026/06/11/1138794/google-deepmind-is-worried-about-what-happens...
2•leopoldj•5m ago•0 comments

Show HN: 5dive – Run a Team of Claude Code Agents from Telegram

https://github.com/5dive-com/5dive
3•lodar•5m ago•0 comments

Show HN: GeoLeadScraper – Open-source Google Maps scraper (Chrome extension)

https://github.com/ozhehkovski/geoleadscraper
2•bizdata•6m ago•0 comments

DSF 2026 Fundraising Goals

https://www.djangoproject.com/weblog/2026/jun/10/dsf-2026-fundraising-goals/
2•petr25102018•6m ago•0 comments

Too-ly.eu – browser-based dev tools with no tracking, no login, EU-hosted

https://www.too-ly.eu/
2•quintoncarroll•6m ago•0 comments

Workers are spending over 6 hours a week botsitting AI, fueling job frustration

https://www.businessinsider.com/botsitting-ai-hidden-human-labor-at-work-2026-6
4•ZeidJ•7m ago•0 comments

Show HN: Editxr – open-source WYSIWYG Markdown editor for the terminal

https://github.com/pixdeo/editxr
4•mromanuk•8m ago•1 comments

Hawkish GOP lawmaker Don Bacon says he was hacked by Russia

https://www.politico.com/live-updates/2026/06/10/congress/hawkish-gop-lawmaker-don-bacon-says-he-...
2•campuscodi•8m ago•0 comments

25.3% of Bitcoin sits at addresses with an exposed public key

https://chainquery.com/reports/quantum-exposure
4•granya•8m ago•0 comments

Homebrew 6.0.0

https://brew.sh/2026/06/11/homebrew-6.0.0/
6•mikemcquaid•9m ago•1 comments

Why adorable bees are emerging from this cemetery

https://grist.org/cities/why-millions-of-adorable-bees-are-emerging-from-this-cemetery/
2•speckx•9m ago•0 comments

Is Artificial General Intelligence Achievable?

2•hsnewman•10m ago•0 comments

Intelligence per Dollar

https://tomtunguz.com/tokens-per-result/
2•gmays•10m ago•0 comments

We chose code as the source of truth for AI-generated workflows

https://www.serval.com/serval-news/do-the-hard-things-always
2•emot•11m ago•0 comments

Europe 2031: What getting AI wrong means for us

https://europe2031.ai/
4•atlasunshrugged•12m ago•1 comments

How Kyoto Is Using Data to Fight Overtourism

https://www.youtube.com/watch?v=0Z74tlGUSG4
2•Akiko_Tsuka•12m ago•0 comments

Kasspian – An AI that tells you your business idea isn't good

https://kasspian.com
4•izhonline•15m ago•0 comments

The Compute Caoursel Spcx

https://askmelon.com/beta/articles/2026-06-09T1151-the-compute-carousel
3•alvations•16m ago•0 comments

Tiny Awards for Small Websites

https://tinyawards.net/
3•marvinborner•16m ago•0 comments

Cohere's First Model for Developers

https://cohere.com/blog/north-mini-code
3•hmokiguess•18m ago•0 comments

Show HN: Workplane – collaborative files for agents (and humans)

https://workplane.co
3•matanrak•18m ago•0 comments

Linux Lite 8.0 sheds Chrome, slims down, and finds its name fits better

https://www.theregister.com/os-platforms/2026/06/10/linux-lite-80-sheds-chrome-slims-down-and-fin...
2•rbanffy•19m ago•0 comments

Build Your Dream Home: Fable 5 vs. GPT-5 vs. Gemini

https://www.promptfrenzy.com/showdown/dream-home
2•spobin•19m ago•0 comments

Open Reproduction of DeepSeek-R1

https://github.com/huggingface/open-r1
3•yogthos•19m ago•0 comments

Flatiron is a fast columnar analytics library for Clojure

https://github.com/yogthos/flatiron
3•yogthos•19m ago•0 comments
Open in hackernews

Intellect-2 Release: The First 32B Model Trained Through Globally Distributed RL

https://www.primeintellect.ai/blog/intellect-2-release
201•Philpax•1y ago

Comments

esafak•1y ago
How are they ensuring robustness against adversarial responses?
nsingh2•1y ago
From the article, seems like TOPLOC:

> based on top of novel components such as TOPLOC, which verifies rollouts from untrusted inference workers

https://github.com/PrimeIntellect-ai/toploc

xmasotto•1y ago
Can an expert explain how this protects against adversarial actors?

At a glance it looks like something akin to a computing a checksum that's locality sensitive, so it's robust to floating point errors, etc.

What's to stop someone from sending bad data + a matching bad checksum?

yorwba•1y ago
The validation procedure is described on page 8 of the TOPLOC paper: https://arxiv.org/abs/2501.16007

The checksum is validated by redoing the computation, but making use of the fact that you already have the entire response to enable greater parallelism than when generating it one token at a time.

DoctorOetker•1y ago
TOPLOC attempts to detect model substitution, i.e. responses being generated by a different model than requested, it comes with certain caveats, as far as I can tell the TOPLOC paper considers verifiable learning / training as out of scope.
ndgold•1y ago
Pretty badass
quantumwoke•1y ago
Wonder what the privacy story is like. Enterprises don't usually like broadcasting their private data across a freely accessible network.
bjt12345•1y ago
A strong use case here for quantum-safe encryption.
Zambyte•1y ago
Why? Quantum safe cryptography is mostly interesting right now in the context of defending against store now, decrypt later attacks. That doesn't seem helpful here, because they'll still need to decrypoit for training. Did you mean homomorphic encryption?
mountainriver•1y ago
Awesome work this team is doing. Globally distributed MoE could have real legs
refulgentis•1y ago
I guess I'm bearish?

It's not that they trained a new model, but they took an existing model and RL'd it a bit?

The scores are very close to QwQ-32B, and at the end:

"Overall, as QwQ-32B was already extensively trained with RL, it was difficult to obtain huge amounts of generalized improvement on benchmarks beyond our improvements on the training dataset. To see stronger improvements, it is likely that better base models such as the now available Qwen3, or higher quality datasets and RL environments are needed."

fabmilo•1y ago
The interesting delta here is that this proves that we can distribute the training and get a functioning model. The scaling factor is way bigger than datacenters
refulgentis•1y ago
The RL, not the training. No?
itchyjunk•1y ago
RL is still training. Just like pretraining is still training. SFT is also training. This is how I look at it. Models weights are being updated in all cases.
refulgentis•1y ago
Simplifying it down to "adjusting any weights is training, ipso facto this is meaningful" obscures more light than it sheds (as they noted, RL doesn't get you very far, at all)
comex•
jumploops•1y ago
Congrats to the team on the launch!

Personal story time: I met a couple of their engineers at an event a few months back. They mentioned they were building a distributed training system for LLMs.

I asked them how they were building it and they mentioned Python. I said something along the lines of “not to be the typical internet commenter guy, but why aren’t you using something like Rust for the distributed system parts?”

They mumbled something about Python as the base for all current LLMs, and then kinda just walked away…

From their article: > “Rust-based orchestrator and discovery service coordinate permissionless workers”

Glad to see that I wasn’t entirely off-base :)

Havoc•1y ago
Given the latencies at play python did probably make more sense though
bwfan123•1y ago
The technical underpinning has nothing to do with the language. It is a different way of optimizing parameters called diloco. I agree though that python is an abomination for systems services componentry when there are languages like rust.
throwanem•1y ago
There's a name and a logo. "Hubris" feels slightly beggared. https://en.m.wikipedia.org/wiki/The_Metamorphosis_of_Prime_I...
Extropy_•1y ago
This looks like a startup company. Why shouldn't it have a name and logo?
Philpax•1y ago
Their point is that the name and logo are clearly drawing from the Metamorphosis of Prime Intellect, with all the potential baggage that comes with it. It's an interesting choice.
throwanem•1y ago
The novel was the first popular codifier of the concepts of strongly superhuman ASI and hard-takeoff singularity, literally the work that introduced these ideas to the then quasi-New Atheist hangers-on among the kuro5hin crowd who became the initial core of what would develop into the follower base for singularitarianism. It was quite well written for that purpose, with enough sex and action to paper over the slow parts, and a real grasp of what it feels like when time contracts and dilates at once in those dolly-zoom moments where the universe is different forever and nothing outwardly changes. Combined with the seductive appeal and literally universal scope of the ideas that power its plot, it is no wonder the novel should have left so strong an impression on a few.

Someone intentionally invoking that history is interesting indeed. Someone doing it by accident might be more so. But I already gave that choice the name I judge it deserves.

bcoates
schneehertz•1y ago
I used to have an idea related to science fiction novels that artificial intelligence could aggregate computing power through the network to perform ultra-large-scale calculations, thereby achieving strong artificial intelligence. Reality will also develop in this way, which is very interesting
abtinf•1y ago
Does this have anything to do with The Metamorphosis Of Prime Intellect, or did they just abuse the name and the cover art?
arthurcolle•1y ago
Prime Intellect is a grabby AI :)
danielhanchen•1y ago
I made some GGUFs at https://huggingface.co/unsloth/INTELLECT-2-GGUF

./llama.cpp/llama-cli -hf unsloth/INTELLECT-2-GGUF:Q4_K_XL -ngl 99

Also it's best to read https://docs.unsloth.ai/basics/tutorial-how-to-run-qwq-32b-e... on sampling issues for QwQ based models.

Or TLDR, use the below settings:

./llama.cpp/llama-cli -hf unsloth/INTELLECT-2-GGUF:Q4_K_XL -ngl 99 --temp 0.6 --repeat-penalty 1.1 --dry-multiplier 0.5 --min-p 0.00 --top-k 40 --top-p 0.95 --samplers "top_k;top_p;min_p;temperature;dry;typ_p;xtc"

3abiton•1y ago
This is rather exciting! I see the future of Co-op models made by a community of experts on a specific field that would still allow them to be competitive with "AI monopolies". Maybe not all hope is lost!
iTokio•1y ago
It’s interesting that it does something useful (training a LLM) without trust and in a decentralized way.

Maybe this could be used as proof of work? To stop wasting computing resources in crypto currencies and get something useful as a byproduct.

proof_by_vibes•1y ago
There could be merit to this. Proofs are generally computationally hard, so it's possible that a currency could be created by quantifying verification.
littlestymaar•1y ago
> To stop wasting computing resources in crypto currencies and get something useful as a byproduct.

Bitcoin is the only major cryptocurrency that still use proof of work today (others are either using “proof of stakes” or are “Layer 2” chains), and due to its (relative lack of) governance structure, it's very unlikely to ever change.

fastball•1y ago
The emphasis is indeed on "without trust" – as far as I can tell this project is unable to verify whether the decentralized training nodes are contributing productively.

Without the ability to validate that training compute is heading in the globally desired direction, it is unlikely you could use it as the foundation of a (sound) cryptocurrency.

mentalgear•1y ago
The reward model could be used as a validation/reward for the client. Give the same nodes the same inferences to make, and the one with the highest reward (those could be short, or even partially calculated long-term) will also get the "currency" reward.
Thomashuet•1y ago
Summary: We've use the most complexest, buzzwordiest training infrastructure to increase the performance of our base model by a whopping 0.5% (±1%).
Weryj•1y ago
But this isn’t about the performance, the infrastructure is the product here.
lonelyasacloud•1y ago
Indeed, most reliable way to make money in a gold rush is to sell shovels.
Mougatine•1y ago
very cool work!
bwfan123•1y ago
The most interesting thing I see is the productization of the diloco work done here [1]. If someone can make this scale, then we can say goodbye to expensive backend networking and mainframe-like AI training machinery.

[1] https://arxiv.org/abs/2311.08105

ikeashark•1y ago
I wonder why they randomly noted a torch-compile vs non torch-compile figure where torch-compile degraded model performance. What made it degrade? It seems to only appear in one figure and nowhere else.
1y ago
But does that mean much when the training that produced the original model was not distributed?
christianqchung•1y ago
Third party fine tuned open weighted LLMs tend to be good at a handful of benchmarks, but parity or lower on others compared to the original model. There are some exceptions like Nvidia's Nemotron series, but the differences generally are so small as to be imperceptible. Deepseek released finetunes of several Qwen and Llama models alongside R1, and while they were better in some select (mostly math) and coding domains, there were problems resulting from fine tuning that didn't result in them overtaking the original models in usage.
cess11•1y ago
Seems that's mostly a byproduct from working on the core business idea, GPU arbitrage.
•
1y ago
Maybe torment nexus was taken
Philpax•1y ago
That sounds like it'll lead to human-driven reward hacking [0]?

[0]: https://en.wikipedia.org/wiki/Reward_hacking

bastawhiz•1y ago
Arguably that's worse than crypto proof of work: inference is extremely expensive and you're multiplying every operation by N. Which means the cost is multiplied by N.

And like, what are you doing? You've managed to find a use case where you don't care that you're doing compute on some untrusted servers online (and no, there's no magic AI homomorphic encryption) but at the same time you're willing to accept the latency of doing the work multiple times AND it's probably all low end 4090s doing the work AND you're willing to pay for the wasted compute? I'm here shuddering at the thought of model setup times when one node in a cluster goes down and you're facing that on... well, probably most inferences? If you're not administering the infra, you get the lowest common denominator of performance.

mentalgear•1y ago
That would be indeed a very promising way of FINALLY making cryptocurrency useful!
_ink_•1y ago
I read an argument, that proof of work needs to be useless and wasteful. If it would produce value in itself it would make 51% attacks more economic and thus the currency less secure.
throwanem•1y ago
Sure. The whole point of "proof of work" is to show (prove) you've lost energy to heat (work). That's what makes it costly and thus an honest signal.

The model breaks where work can be counterfeited (usually impossible) or where energy prices go to zero, which is why "bitcoin colonialism" was briefly a thing last decade. Much of bitcoin's design, this aspect also, is intended to protect against the bare-fanged, red-eyed money weasels it was also designed to attract.

ucha•1y ago
It needs to not have economic value but it doesn't necessarily need to be useless and wasteful.
Xmd5a•1y ago
For instance if the end product, in this case the LLM, is made available to anyone, publicly...
piiToo•1y ago
If it improves the economic value of something else it has economic value just not on its own discrete value.

Wrappers on candy don’t have value intrinsically but improve the quality of the candy.

api•1y ago
I’ve seen an argument that military power and credible threat are the proof of work mechanism for fiat currencies. That is also useless, but it does throw off secondary useful effects like inventions.

Not totally convinced the analogy maps but interesting.

genewitch•1y ago
Military is certainly proof of burn...
andruby•1y ago
Hadn't thought of it in that way, but there's some merit to that if you include government, police & power in general. Law enforcement needed really high penalties on counterfeiting money and check fraude to make cash and checks work. And I guess some of that is still the case with credit card fraude.
throwanem•1y ago
"Fraud," and there is no historicity to the idea that counterfeiting and adulteration only became a problem with the introduction of paper instruments. Indeed those replaced specie in considerable part to reduce opportunities for chicanery! Gold is gold, after all.
throwanem•1y ago
Somebody spilled bong water on that before it got to you, I feel like. What backs the credible threat of military force is that the threat is credible, which is why the United States maintains a dozen carrier strike groups and does not want to have any kind of conversation at all about hypersonic weapons and especially hypersonic anti-shipping missiles.
api•1y ago
That's why I said the analogy doesn't map perfectly.

Still I do think there's some validity to the comparison. Fiat currencies are not backed by "nothing." They are backed by a state. Some percentage of the cost of operating a state is therefore "work" done to back the currency's value.

The question is: if we had a cryptocurrency backed by digital PoW that scaled to the level of fiat currencies (millions of transactions per second) and had some of their other desirable characteristics, would the state be able to proportionally shrink? That's what I'm not convinced of, but it'd be an interest experiment if we could spin up another universe and try it.

Geee•1y ago
No, this process doesn't produce "proof of work", i.e. verifiable proofs that energy has been used.
naasking•1y ago
New weights that have lower loss than the input weights is proof that work has been done.
k__•1y ago
Arweave and Filecoin use PoW algorithms that prove something useful.
bastawhiz•1y ago
> Maybe this could be used as proof of work

There's nothing provable here. Crypto proof of work is easily verified (does the hash of this value look the way I expect?). How do you prove in ~O(1) time that someone did some operation with their GPU? You don't. You don't even know what the thing is that you're training (without a trained model you don't have the ability to know whether the model the was allegedly trained learned the thing you want it to learn).

naasking•1y ago
> How do you prove in ~O(1) time that someone did some operation with their GPU? You don't.

The work in this case could be that the weights after the was done work have lower loss than the input weights. Applying the new weights to input to check that it's lower is much cheaper than calculating the weights, which is the same trend as proof of work (not sure about the magnitude of difficulty being enough to replace proof of work though).

refulgentis•1y ago
Trying again, apologies:

- Minimizing loss could be a useful heuristic on a base model. Here, we expect the distribution to be different as we are only doing RL. Measuring loss means we're measuring the difference against the base model inputs: a non-goal, we expect reasoning post RL-training to look quite different from a web scrape.

Let's set that aside. Let's say lower loss = model improved.

- Checking the loss requires the entire dataset used to train the base model + forward pass. That’s O(N·d) where N is samples, d is model size. This takes us from "cool demo of RL can be done on the edge with little benefit" to "we're shipping around terabytes of data constantly among clients"

- Proof of work as a technical term is different from proof of work as a colloquial term: the former is a cryptographic puzzle whose solution is universally and instantly checkable, while the latter just means “I can show I did something,” with no strict guarantee or uniqueness. Randomly perturbing one parameter could show "proof of work" without the work we actually wanted to be done, being done.

- Early in base model training, shaving 0.01 off the loss is easy. Later, impossible. In an RL environment, we're expecting some to go bad. In our interpretation of "loss decrease means model better means you did work", that would mean loss would increase -- that is how it learns in an RL environment. However, that does not mean no work is done.

bastawhiz•1y ago
That's far from O(1). Now you need to transfer the weights back and test them.
naasking•1y ago
> That's far from O(1). Now you need to transfer the weights back and test them.

I think what matters most is that the verification is much, much cheaper than the calculation itself to prove that work was done, it doesn't explicitly have to be O(1), eg. the magnitude difference has to exceed a certain threshold to make proof of work viable.

throwanem•1y ago
> Some percentage of the cost of operating a state is therefore "work" done to back the currency's value.

No, this is perfectly reasonable and catastrophically, dangerously inverted. We do not operate a state to generate money. We use money to fund the operation of the state. Otherwise we create a perverse incentive attracting what would be parasitism, had we not just incompetently surrendered effective beneficial ownership of the resource to the first sufficiently convincing comer.

Say, for example, the Afrikaner failson of a gemstone magnate, who is regrettably good at cosplaying a foolish person's idea of a wise person.