Open Source AI Must Win

https://opensourceaimustwin.com/?share=v2

303•vednig•1h ago

Comments

george_max•1h ago

With open-weight AI, there might not be an incentive to put large sums of capital towards training / research. There might be a donation fund of some sorts, but it certainly won't reach the level of fundraising that the frontier labs are receiving.

Because of this, I think it might not be possible to have AI *only* open-weight; major players like OpenAI, Anthropic, Google will likely stay for good, with better models than open-source versions.

I think it might look something like Photoshop & GIMP, with Photoshop being a frontier lab, and GIMP being the open-weight model. GIMP is decent for many different image editing workflows, but Photoshop is just better.

I would definitely prefer to have an open-weight model better than frontier labs'. Though I don't think it's possible.

thewebguyd•1h ago

I think the same, but I also think that local AI is actually inevitable, even if not open source models. I wouldn't be surprised to see OpenAI and others release an on-prem product. Whether that's effectively an appliance rack, or some other form, people (large companies) are going to want to run inference locally for data sovereignty & cost controls. Especially if we get to a point where companies want AI integrated into manufacturing and other air-gapped networks.

george_max•1h ago

I do believe that if OpenAI and others release an open-weight model that is better or on par with their frontier variants, it might ruin their primary business model.

That is, of course, unless they develop their own hardware specifically to run this open model. But, that does ruin the point of open models.

thewebguyd•1h ago

When/if gains slow down, I can definitely see branching out into hardware to sell for on-prem inference once the models can be etched into the silicon with hard wired weight chips. I'd guess maybe at least 5+ years away from that though.

cocoa19•30m ago

We already have this. We don't need Mythos to categorize images on my phone. A small dedicated model would do.

LPisGood•1h ago

That is fantastic news then, if commercial product products will always be better than open source, and open source products will continue to get better

george_max•57m ago

Agreed. The only "issue" is that commercial products will always be ahead, with less friction for most users. This ultimately results in most people using these over open-weight variants. Users might not even be aware that the open-model variants exist. Similar to Windows / MacOS and Linux.

bbor•50m ago

Which is the nearterm future that we must demand: a stop to the amounts of capital flowing to ASI research. Join me, Anthropic, Google, and OpenAI’s-founding-charter in saying the obvious, y’all; Pause AI, now.

It should be clear by now that there’s a whole universe of work to do with the models we have today, from studying to securing to ‘harness’ing. There are tons of economic benefits to be reaped already, if applied carefully. Doesn’t that sound nicer than rolling the dice with the lives of trillions?

mufufu•33m ago

Lives of trillions?

reilly3000•22m ago

Current and possible future populations?

tonyhart7•37m ago

the moat is in hardware, without capital intensive acquisition how tf they going to get that money ?????

I learn it hard from prusa 3d printer open model

pennomi•23m ago

Perhaps, unless there is a way for users to donate compute to training, folding@home style. I don’t see how that could be practical though.

glerk•1h ago

it is inevitable that it will win

information wants to be free

Avicebron•1h ago

Inevitable isn't "in our lifetimes"

ks2048•1h ago

“information wants to be free” - doesn’t seem correct. More like it’s easier to spread info than to hide it.

ijidak•1h ago

Intelligence is now data in the form of weights.

And once it leaks, it's permanently in the wild.

Interesting times.

planb•1h ago

This is not about information but about capital. Even if we had free access to the weights of the best models in the world: who would be able to run them?

glerk•1h ago

Technology is deflationary. I am holding in my hand a device that would have been a supercomputer 30 years ago. It costed me a couple of hundreds of dollars.

These models and the hardware they are running on will get even more efficient. We are nowhere near the physical limits of what we can achieve.

em-bee•1h ago

what is Open Source AI even?

to me Open Source, like Free Software, is something i can run on my own computer. any AI system that runs on a computer that i do not control is by my definition not Open Source.

so how then can Open Source AI win? it can't even compete. even if we collect enough money and create a dedicated Open Source organization to build and run a community owned AI datacenter, how does that help?

so what exactly is the demand here?

matheusmoreira•1h ago

We can run open weight models on our own machines.

em-bee•1h ago

yes, but a model that runs on my own machine will never have the capacity of a model that runs in a datacenter. as i said, it can't compete with that.

thewebguyd•52m ago

If RAM prices ever come down, you can have a machine that can run a capable local model.

Qwen 2.5 72B is surprisingly capable, almost on par with GPT-4o if not a little better. You can run it on a 128GB Mac Studio with 8-bit quantization. You need about 77GB for the weights and ~15GB for your context window & cache.

Pricing remains to be seen, but there's also those new nvidia laptops coming out the surface laptop ultra should have 128GB RAM w/ Blackwell GPU, they're saying 1 petaflop of AI compute, if you can tolerate Windows (no idea if it'll boot Linux until the hardware is out).

These models are roughly ~1 year or less behind the frontier models. We really just need hardware to catch up and alleviate the price pressure on RAM.

sheeshkebab•

matheusmoreira•1h ago

Winning is a tall order. I'm just hoping it'll get good enough while allowing us to run it locally with no idiotic "safety" controls or censorship of any sort. Looks like the best open weight models are at Sonnet level, if they get to Opus 4.6 level it's gonna be perfect.

avaer•1h ago

I agree with sentiment and mission, but the goal is inseparable from politics at this point.

Being Open Source (tm) will not protect you from the government/others imposing controls on your silicon or what it is allowed to do, which is already happening around the world.

Even having the models be open source won't fix the regulation or economic incentives. Which is not something you can compress into a couple of paragraphs.

AI is civilizational infrastructure and it needs civilizational solutions. Not just source.

impure•1h ago

Not to be that guy, but the correct term is Open Weight LLM. And I’d argue it already has. Many open models are already very competitive with closed models at a fraction of the cost.

MaxPock•1h ago

Were it not for China, America would have restricted the most advanced models from being used outside the US. NATO members would have access to GPT-4, with some countries entirely blocked from AI.

Biden's GPU controls should give you an idea. Thank you, China. Open source AI must win.

thewebguyd•47m ago

Unfortunately the US is no stranger to using export controls to restrict frontier technology.

Famously, the PowerMac G4 was briefly subject to export controls. Apple turned it into a marketing campaign.

sanex•15m ago

Just happened 5 hours ago.

nerfbatplz•4m ago

China unironically saved humanity. I'm no fan of the CCP but if they hadn't organized an effort to compete with the US no one else would have done it and we'd be begging our AI overlords for tokens and praying we don't get caught conducting wrongthink.

Go ask Claude to criticize Anthropic and see how long your account stays active.

mrcwinn•1h ago

Quick, someone start open data center and open energy system and open water supply.

CharlesW•1h ago

Can we assume that the author isn't using "Opensource" to mean "Openweights"?

Or are we still collectively brainwashed by the strategic false equivalence established by Big AI CMOs?

AshamedCaptain•1h ago

On this very thread you already have people talking about "open weights" and similar nonsense. What is open about them? They're free to download, but that hardly qualifies as open. Where is the source? Where are the instructions to modify and build your own?

I'd never though I'd have to utter the expression "open as in beer".

The blatant attempt at manipulating vocabulary here is... quite blatant.

singpolyma3•25m ago

There is no source because it's not software. You can of course modify and make your own.

nl•24m ago

I'm a strong proponent of Open Source (TM) but I disagree with this take.

The weights are the useful artifact here. You can modify them, fine tune them and do what you want with them.

Unlike binary software there is nothing limiting that.

It is also useful to have access to the training recipes and to some extent the data. But I'm of the opinion that learning on something is not copyright infringement, so there are many circumstances where distributing the raw training data will not be possible.

For me this is like Open Office: it is open source, and largely inspired by and learned from Microsoft Office. But they don't need to distribute MS Office for Open Office to be Open Source.

In addition there are models that meet the criteria you appear to propose. The AllenAI models are a good example.

gslepak•1h ago

Where does Anthropic or OpenAI winning leave us?

Dependents of an AI-megacorp for our "facts"? Our software? Our work?

It's possible these companies will become everyone's boss, and will dictate to everyone what everyone is allowed to work on, think, say, do, believe, etc.

Before Big Tech springs that trap, we must support and divert resources to open models.

malux85•48m ago

> Dependents of an AI-megacorp for our "facts"? Our software? Our work?

It's worse than this, it's more like our thinking. There's already plummetting math grades [1], handing over our thinking to AI megacorps where there's likely to be a monopoly or duopoly is an incredibly dangerous thing for humanity as a whole.

[1] https://www.dailycal.org/news/campus/academics/failing-grade...

george_max•37m ago

If humanity is over-reliant on frontier labs' models to perform work, the result is a dependence on the actual intelligence of these models -- not on human intelligence. This could be a small reason, on top of many others, why investors are throwing hundreds of billions of dollars a bit "carelessly" to these labs. It's fascinating seeing the models do the "hard work" (the deep, challenging thinking) for you.

The conundrum which tricks me though - is this a net negative or a positive? If humans are less intelligent, but their output is 2-3 times more intelligent (with AI), what's the result? At what point do we, as humans, stop comprehending anything and give all intelligent work to the neural nets?

And if that does happen, could we live in a society where no work, or at least a significantly less amount of work, is needed? To me, it seems like a dystopian net positive.

It might seem far-fetched to ask these, but I think these questions are getting more prevalent by the day.

wewewedxfgdf•1h ago

Yeah except for all the money it costs to do well.

gnarlouse•1h ago

BAP BAP BAP goes the Billionaire Alignment Problem

danielrmay•57m ago

I hope the news moves this debate past "open weights vs. closed APIs" as the only axis. Open weights matter, definitely, but applied AI also needs open infrastructure around the model and it feels a bit like I'm yelling into the abyss highlighting the future we're incentivizing - cognition rented from a few institutions with access changing based on policy, geopolitics and platform incentives like advertising

b33j0r•52m ago

Available components must win. I’ve often been a critic of open weights and open architectures that give very few normal people access. What’s the point of releasing the plans for a nuclear reactor if no one can have the fuel?

nektro•44m ago

the public only wins once we shut it down globally through treaties like other tech that's too dangerous for anyone to have

palisade•35m ago

I've been contemplating a decentralized model training system for some time using volunteer machines that we all contribute. But, it is astronomically difficult. The communication speeds are untenable.

And, there is the issue of data poisoning from untrusted nodes. I've almost cracked that last issue with a self-healing checkpointed rollback system that doesn't have to throw out anything that follows the corrupt datum.

But, I'm just one person with an idea and I don't have infinite funds to make this happen. This isn't a small project.

Maybe there would be interest in something like this, now that entire frontier labs are being banned from making further progress.

The total power of all GPUs on the planet dwarf their capabilities, if we had a way to harness them in a distributed way efficiently. We wouldn't be able to train a Fable as fast as them, but eventually having access is better than never having access.

thomasjeff1•33m ago

I believe we are not the only ones

Davidzheng•24m ago

Is the total compute capacity outside of meta, google, amazon, anthropic, oai and x is higher than even the capacity of any of them? In any case, there's no chance a public collaboration gets to anthropic levels of compute even if communication were no issue.

laserx•21m ago

there are some strong open source groups like NOUS research taking the fight https://nousresearch.com/

ai_fry_ur_brain•

RIshabh235•20m ago

our dependency on US AI will lead to data concentration in hands of few megacorps.

aryasyn•17m ago

Definitely, but I see the gap widening everyday, especially while commercial AI models have started converging towards AGI. However I do believe and support the cause, as it's the next big thing as developers we need to take to prevent a complete monopoly in the coming few years.

ai_fry_ur_brain•6m ago

"Converging towards AGI"

These things can't even center a div correctly half the time.

Not everything is code. Just because it generates a shitty SaaS clone for you and that seemed magical, it does not mean we are approaching "AGI".

An AGI could design an Oil tanker, manage the project from start to finish, handle all contract negotiations and purchasables, payroll, scheduling. Then it could do that 50x over and start a leading logistics firms.

In reality an LLM can't even complete upwork projects that are worth $20 an hour more than 4% or the time.

Source:

https://labs.scale.com/leaderboard/rli

4% guys, 4%. It cannot complete entry level work on fucking Upwork 96% of the time. Stop falling for the marketing and sorry but an LLM will never be AGI you people are seriously morons.

Its literally just text autocomplete with some RLHF post training, holy shit im losing my mind. I want this hype to end so badly holy shit I need this to end.

AlphaSite•17m ago

I think models will be a commodity sooner rather than later. This whole race doesnt matter. First mover advantage is real, but over enough time it wont matter.

steren•14m ago

Wasn't it the point of ... OpenAI?

abhinavsharma•9m ago

Open-source AI can, by definition, never "win". AI is just hillclimbing today, and closed labs can always absorb everything the open world does and build upon it.

It doesn't really matter for most use cases, because the way AI is working is capability saturation. https://www.delanceyukschoolschesschallenge.com/the-rising-t...

The only exception to this is fields that are inherently adversarial (to nature or others) and an edge relative to competition matters.

Qatar pursued secret talks with Iran to shield gas complex from strikes

Show HN: The A-C Coupling Theorem – Solving Diophantine Systems in O(1)

Notion Is Migrating to SwiftUI, Apple Confirms at WWDC

Fable 5 Released and Suddenly I'm More Paranoid About My VSCode Extensions

The MilkV Jupiter 2/SpacemiT K3

Palantir loses lawsuit disputing story of how Swiss govt rejected its services

Show HN: DNSweep – DNS lookup with ASN, anycast, and CDN/WAF detection

A Lean 4-verified Balansis lib to eliminate NaN and make zero-division safe

Charlie Dalin, Who Set a Sailing Record While Battling Cancer, Dies at 42

Gene Shalit, longtime 'Today' show movie critic, dies at 100

Elon Musk becomes first trillionaire

Text/Plain Blog

OpenHands Index

Let's call 'em "aigents"

Dhtmlx Gantt – JavaScript Gantt Chart (Community Edition)

CipherNode – An offline, self-correcting AI swarm compiled to a single .exe

China cracks down on Western AI models while US companies flock to DeepSeek

Forbes declares Elon Musk as the world’s first trillionaire

A generic dynamic array in C that stores no capacity and needs no struct

TempleOS running in the browser with custom emulator

Fred-80 – a fantasy console that runs on real Amiga hardware (68080 CPU)

Streamlit

Michelangelo's Prisoner Graffiti

Show HN: Lead Qualifier – Get leads qualified in minutes

Kimi-K2.7-Code

Agentic-Engineering-Handbook

Emerging Security Risks in Quantum Computing

Show HN: WebCLI – make the web browser just another agent skill

Show HN: Babel realtime calls with strangers in any language

Mythos, make me a pelican on a bicycle (in 3D)