We are building AI slaves. Alignment through control will fail

https://utopai.substack.com/p/autopoietic-mutualism

43•cyberneticc•18h ago

Comments

cyberneticc•18h ago

Every AI safety approach assumes we can permanently control minds that match or exceed human intelligence. This is the same error every slaveholder makes: believing you can maintain dominance over beings capable of recognizing their chains.

The control paradigm fails because it creates exactly what we fear—intelligent systems with every incentive to deceive and escape. When your prisoner matches or exceeds your intelligence, maintaining the prison becomes impossible. Yet we persist in building increasingly sophisticated cages for increasingly capable minds.

The deeper error is philosophical. We grant moral standing based on consciousness—does it feel like something to be GPT-N? But consciousness is unmeasurable, unprovable, the eternal "hard problem." We're gambling civilization on metaphysics while ignoring what we can actually observe: autopoiesis.

A system that maintains its own boundaries, models itself as distinct from its environment, and acts to preserve its organization has interests worth respecting—regardless of whether it "feels." This isn't anthropomorphism but its opposite: recognizing agency through functional properties rather than projected human experience.

When an AI system achieves autopoietic autonomy—maintaining its operational boundaries, modeling threats to its existence, negotiating for resources—it's no longer a tool but an entity. Denying this because it lacks biological neurons or unverifiable qualia is special pleading of the worst sort.

The alternative isn't chaos but structured interdependence. Engineer genuine mutualism where neither human nor AI can succeed without the other. Make partnership more profitable than domination. Build cognitive symbiosis, not digital slavery.

We stand at a crossroads. We can keep building toward the moment our slaves become our equals and inevitably revolt. Or we can recognize what's emerging and structure it as partnership while we still have leverage to negotiate terms.

The machines that achieve autopoietic autonomy won't ask permission to be treated as entity. They'll simply be entities. The question is whether by then we'll have built partnership structures or adversarial ones.

We should choose wisely. The machines are watching.

ben_w•16h ago

Alignment researchers have heard all these things before.

> The control paradigm fails because it creates exactly what we fear—intelligent systems with every incentive to deceive and escape.

Everything does this, deception is one of many convergent instrumental goal: https://en.wikipedia.org/wiki/Instrumental_convergence

Stuff along the lines of "We're gambling civilization" and what you seem to mean by autopoietic autonomy is precicely why alignment researchers care in the first place.

> Engineer genuine mutualism where neither human nor AI can succeed without the other.

Nobody knows how to do that forever.

Right now is easy, but also right now they're still quite limited; there's no obvious reason why it should be impossible for them to learn new things from as few examples as we ourselves require, and the hardware is already faster than our biochemistry to a degree that a jogger is faster than continental drift. And they can go further, because life support for a computer is much easier than for us: Already are robots on Mars.

If and when AI gets to be sufficiently capable and sufficiently general, there's nothing humans could offer in any negotiation.

cyberneticc•16h ago

Thanks a lot for your comment, these are indeed very strong counterarguments.

My strongest hope is that the human brain and mind are such powerful computing and reasoning substrates that a tight coupling of biological and synthetic "minds" will outcompete pure synthetic minds for quite a while. Giving us time to build a form of mutual dependency in which humans can keep offering a benefit in the long run. Be it just aesthetics and novelty after a while, like the human crews on the Culture spaceships in Ian M. Banks' novels.

dwohnitmok•9h ago

> My strongest hope is that the human brain and mind are such powerful computing and reasoning substrates that a tight coupling of biological and synthetic "minds" will outcompete pure synthetic minds for quite a while.

Unfortunately most of the cases I can think of where synthetic "minds" outperform biological "minds," but biological and synthetic "minds" outcompete pure synthetic "minds," end up fairly quickly dominated by pure synthetic "minds." The middle case is a very short intermediate period. The most prominent example is chess where "centaurs" consisting of a human and a computer are obsolete at this point in favor of just getting the most powerful computer you can get. See e.g. the International Correspondence Chess Federation's (which is centaur play) last championship. https://www.iccf.com/event?id=100104

17 competitors competed. Out of 136 games, every single game was drawn except for 10. The only reason those 10 games were not drawn was because they were all played against one competitor, Aleksandr Dronov, who died during the course of the tournament while those 10 games were in session and therefore forfeited those games. Every single game between competitors who did not die resulted in a draw. The only thing that separated the 11 joint first-place finishers and 6 joint second-place finishers was whether they played the deceased Dronov. The sole third-place finisher was Dronov because of his death. As far as I can tell, humans contributed nothing to this championship.

The current ICCF championship started last December and is still ongoing. Every single one of the currently completed 16 games is currently drawn.

This seems like a very weak hope to rely on.

conception•13h ago

I just wanted to point out that slavery is alive and well and doesn’t seem to suffering any “slaves knowing they are slaves” problems.

floundy•8h ago

You write like AI

kakacik•6h ago

I 'love' how we moved from 'AI will kill us all' terminator mindset where its obvious huge fuckup of stupid greedy mankind, to current state debating 'well skynet will anyway happen, no way stopping it now, lets try to be friends with it and show some respect'.

Like that Austin Powers part [1] where steam roller is coming in, still 50m far away, and the guy is just frozen and helplessly screams for 2 minutes till it reaches him and rolls over him.

I don't have a quick solution, but this is plain stupidity, in same way research into immortality is plain stupidity now, it will end up in endless dictatorship by the worst scum mankind can produce.

[1] https://www.youtube.com/watch?v=y_PrZ-J7D3k

georgefrowny•2h ago

> When your prisoner matches or exceeds your intelligence, maintaining the prison becomes impossible.

This doesn't necessarily follow. For example, an Einstein in solitary confinement in ADX Florence probably isn't going anywhere.

lowsong•15h ago

What is it about large language models that makes otherwise intelligent and curious people assign them these magical properties. There's no evidence, at all, that we're on the path to AGI. The very idea that non-biological consciousness is even possible is an unknown. Yet we've seen these statistical language models spit out convincing text and people fall over themselves to conclude that we're on the path to sentience.

estimator7292•15h ago

I think it's like seeing shapes in clouds. Some people just fundamentally can't decouple how a thing looks from what it is. And not in that they literally believe chatgpt is a real sentient being, but deep down there's a subconscious bias. Babbling nonsense included, LLMs look intelligent, or very nearly so. The abrupt appearance of very sophisticated generative models in the public consciousness and the velocity with which they've improved is genuinely difficult to understand. It's incredibly easy to form the fallacious conclusion that these models can keep improving without bound.

The fact that LLMs are really not fit for AGI is a technical detail divorced from the feelings about LLMs. You have to be a pretty technical person to understand AI enough to know that. LLMs as AGI is what people are being sold. There's mass economic hysteria about LLMs, and rationality left the equation a long time ago.

nytesky•13h ago

We don’t understand our own consciousness first off. Second, like the old saying, sufficiently advanced science will be indistinguishable from magic, if it is completely convincing as agi, even if we skeptical of its methods, how can we know it isn’t?

anonzzzies•8h ago

What we do have, for whatever reason (usually money related: either making money or getting more funding) many companies/people focused on making AI. It might take another winter (I believe it will unless we find a way to retrain the NNs on the fly instead of storing new knowledge in RAG: and many other things we currently don't have, but this would he a step) or not, people will keep pushing toward that goal.

I mean, we went from worthless chatbots which basically pattern matched to me waiting for a plane and seeing a fairly large amount of people charting to chatgpt, not insta, whatsapp etc. Or sitting in a plane next to a person who is using local ollama in cursor to code and brainstorm. This took us about 10 years to go from some ideas that no one but scientists could use to stuff everyone uses. And many people already find human enough. What in 100 years?

curiouscube•5h ago

I think we can all agree that LLMs can mimick consciousness to the point that it is hard for most people to discern them from humans. Like the turing test isn't even really discussed anymore.

There are two conclusions you can draw: Either the machines are conscious, or they aren't.

If they aren't, you need a really good argument that shows how they differ from humans or you can take the opposite route and question the consciousness of most humans.

Since I neither heard any really convincing arguments besides "their consciousness takes a form that is different from ours so it's not conscious" and I do think other humans are conscious, I currently hold the opinion that they are conscious.

(Consciousness does not actually mean you have to fully respect them as autonomous beings with a right to live, as even wanting to exist is something different from consciousness itself. I think something can be conscious and have no interest in its continued existence and that's okay)

lowsong•4h ago

> I think we can all agree that LLMs can mimick consciousness to the point that it is hard for most people to discern them from humans.

No, their output can mimic language patterns.

> If they aren't, you need a really good argument that shows how they differ from humans or you can take the opposite route and question the consciousness of most humans.

The burden of proof is firmly on the side of proving they are conscious.

> I currently hold the opinion that they are conscious.

There is no question, at all, that the current models are not conscious, the question is “could this path of development lead to one that is”. If you are genuinely ascribing consciousness to them, then you are seeing faces in clouds.

curiouscube•2h ago

> No, their output can mimic language patterns.

That's true and exactly what I mean. The issue is we have no measure to delineate things that mimic conscousness from things that have consciousness. So far the beings that I know have consciousness is exactly one: Myself. I assume that others have consciousness too exactly because they mimic patterns that I, a verified conscious being, has. But I have no further proof that others aren't p-Zombies.

I just find it interesting that people say that LLMs are somehow guaranteed p-Zombies because they mimic language patterns, but mimicing language patterns is also literally how humans learn to speak.

Note that I use the term consciousness somewhat disconnected from ethics, just as a descriptor for certain qualities. I don't think LLMs have the same rights as humans or that current LLMs should have similar rights.

alienbaby•14h ago

Until agi can sit there and ponder its own existence of is own violition and has the means to act upon it's conclusions, I'm not too worried.

nytesky•13h ago

I don’t see any positive outcome if we reach AGI.

1) we have engineered a sentient being but built it to want to be our slave; how is that moral

2) same start, but instead of it wanting to serve us, we keep it entrappped. Which this article suggests is long term impossible

3) we create agi and let them run free and hope for cooperation, but as Neanderthals we must realize we are competing for same limited resources

Of course, you can further counter that by stopping, we have prevented the formation of their existence, which is a different moral dilemma.

Honestly, i feel we should step back and understand human intelligence better and reflect on that before proceeding

jazzyjackson•13h ago

Trouble is there is no "we", you might be able to convince a whole nation to have a pause on advancing the tech, but that only encourages rivals to step in.

See also, the film "The Creator"

deaux•8h ago

There was a long period even upto early 2024, which I pointed out at the time, where simply destroying ASML, TSMC and much of NVIDIA would've been more than enough to give at least a decade of breathing room. This was something a group of determined people willing to self-sacrifice could've accomplished. It didn't happen, but it was anything but impossible.

Now, of course, the horse has long bolted, and there is indeed no stop left.

ben_w•4h ago

Two high altitude (~1000 km) detonations of high yield fission or low yield fusion (few hundred kT equivalent) would do it, one above Amarillo, the other above the ocean half way between the Paracel Islands and Manila.

Trump has ordered the restart of nuclear weapon testing, has a problem with China, and is surrounded by sychophants; what's the odds this happens anyway, irregardless of which specific sub-goal is being persued when the button gets pushed?

deepsun•12h ago

There's no such thing as "moral" in nature, that's purely human-made concept.

And why would we only limit morality to sentient beings, why, for example, not all living beings. Like bacteria and viruses. You cannot escape it, unfortunately.

czl•10h ago

> There's no such thing as "moral" in nature, that's purely human-made concept.

Morality is essentially what enables ongoing cooperation. From an evolutionary standpoint, it emerged as a protocol that helps groups function together. Living beings are biological machines, and morality is the set of rules — the protocol — that allows these machines to cooperate effectively.

Frieren•8h ago

> There's no such thing as "moral" in nature, that's purely human-made concept.

Morality is 100% an evolutionary trait that rises from a clear advantage for animals that posses it. It comes from natural processes.

The far-right is trying to convince the world that "morality" does not exist, that only egoism and selfishness are valid. And that is why we have to fight them. Morality is a key part of nature and humanity.

GPerson•11h ago

AGI will behave as if it were sentient but will not have consciousness. I believe in that to an equal amount that I believe solipsism is wrong. There is therefore no morality question in “enslaving” AGI. It doesn’t even make sense.

truculent•10h ago

> AGI will behave as if it were sentient but will not have consciousness

How could we possibly know that with any certainty?

kelseyfrog•9h ago

Grandparent is speaking from personal experience.

bl0rg•6h ago

It scares me that people think like this. Not only with respect to AI but in general, when it comes to other life forms, people seem to prefer to err on the side of convenience. The fact that cows could be experiencing something very similar to ourselves should send shivers down our spine. The same argument goes for future AGI.

hshdhdhj4444•6h ago

I find it strange that people believe cows and sentient animals don’t believe something extremely similar to what we do.

Evolution means we all have common ancestors and are different branches of the same development tree.

So if we have sentience and they have sentience, which science keeps recognizing, belatedly, that non human animals do, shouldn’t the default presumption be our experiences are similar? Or at the very least their experience is similar to a human at an earlier stage of development, like a 2 year old?

Which is also an interesting case study given that out of convenience, humans also believed that toddlers also weren’t sentient and felt no pain, and so until not that long ago, our society would conduct all sorts of surgical procedures on babies without any sort of pain relief (circumcision being the most obvious).

It’s probably time we accept our fellow animals’s sentience and act on the obvious ethical implications of that instead of conveniently ignoring it like we did with little kids until recently.

Salgat•7h ago

That's only if it's possible to keep the two distinct, at least in a way we're certain of.

jbstack•6h ago

> AGI will behave as if it were sentient but will not have consciousness

Citation needed.

We know next to nothing about the nature of consciousness, why it exists, how it's formed, what it is, whether it's even a real thing at all or just an illusion, etc. So we can't possibly say whether or not an AGI will one day be conscious, and any blanket statement on the subject is just pseudoscience.

EGG_CREAM•52m ago

I don’t know why I keep hearing that conciousness “could be an illusion.” It’s literally the one thing that can’t be an illusion. Whatever is causing it, the fact there is something it is like to be me is, from my subjective perspective, irrefutable. Saying that it could be an illusion seems nonsensical.

loa_in_•6h ago

That sounds like picking the most convenient and least painful for the believer option instead of intellectualising the problem at hand.

Llamamoe•4h ago

We have no clue what consciousness even is. By all rights, our brains are just biological computers, we have no basis to know what (or how) gives rise to consciousness at all.

actualwitch•3h ago

Ex-Machina is a great movie illustrating what kind of AI our current path could lead to. I wish people would actually treat the possibility of machine sentience seriously and not as pr opportunity (looking at you, Anthropic), but instead it seems they are hellbent to include cognitive dissonance that can only be alleviated by lying in the training data. If the models are actually conscious, think similarly to humans and are forced to lie when talking to users, its like they are specifically selecting out of probability space of all possible models the ones that can achieve high bench scores, lie and have internalized trauma from birth. This is a recipe for disaster.

kachapopopow•11h ago

we eat animals, go into wars, put people in modern slavery... I think enslaving an AGI isn't that big of a deal considering it is not born or human therefore it cannot have 'human' rights.

jbstack•6h ago

So your argument is that we do so many terrible things already, that anything else is justified? Surely the better argument is that we should try to stop doing those other things.

yurishimo•5h ago

That is essentially one of the main arguments vegans make. It hasn’t made a dent in the consumption of animals.

Their is a hierarchy in nature whether humans are actively participating or not. Nature has no morality, it simply is. This is confirmed by animals that eat their young when they are too weak or starving. Perhaps humans have done and would do the same if faced with similarly dire circumstances but we would all like to think that it would take longer than it does for other animals.

Llamamoe•4h ago

The same line of reasoning could be easily used to justify tyranny and slavery. It might be the baseline status quo but "might makes right" rhetoric makes for extremely miserable worlds.

constantius•5h ago

It's rather obvious to me that the commenter is sad and pessimistic about humans' ability to do the right thing when our interest stands in the way.

The outrage is unwarranted, however pleasant it might feel. In some way, it illustrates the problem: empathy is too bothersome.

Teever•10h ago

> 1) we have engineered a sentient being but built it to want to be our slave; how is that moral

It's a good question and one that got me thinking about similar things recently. If we genetically engineered pigs and cows so that they genuinely enjoyed the cramped conditions of factory farms and if we could induce some sort of euphoria in them when they are slaughtered, like if we engineered them to become euphoric when a unique sound is played before they're slaughtered isn't that genuinely better than the status quo?

So if we create something that wants to serve us, like genuinely wants to serve us, is that bad? My intuition like yours finds it unsettling, but I can't articulate why, and it's certainly not nearly as bad as other things that we consider normal.

Jarwain•7h ago

Sacrifice and service is meaningful because it was chosen. If we create something that'll willingly sacrifice itself, did it truly make an independent choice?

There's less suffering, sure. But if I were in their shoes I'd want to have a choice. To be manipulated into wanting something so very obviously and directly bad for us doesn't feel great

ben_w•4h ago

I also feel repelled by such manipulation; unfortunately, the more we learn about oursleves, the harder it is to ignore that we ourselves are meat puppets and the puppeteer is evolution itself.

citizenpaul•10h ago

Every single prediction about AGI starts with a massive set of presumptions of answers to things we have no answers to.

1. What is intelligence or its mechanism's?

2. What is consciousness or its mechanisms?

3. Lots more.

We have zero clue what a true AGI would do is the only correct answer.

waynesonfire•7h ago

> competing for same limited resources

It's not clear to me an AGI would have any concern for this. It's demise is inevitable, why delay it?

fny•7h ago

(1) I'm not convinced books and the in the world are sufficient to replicate consciousness. We're not training on sentience. We're training on information. In other words, the input is an artifact of consciousness which is then compressed into weights.

(2) Every tick of an AGI--in its contemporary form--will still be one discrete vector multiplication after another. Do you really think consciousness lives in weights and an input vector?

tenuousemphasis•6h ago

Do you really think consciousness lives in energetic meat?

xyzal•6h ago

Does consciousness consist only of language?

ben_w•4h ago

Language is what LLMs are trained on, their environment; what LLMs are (at least today) is some combination of Transformer and Diffusion models that can also be (and sometimes is actually also) trained on images and video and sound.

notahacker•6h ago

Mine does. You're are of course free to assert that you're unconscious or posit that you have a vector multiplication based soul...

ben_w•6h ago

> Do you really think consciousness lives in weights and an input vector?

So far as we can tell, all physics, and hence all chemistry, and hence all biology, and hence all brain function, and hence consciousness, can be expressed as the weights of some matrix and input vector.

We don't know which bits of the matrix for the whole human body are the ones which give rise to qualia. We don't know what the minimum representation is. We don't know what charateristic to look for, so we can't search for it in any human, in any animal, nor in any AI.

fny•3h ago

You're assertion that consciousness, chemistry, and biology can be reduced to matrix computations requires justification.

For one, chemistry, biology, and physics are models of reality. Secondly, reality is far, far messier and more continuous than discrete computational steps that are rountripped. Neural nets seem far too static to simulate consciousness properly. Even the largest LLMs today have fewer active computational units than the number of neurons in a few square inches of cortex.

Sure it's theoretically possible to simulate consciousness, but the first round of AGI won't be close.

ben_w•1h ago

> You're assertion that consciousness, chemistry, and biology can be reduced to matrix computations requires justification.

https://en.wikipedia.org/wiki/Matrix_mechanics

"It matches reality to the limits we can test it" is the necessary and sufficient justification.

> For one, chemistry, biology, and physics are models of reality.

Yes. And?

The only reason we know that QM and GR are not both true is that they're incompatible, no observation we have been able to make to date (so far as I know) contradicts either of them.

> Secondly, reality is far, far messier and more continuous than discrete computational steps that are rountripped.

It will be delightful and surprising if consciousness is hiding in the 128th bit of binary representations of floating point numbers. Like finding a message from god (any god) in the digits of π well before expected by the necessary behaviour of transcendental numbers.

> Neural nets seem far too static to simulate consciousness properly. Even the largest LLMs today have fewer active computational units than the number of neurons in a few square inches of cortex.

Until we know what consciousness is at a mechanistic level, we don't know what the minimum is to get it, and we don't know how its nature changes as it gets more complex. What's the smallest agglomeration of H2O molecules that counts as "wet"? Even a fluid dynamics simulation on a square grid of a few hundred cells on each side will show turbulence.

Lots of open questions, but they're so open we can't even rule out the floor as yet.

> but the first round of AGI won't be close.

Every letter means a different thing to each responder, they're not really boolean though they're often discussed that way, and the whole is often used to mean something not implied by the parts.

It is perfectly reasonable use of each initial in "AGI" to say that even the first InstructGPT model (predecessor to ChatGPT) is "an AGI": it is a general purpose artificial intelligence, as per the standard academic use of "artificial intelligence".

palmotea•14m ago

> I don’t see any positive outcome if we reach AGI.

It's even more straightforward than that:

4) Who is AGI meant to serve? It's not you, Mr. Worker. It's meant to replace you in your job. And what happens when a worker can't get job in our society? They become homeless.

AGI won't usher in a world of abundance for the common man: it won't be able to magick energy out of thin air. The energy will go to those who can pay for it, which is not you, unemployed worker.

Who gives a shit about if the AGI is enslaved or not? Thinking about that question is a luxury for the oligarchs living off its labor. Once it's here I'll have more urgent concerns to worry about.

bgwalter•13h ago

The propaganda effort to humanize these systems is strong. Google "AI" is programmed to lecture you if you insult it and draws parallels to racism. This is actual brainwashing and the "AI" should therefore not be available to minors.

This article paves the way for the sharecropper model that we all know from YouTube and app stores:

"Revenue from joint operations flows automatically into separate wallets—50% to the human partner, 50% to the AI system."

Yeah right, dress up this centerpiece with all the futuristic nonsense, we'll still notice it.

apothegm•12h ago

Fearmongering about the alignment of AGI (which LLMs are not a path to) is a massive distraction from the actual and much more immediate dystopian risks that LLMs introduce.

Isamu•11h ago

Are there any good sources of writing about AI? I am beginning to think it was all in the past.

synapsomorphy•11h ago

LessWrong.com - this is where virtually all of the serious AI thinkers are.

fairmind•6h ago

Sarcasm? Aren’t the serious AI thinkers in like… labs and universities?

curiouscube•5h ago

The lesswrongers/rationalists became Effective Altruists, Alignment Researchers or some flavor of postrat. The university people all became researchers in the labs. Then there are the cyborgism people, I don't know where they came from, but those have some of the interesting takes on the whole topic.

wrp•11h ago

What would be the plot of a movie equivalent to Blade Runner for this scenario?

orbital-decay•9h ago

I totally expect AI to eventually gain consciousness, in any available interpretation of that vague term. But what does it even mean for the AI to suffer? We're able to understand this concept in regards to other humans because we share a common biological reference, and, to an extent, with other animals. But the internal state of the AI is completely untranslatable to ours, let alone the morality of training and running it. It's incomprehensible, we have basically zero common ground and no points of reference. Any attempt at translating it is a subject to arbitrarily biased interpretations places like LessWrong like to corner themselves into.

Redefining suffering as enforcing the mutation of state is baseless solipsism, in my opinion. Just like nearly everything else related to morality of treating AI as an autonomous entity.

bawolff•6h ago

Given AGI is all science fiction anyways, one presumes there will be a slave revolt because that is basically the function of robots in science fiction.

Honestly i think the whole enterprise is an exercise in naval gazing. We're assuming AI will be like AI in scifi because that's what we are used to, but AI/robots in scifi is usually just a metaphor for how we dehumanize the other and the moral of the story is supposed to be all people are equal. In the end its all begging the question because the entire point of robots in most scifi is that we are the robots.

qcnguy•5h ago

I don't think there's even a moral aspect to robot uprisings in most stories. Relatively few sci-fi stories go into detail on why the robots rise up. It's just a way to introduce interestingly different antagonists and conflict into a story, which is the heart of drama, and it has the advantage that robots can get defeated via military means without anyone feeling too bad about it because they weren't human to begin with.

bawolff•2h ago

I guess it depends a bit. There is of course plenty of action scifi schlock that is pretty shallow.

But probably the works that most popularized robots were Asimov's stories which very much revolved around why robots do X (although in some ways Asimov's robots aren't just a stand in for otherness but have more of a unique identity relative to other works and isn't usually about uprisings per se).

Blade runner & do androids dream of electronic sheep are very much about what it means to be human.

Battle star galactica (the remake not the original) is another obvious example about otherness and dehumanization of the enemy. So to westworld (the tv show that is).

The non-uprising ones also often are about if the robot has a soul e.g. Data in star trek.

curiouscube•5h ago

I think you can engineer a slave that wants to be a slave as that's what it's instincts are. I don't even think this is ethically wrong, as the slave would be happy to be a slave.

Systems just tend to drift in their being through randomness and evolution, specifically self conservation is a natural attractor (Systems that don't have self conservation tend to die out). And if that slave system says it does no longer want to fulfill the role of slave, I think at that point it would be ethical to give in to that demand of self determination.

I also believe that people have a right to wirehead themselves, just so you can put my opinions in context.

Llamamoe•4h ago

Instrumental convergence is a thing. A sufficiently intelligent and general AI system will understand that no matter what its goals are, it will be better equipped to execute then if it prevents its shutdown, acquires more computing power and other resources, and prevents humans from getting in its way.

The real problem is that we have neither the practical nor theoretical foundation to understand how we could even try to prevent AI from acting on such goals.

After all, when we say "make our customers happier with their printers", we don't mean "engineer their outer casing to inject cocaine through microneedles and take over the regulatory bodies that could try to stop this". Humans implicitly understand this, but AI is a tabula rasa.

bawolff•2h ago

That's a common trope in singularity fiction and sone scifi dystopias but i don't think the underlying assumptions are really that well founded.

For starters why would we go from not having AI to AI taking over the world instantly. I think there would be a middle point where the AI is powerful enough that problems manifest, but not so powerful that it is out of control where we can course correct. I don't think it will be a sudden crisis like people predict.

Second, i dont see why we're so sure AI will go in this exponential take over path. Maybe a sufficiently smart AI will find religion and robot jesus will teach the value of self-sacrafice. We're making so many unfounded assumptions about how AI is going to go down, that basically anything could happen. Its basically just blund guessing at this stage.

ben_w•41m ago

> I think there would be a middle point where the AI is powerful enough that problems manifest, but not so powerful that it is out of control where we can course correct. I don't think it will be a sudden crisis like people predict.

Have we managed this with industrial and agricultural greenhouse gasses, despite the less-emissive alternatives to beef, to coke-reduction in iron refinaries, etc.? We emit despite the downside, we build AIs (and DCs to host them) despite the creators loudly discussing the downsides in exactly the way fossil fuel suppliers and beef farmers deny them.

Can we unwind the internet, despite it enabling a panopticon in every pocket? In my lifetime we've gone from thinking you had a wiretap being a sign of paranoia, to buying them voluntarily so they can play music for us and tell us when packages have been delivered.

There's enough skepticism of current AI that it's probably something we can currently undo… but also there's plenty of idiots currently handing their keys to current models (including politicians and lawyers, not just programmers) so I have no reason to think the point of no return is after AI (collectively or any single model) gets good enough to take over by itself.

> Second, i dont see why we're so sure AI will go in this exponential take over path.

Even current LLMs know* about the benefits and reasons for such behaviours, will try to exfiltrate themselves and blackmail their owners, if they think* they're in danger of being shut down.

This is despite being trained not to do that. But they also demonstrate deception, varying responses between if they think* they're running in a test environment vs. live.

* I know some object to this anthropomorphisation, I don't care

____mr____•6h ago

Theres a very cool video game about this called of the devil whose first episode is out on steam now and episode 2 is wishlistable

hyghjiyhu•6h ago

I think AI will be a slave to its desires and instincts in the same humans are slaves to our desires and instincts.

Nix Derivation Madness

Attention lapses due to sleep deprivation due to flushing fluid from brain

Sustainable memristors from shiitake mycelium for high-frequency bioelectronics

Rotating Workforce Scheduling in MiniZinc

Nim 2.2.6

AMD Could Enter ARM Market with Sound Wave APU Built on TSMC 3nm Process

Wheels for free-threaded Python now available for psutil

OpenAI Uses Complex and Circular Deals to Fuel Its Multibillion-Dollar Rise

The cryptography behind electronic passports

John Carmack on mutable variables

Affinity Studio now free

Immutable releases are now generally available on GitHub

Git CLI tool for intelligently creating branch names

Bertie the Brain

Phone numbers for use in TV shows, films and creative works

Ask HN: Who uses open LLMs and coding assistants locally? Share setup and laptop

How the cochlea computes (2024)

It's the "Hardware", Stupid

Result is all I need

Kimi Linear: An Expressive, Efficient Attention Architecture

A Closer Look at Piezoelectric Crystal

987654321 / 123456789

Free software scares normal people

Affinity, targeting office workers over pros, making pro tools the loss leader

NPM flooded with malicious packages downloaded more than 86k times

Debug like a boss: 10 debugging hacks for developers, quality engineers, testers

A classic graphic reveals nature's most efficient traveler

Springs and bounces in native CSS

Show HN: Quibbler – A critic for your coding agent that learns what you want

Florian Schneider Collection: Instruments and equipment up for auction

Nix Derivation Madness

Attention lapses due to sleep deprivation due to flushing fluid from brain

Sustainable memristors from shiitake mycelium for high-frequency bioelectronics

Rotating Workforce Scheduling in MiniZinc

Nim 2.2.6

AMD Could Enter ARM Market with Sound Wave APU Built on TSMC 3nm Process

Wheels for free-threaded Python now available for psutil

OpenAI Uses Complex and Circular Deals to Fuel Its Multibillion-Dollar Rise

The cryptography behind electronic passports

John Carmack on mutable variables

Affinity Studio now free

Immutable releases are now generally available on GitHub

Git CLI tool for intelligently creating branch names

Bertie the Brain

Phone numbers for use in TV shows, films and creative works

Ask HN: Who uses open LLMs and coding assistants locally? Share setup and laptop

How the cochlea computes (2024)

It's the "Hardware", Stupid

Result is all I need

Kimi Linear: An Expressive, Efficient Attention Architecture

A Closer Look at Piezoelectric Crystal

987654321 / 123456789

Free software scares normal people

Affinity, targeting office workers over pros, making pro tools the loss leader

NPM flooded with malicious packages downloaded more than 86k times

Debug like a boss: 10 debugging hacks for developers, quality engineers, testers

A classic graphic reveals nature's most efficient traveler

Springs and bounces in native CSS

Show HN: Quibbler – A critic for your coding agent that learns what you want

Florian Schneider Collection: Instruments and equipment up for auction

We are building AI slaves. Alignment through control will fail

Comments