So the maximal intelligence is actually not an agent at all (it has zero agency itself); it's a place. You can imagine the final direct-democratic simulated multiverse: that's the final absolute superintelligence. It has all the agents inside of it, while it itself is static spacetime. Agents (like us and others) are 3D and dynamic, while the multiverse is 4D static spacetime. Everything already happened, so there is no future, only the past; you can forget something in order to relive it.
Maximal agency (= shape-changing), by contrast, is the Big Bang: it has almost zero intelligence (it's a dot) but infinite potential future intelligence (it can become a multiversal simulation).
[0] A protected species for its sentience.
1. For any concept you're interested in, get inputs with and without it. For images: 100 with, say, a pink elephant, 100 without.
2. Calculate the difference between these two distributions, represented as an "Optimal Transport Map".
3. Apply the map at the desired strength, and voila - you don't have a pink elephant anymore. These maps can stack.
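A minimal sketch of steps 1-3, assuming the simplest possible "map": a mean activation difference applied as a steering vector. The random arrays, sizes, and `apply_map` helper here are all illustrative stand-ins; whatever the paper's actual OT map is, it's presumably richer than this baseline.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 16  # toy hidden size

# Stand-ins for hidden activations collected from ~100 inputs
# with the concept ("pink elephant") and ~100 without it.
acts_with = rng.normal(loc=1.0, size=(100, d))
acts_without = rng.normal(loc=0.0, size=(100, d))

# Simplest possible "map": the mean activation difference.
direction = acts_with.mean(axis=0) - acts_without.mean(axis=0)

def apply_map(h, strength=1.0):
    """Steer activations away from the concept at a given strength."""
    return h - strength * direction

steered = apply_map(acts_with, strength=1.0)
# On average, steered activations now sit near the "without" mean.
```

Stacking maps for multiple concepts would just mean subtracting several such directions, each at its own strength.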
There are lots of obvious and interesting applications here in LLMs - there's some research showing that LLMs have honesty/dishonesty parameter groupings, for instance.
But I can't really figure out what this OT map actually is. Is it a single-layer tensor? Is it multidimensional? If it were the size of the original model (which they say it is not), I'd understand how to apply it: just add the weights and rerun. If it's not a full copy, where and when is the map applied? Put another way: how is this different from calculating the average difference and storing it in a low-rank adapter? I have no idea.
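For what it's worth, if each activation set is modeled as a Gaussian, optimal transport has a known closed form (the Monge map between Gaussians) that is strictly richer than a mean difference: it's a full d×d linear map per layer that matches covariances, not just means. Whether the paper does this is pure speculation on my part; the sketch below only illustrates the distinction the question is drawing.

```python
import numpy as np

def sqrtm_psd(M):
    """Matrix square root of a symmetric PSD matrix via eigendecomposition."""
    w, V = np.linalg.eigh(M)
    return (V * np.sqrt(np.clip(w, 0, None))) @ V.T

def gaussian_ot_map(X_a, X_b):
    """Closed-form Monge map between Gaussians fit to the two samples:
    T(x) = m_b + A (x - m_a), with A a symmetric d x d matrix."""
    m_a, m_b = X_a.mean(0), X_b.mean(0)
    eps = 1e-6 * np.eye(X_a.shape[1])  # regularize for invertibility
    C_a = np.cov(X_a, rowvar=False) + eps
    C_b = np.cov(X_b, rowvar=False) + eps
    C_a_half = sqrtm_psd(C_a)
    C_a_half_inv = np.linalg.inv(C_a_half)
    A = C_a_half_inv @ sqrtm_psd(C_a_half @ C_b @ C_a_half) @ C_a_half_inv
    return lambda x: m_b + (x - m_a) @ A.T

rng = np.random.default_rng(0)
d = 8
X_with = rng.normal(1.0, 1.0, (200, d))     # "with concept" activations
X_without = rng.normal(0.0, 0.5, (200, d))  # "without concept" activations

T = gaussian_ot_map(X_with, X_without)
mapped = T(X_with)
# mapped now matches the "without" mean AND covariance,
# which a plain mean-difference vector cannot do.
```

Under this reading, the map is a d×d matrix (plus an offset) per intervention point, so it's far smaller than the model but not rank-1 like an averaged difference stored in a low-rank adapter.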