"They're made out of weights"

https://maxleiter.com/blog/weights

58•MaxLeiter•3h ago

Comments

turtleyacht•1h ago

Numbers that dream.

CSSer•1h ago

It works until they get to the sentience part. Neat idea!

margalabargala•49m ago

Even there it works a bit.

> These models are the only other things we've ever met that can hold a conversation, and they're made out of weights

Is a fair point.

RodgerTheGreat•23m ago

Not especially. Depending on where you set your standards for "holding a conversation" you can satisfy the requirement with a classical markov chatterbot, a well-trained parrot, a copy of Eliza, or a telemarketer flowchart drawn on a sheet of paper. Only the markov bot is made out of "weights" in the sense of a statistical model.

Parrots are intelligent animals, albeit with a limited capacity for vocabulary and syntax compared to a human, and Eliza and the flowchart are made out of explicitly encoded rules and conversational tactics.

noosphr•56m ago

It's not often I see something that's fractally wrong but here we are.

There is a dictionary, it's called the tokenizer.

There are grammar rules, they are just very weak because the structure of human language is generally quite weak. When presented with languages which have strong consistent grammars the weights are very easily interpretable as a grammar: https://arxiv.org/abs/2201.02177

The point of the original short story is that the computational substrate doesn't matter when you have Turing completeness. This one seems to think that you don't need structure and interpretability just because you change substrates.

benlivengood•29m ago

I don't think the grokking paper is a great argument for the difference between weights and meat. E.g. https://en.wikipedia.org/wiki/Cortical_Labs learning to play Pong.

The tokenizer is, at best, a sensory mechanism as evidenced by 1) the random generation of the tokenization scheme, and 2) vastly different tokenization schemes produce virtually identical behavior. It'd be like if Noah Webster threw a bunch of movable type into a bucket (breaking some words in half) and then drew randomly to make the first English dictionary.

noosphr•11m ago

I'm kind of stunned that someone is using my work to tell me I'm wrong. I wrote the code for the dish brain pong and encoding information was a huge part of what that experiment was about.

So when I way that the grok paper and the pong paper fundamentally agree I have some idea of what I'm talking about.

throw310822•22m ago

oofbey•9m ago

I love this. For anybody not getting the joke, it’s riffing on the classic 1990s essay “They’re made out of meat.”

https://web.mit.edu/people/dpolicar/writing/prose/text/think...

U.S. to Dismantle System Tracking Atlantic Currents That Are at Risk of Collapse

Elixir v1.20: Now a gradually typed language

I built a vulnerable app and spent $1,500 seeing if LLMs could hack it

Gemma 4 12B: A unified, encoder-free multimodal model

The ways we contain Claude across products

"They're made out of weights"

I was recently diagnosed with anti-NMDA receptor encephalitis

Artificial intelligence is not conscious – Ted Chiang

Uber's $1,500/month AI limit is a useful signal for AI tool pricing

DaVinci Resolve 21

Meteor Explodes over Massachusetts

The Ü Programming Language

Failing grades soar with AI usage, dwindling math skills in Berkeley CS classes

Ableton Extensions SDK

Gooey: A GPU-accelerated UI framework for Zig

A Post-Quantum Future for Let's Encrypt

Patching my guitar amp's firmware

Launch HN: Hyper (YC P26) – Company brain to power agentic development

Journey to JPEG XL: open-source experiments shaped the future of image coding

ESP32-S31

Stop Killing Games

A Mathematician's Lament – Paul Lockhart (2002) [pdf]

Algorithmic Theming Engines

Self-hosted dev sandboxes with preview URLs (Docker, Go, no K8s)

Mathematicians issue warning as AI rapidly gains ground

Skyvern (YC S23) Is Hiring Open-Source Loving DevRel Engineers

Embryos shape their limbs: a key discovery of "genetic brakes"

A Man Who Reads Books for a Living (One Every Two Days)

PlayStation Architecture

Show HN: Bio Glyph – Turn Your Face into a One-Line Drawing

"They're made out of weights"

Comments

U.S. to Dismantle System Tracking Atlantic Currents That Are at Risk of Collapse

Elixir v1.20: Now a gradually typed language

I built a vulnerable app and spent $1,500 seeing if LLMs could hack it

Gemma 4 12B: A unified, encoder-free multimodal model

The ways we contain Claude across products

"They're made out of weights"

I was recently diagnosed with anti-NMDA receptor encephalitis

Artificial intelligence is not conscious – Ted Chiang

Uber's $1,500/month AI limit is a useful signal for AI tool pricing

DaVinci Resolve 21

Meteor Explodes over Massachusetts

The Ü Programming Language

Failing grades soar with AI usage, dwindling math skills in Berkeley CS classes

Ableton Extensions SDK

Gooey: A GPU-accelerated UI framework for Zig

A Post-Quantum Future for Let's Encrypt

Patching my guitar amp's firmware

Launch HN: Hyper (YC P26) – Company brain to power agentic development

Journey to JPEG XL: open-source experiments shaped the future of image coding

ESP32-S31

Stop Killing Games

A Mathematician's Lament – Paul Lockhart (2002) [pdf]

Algorithmic Theming Engines

Self-hosted dev sandboxes with preview URLs (Docker, Go, no K8s)

Mathematicians issue warning as AI rapidly gains ground

Skyvern (YC S23) Is Hiring Open-Source Loving DevRel Engineers

Embryos shape their limbs: a key discovery of "genetic brakes"

A Man Who Reads Books for a Living (One Every Two Days)

PlayStation Architecture

Show HN: Bio Glyph – Turn Your Face into a One-Line Drawing