Block Diffusion: Interpolating Autoregressive and Diffusion Language Models

72•t55•9mo ago

Comments

notrealyme123•9mo ago

This was posted here already a few weeks ago.

holoduke•9mo ago

Whenever I try to read and understand this paper, I feel extremely dumb. I have my degree in CS, but this is just too complex for me to understand.

AlexCoventry•9mo ago

Ask ChatGPT o3 about anything you don't understand, ask it about anything in its responses you don't understand. Keep drilling down until you do understand. Takes patience, but you can learn a lot very fast, this way.

echelon•9mo ago

ChatGPT o3 understands the latest literature and isn't going to hallucinate weird details or make incorrect analogies or math?

I'd worry about learning the wrong things.

Ey7NFZ3P0nzAe•9mo ago

I disagree. It's all about rephrasing information that is in the paper. Possinly a few other papers too.

vessenes•9mo ago

o3 with a pdf or in deep research mode is excellent. Especially if you’re disciplined about staying to what’s research. But really, it’s excellent, better than benchmarks indicate, I’d say.

AlexCoventry•9mo ago

Actually, in the past few days o3 has proven fairly unreliable for me. I've gone back to o1-pro. But when I wrote the above it was reasonably reliable.

evertedsphere•9mo ago

an undergraduate degree in a field is not enough to understand recent research in a specialised subfield of a subfield and you shouldn't beat yourself up over that

there's nothing wrong with you, you just need the right background and you can go get that. see e.g. the fast.ai course

smrtinsert•9mo ago

Do you mean the fast.ai stable diffusion lectures? The initial series doesn't get too deep at all from what I remember.

IncreasePosts•9mo ago

Might want to study some stats or other math.

tippytippytango•9mo ago

I wouldn’t beat yourself up over it. Very few papers can be understood without reading a significant amount of the neighboring literature and the history of how that work came to be. There are norms and customs and a kind of academic language in every community that you won’t be able to see unless you’ve read a lot from that community. Even if you have the right math level it’s tricky.

A single paper is part of a conversation, not something that stands alone. Trying to read one random paper is like finding a 1000 page thread on an obscure topic that has been running for 10+ years and reading only the last page. It won’t make any sense without reading back a ways.

nh23423fefe•9mo ago

depth first read the references until the leaves are obvious!

blurbleblurble•9mo ago

Wow.

I can't wait to see ideas from the diffusion image generation world (like controlnet) work their way into language models.

joejoo•9mo ago

There’s already a few models that are diffusion based.

soulofmischief•9mo ago

I've built diffusion based text models, it's old hat and not necessarily the most performant way to generate text. However it does produce interesting results and I'd love to test some ideas at scale.

gitroom•9mo ago

Yeah I always end up lost in papers like this too, even with my CS degree, the research keeps leveling up nonstop.

LineageOS 23.2

Crypto Deposit Frauds

Substack makes money from hosting Nazi newsletters

Framing an LLM as a safety researcher changes its language, not its judgement

Are there anyone interested about a creator economy startup

Show HN: Skill Lab – CLI tool for testing and quality scoring agent skills

2003: What is Google's Ultimate Goal? [video]

Roger Ebert Reviews "The Shawshank Redemption"

Busy Months in KDE Linux

Zram as Swap

Green’s Dictionary of Slang - Five hundred years of the vulgar tongue

Nvidia CEO Says AI Capital Spending Is Appropriate, Sustainable

Show HN: StyloShare – privacy-first anonymous file sharing with zero sign-up

Part 1 the Persistent Vault Issue: Your Encryption Strategy Has a Shelf Life

Show HN: Teleop_xr – Modular WebXR solution for bimanual robot teleoperation

The Highest Exam: How the Gaokao Shapes China

Open-source framework for tracking prediction accuracy

India's Sarvan AI LLM launches Indic-language focused models

Show HN: CryptoClaw – open-source AI agent with built-in wallet and DeFi skills

ShowHN: Make OpenClaw respond in Scarlett Johansson’s AI Voice from the Film Her

CReact Version 0.3.0 Released

Show HN: CReact – AI Powered AWS Website Generator

The rocky 1960s origins of online dating (2025)

Show HN: Agent-fetch – Sandboxed HTTP client with SSRF protection for AI agents

Why there is no official statement from Substack about the data leak

Effects of Zepbound on Stool Quality

Show HN: Seedance 2.0 – The Most Powerful AI Video Generator

Ask HN: Do we need "metadata in source code" syntax that LLMs will never delete?

Pentagon cutting ties w/ "woke" Harvard, ending military training & fellowships

Can Quantum-Mechanical Description of Physical Reality Be Considered Complete? [pdf]