Can a Language Model Paint?

https://www.etive-mor.com/blog/can-a-language-model-paint/

15•liamlaverty•1d ago

Comments

liamlaverty•1d ago

I've been trying to get some language models to paint one stroke at a time for a few months now. I thought this community would be interested to see the results.

The article runs through my findings, and there's a linked technical rundown of how the app was built. There's also an interactive gallery [0] of my attempts. You can point an agent at the API docs [1], and they might (ymmv) do a painting themselves.

[0] https://www.liamlaverty.com/paint-by-language-model/ [1] https://www.liamlaverty.com/paint-by-language-model/draw/api

throwanem•1h ago

You are desperately misguided.

I would say how, but I am not your friend and here in the 2030s, no one can afford to give anything valuable for free to a stranger. Be glad of the advice, of which you'd be wise to make much more than you will.

mountainriver•56m ago

Very cool! I’ve been trying this quite a bit too

baCist•1d ago

LLMs can draw (play music, write books), but they imitate, not create.

bizer•1d ago

Good attempt. Compared to diffusion, these paintings look more like they were created by humans.

gus_massa•1d ago

You may enjoy

* "The last six months in LLMs, illustrated by pelicans on bicycles" https://simonwillison.net/2025/Jun/6/six-months-in-llms/ (https://news.ycombinator.com/item?id=44215352 | 962 points | 11 months ago | 239 comments)

* "Using “underdrawings” for accurate text and numbers" https://samcollins.blog/underdrawings/ (https://news.ycombinator.com/item?id=47977990 | 379 points | 9 days ago | 138 comments)

jamilton•14m ago

Neat. I wonder if a allowing the models to inspect pixels or pixel regions, instead of fully relying on the VLM, would help at all. The spatial reasoning required might be too complex though. In general the VLM seems to be a limiting factor, so I wonder if there's some way to usefully augment it or sidestep limitations.

Like, instead of being in pseudo-MSpaint, pseudo-Photoshop with manipulable layers and bounding boxes. They struggle to add an outline to something previously drawn, but that's something that could be done programmatically. The limitations are obviously part of what makes this interesting, but different limitations could be interesting, too. Maybe additional complexity would just result in more uninteresting failures though, I don't know.

I noticed that the feedback/strengths/suggestions outputs are clearly also given the initial image's prompt. It could be useful to additionally have an output that's not given the prompt, so the LLM knows what the VLM sees without bias?

Claude for Small Business

Scorched Earth 2000 – Web

Cisco workforce reductions

Linux gaming is faster because Windows APIs are becoming Linux kernel features

Setting up a free *.city.state.us locality domain (2025)

A History of IDEs at Google

Show HN: Nibble

MacBook Neo Deep Dive: Benchmarks, Wafer Economics, and the 8GB Gamble

The Emacsification of Software

Chess puzzle I found in my dad's old book

Twin brothers wipe 96 government databases minutes after being fired

Princeton mandates proctoring for in-person exams, upending 133 year precedent

Avoiding and reducing microplastic false positives from dry glove contact

Golden Testing a CAD Library

Launch HN: Ardent (YC P26) – Postgres sandboxes in seconds with zero migration

Xs of Y – roguelike that names itself every run. Written in 4kLoC

The US is winning the AI race where it matters most: commercialization

Marco Polo: Finding a friend with only distance and motion

AEPs: API Enhancement Proposals

How can Apple deal with the memory shortage?

The other half of AI safety

Heritability of human life span is ~50% when heritability is redefined

Microsoft BitLocker – YellowKey zero-day exploit

Reverting the incremental GC in Python 3.14 and 3.15

S-100 Virtual Workbench

Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model

Mystery Microsoft bug leaker keeps the zero-days coming

A sentimental tour of late 1990s and early 2000s hacking tools

Preserving Fisher-Price Pixter

Tell HN: Dont use Claude Design, lost access to my projects after unsubscribing