WorldGrow: Generating Infinite 3D World

https://github.com/world-grow/WorldGrow

35•cdani•3h ago

Comments

jackdoe•2h ago

Can't wait for the new Diablo :)

speedgoose•1h ago

It looks more like the Stanley parable.

pjmlp•1h ago

With a quarter the size of the development team, 'cause productivity!

embedding-shape•2h ago

It is only a paper as of now:

> The code is being prepared for public release; pretrained weights and full training/inference pipelines are planned.

Any ideas on how it would differ from, and be better than, "traditional" PCG? Seems like it'd give you more resource consumption, worse results and less control, none of which seem like a benefit.

glenneroo•1h ago

The description in the linked YouTube video for some reason has more info than the github repo:

> We tackle the challenge of generating the infinitely extendable 3D world — large, continuous environments with coherent geometry and realistic appearance. Existing methods face key challenges: 2D-lifting approaches suffer from geometric and appearance inconsistencies across views, 3D implicit representations are hard to scale up, and current 3D foundation models are mostly object-centric, limiting their applicability to scene-level generation. Our key insight is leveraging strong generation priors from pre-trained 3D models for structured scene block generation. To this end, we propose WorldGrow, a hierarchical framework for unbounded 3D scene synthesis. Our method features three core components: (1) a data curation pipeline that extracts high-quality scene blocks for training, making the 3D structured latent representations suitable for scene generation; (2) a 3D block inpainting mechanism that enables context-aware scene extension; and (3) a coarse-to-fine generation strategy that ensures both global layout plausibility and local geometric/textural fidelity. Evaluated on the large-scale 3D-FRONT dataset, WorldGrow achieves SOTA performance in geometry reconstruction, while uniquely supporting infinite scene generation with photorealistic and structurally consistent outputs. These results highlight its capability for constructing large-scale virtual environments and potential for building future world models.

Garlef•2h ago

I don't think generating virtual space is the issue.

It's about generating interesting virtual space!

james-bcn•1h ago

Yep. People have been doing this kind of stuff for computer games for decades. It's actually not that difficult. It's not clear what novel problem is being solved here.

jsheard•1h ago

Yeah but those traditional procgen techniques don't use AI, and this one does use AI. They solved the problem of them not being AI enough for the AI era. AI!

agravier•40m ago

Do you have some particular piece of software or tech demo or game in mind with interesting very large generated 3D worlds?

sirtaj•31m ago

Valheim and No Man's Sky are ones I've played recently.

SiempreViernes•30m ago

In Mario 64 there is a staircase you can run up forever, granted it looks the same no matter how long you have Mario run up the stairs, but that certainly fits "big but uninteresting 3d world."

bogwog•14m ago

> big but uninteresting 3d world.

I know 'interesting' is subjective, but your comment is demonstrably false. Just type "mario 64 staircase" into youtube, and look at the hundreds (thousands? millions?) of videos and many millions of views.

antonvdi•15m ago

Minecraft surely fits those criteria.

jpalomaki•1h ago

> The generated scenes are walkable and suitable for navigation/planning evaluation.

Maybe the idea is to create environments for AI robotics training.

rootlocus•1h ago

Or at least coherent.

analog8374•59m ago

Consider the levels generated in any roguelike.

Consider the patterns generated by cellular automata.

Both tend to stay interesting in the small scale but lose it to boring chaos in the large.

For this reason I think the better approach is to start with a simple level-scale form and then refine it into smaller parts, and then to refine those parts and so on.

(Vs plugging away at tunnel-building like a mole)
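The top-down idea above can be sketched with recursive binary space partitioning: start from one level-sized rectangle and keep splitting it into smaller regions. This is only an illustration of the commenter's suggestion, not WorldGrow's coarse-to-fine method; the `subdivide` function, the `Rect` layout and the `min_size` and `depth` parameters are all made up for the example.

```python
import random

# Hypothetical rectangle layout for this sketch: (x, y, width, height).
Rect = tuple[int, int, int, int]


def subdivide(rect: Rect, rng: random.Random, min_size: int = 4, depth: int = 3) -> list[Rect]:
    """Split a level-scale rectangle into smaller regions, coarse to fine."""
    x, y, w, h = rect
    # Stop when the depth budget is spent or neither side can be split
    # into two pieces of at least min_size.
    if depth == 0 or (w < 2 * min_size and h < 2 * min_size):
        return [rect]
    if w >= h:
        # Cut the longer (horizontal) side; both halves stay >= min_size wide.
        cut = rng.randint(min_size, w - min_size)
        first, second = (x, y, cut, h), (x + cut, y, w - cut, h)
    else:
        # Cut the longer (vertical) side; both halves stay >= min_size tall.
        cut = rng.randint(min_size, h - min_size)
        first, second = (x, y, w, cut), (x, y + cut, w, h - cut)
    return (subdivide(first, rng, min_size, depth - 1)
            + subdivide(second, rng, min_size, depth - 1))


if __name__ == "__main__":
    for region in subdivide((0, 0, 64, 48), random.Random(1)):
        print(region)
```

Each returned region could then be refined again with its own rules (rooms, then furniture, and so on), which keeps the large-scale shape under control while the detail varies.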

keyle•9m ago

You reminded me of this https://book.leveldesignbook.com/process/layout

And Valve, I think, used to have a series on level design that worked from big to small and used "anchor points", but I seem to have misplaced the link.

gcr•1h ago

This could be a great way to make backrooms horror environments!

I've dreamed of a NeRF-powered backrooms walking simulator for quite a while now. This approach is "worse" because the mesh seems explicit rather than just the world becoming what you look at, but that's arguably better for real-world use cases of course.

grumbelbart2•26m ago

> backrooms horror environments

True, it sounds (and looks) a lot like https://scp-wiki.wikidot.com/scp-3008

fjfaase•29m ago

I wonder if they also have a strategy for deleting generated tiles; otherwise the "infinite" world is limited by the size of available memory. I also wonder whether their method can exactly recreate tiles that have been deleted, or in other words, whether they have a method for generating unique seeds for all tiles. The paper does not give many technical details. If the seed has a limited size and there is a method for generating seeds for each 2D coordinate, I wonder if it is possible to make a non-repeating infinite world. I think it is not possible with a limited-size seed.
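For reference, this is roughly how traditional procgen handles it. A minimal sketch, not anything from the WorldGrow paper: the names `tile_seed`, `generate_tile` and `TileCache` are made up, and the "tile" is just a grid of random digits standing in for real content. Each tile's seed is derived by hashing the world seed with the tile coordinates, so an evicted tile can be regenerated exactly.

```python
import hashlib
import random
from collections import OrderedDict


def tile_seed(world_seed: int, x: int, y: int) -> int:
    """Derive a 64-bit seed for tile (x, y) by hashing the world seed and coordinates."""
    key = f"{world_seed}:{x}:{y}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(key, digest_size=8).digest(), "big")


def generate_tile(world_seed: int, x: int, y: int, size: int = 4) -> list[list[int]]:
    """Placeholder generator: the same (world_seed, x, y) always yields the same tile."""
    rng = random.Random(tile_seed(world_seed, x, y))
    return [[rng.randint(0, 9) for _ in range(size)] for _ in range(size)]


class TileCache:
    """Keeps at most `capacity` tiles in memory, evicting the least recently used."""

    def __init__(self, world_seed: int, capacity: int) -> None:
        self.world_seed = world_seed
        self.capacity = capacity
        self.tiles = OrderedDict()  # (x, y) -> tile

    def get(self, x: int, y: int) -> list[list[int]]:
        key = (x, y)
        if key in self.tiles:
            self.tiles.move_to_end(key)  # mark as recently used
            return self.tiles[key]
        tile = generate_tile(self.world_seed, x, y)
        self.tiles[key] = tile
        if len(self.tiles) > self.capacity:
            self.tiles.popitem(last=False)  # drop the oldest tile
        return tile
```

On the last point: in this sketch a tile depends only on its 64-bit seed, so there are at most 2^64 distinct tiles and individual tiles must eventually repeat somewhere on an unbounded grid. Whether a learned generator that conditions each block on its neighbours can be made reproducible in the same way is a separate question.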

keyle•13m ago

This is cool. And could be fun in games. Not sure I get the point otherwise... The thought that came to mind was "Architectural slop".
