frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Bagel • Unified Model for Multimodal Understanding and Generation

https://github.com/ByteDance-Seed/Bagel
7•montyanderson•8mo ago

Comments

yreg•8mo ago
Curious there is no discussion on this. I think it looks interesting.

If nothing else, I'm glad someone is still working on open-weight image models. AFAIK there hasn't been much movement in the area since Flux.

wsintra2022•8mo ago
Was looking at the model and was curious about HN comments, thought this would be a good talking piece since it has been released open, haven’t tried to run it locally yet but will do soon as I can.
yreg•8mo ago
There has been some discussion in /r/stablediffusion I'm not sure if anyone tried to run it though.
mdaniel•8mo ago
It's the luck of the submission time window; currently: https://news.ycombinator.com/item?id=44094362
wsintra2022•8mo ago
The model itself appears to be around 30gb, my rule of thumb double it for ram. So should run on 60gb vram/unified ram ?