frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Qwen-Image-Layered: transparency and layer aware open diffusion model

https://huggingface.co/papers/2512.15603
41•dvrp•20h ago

Comments

dvrp•3h ago
Qwen-Image-Layered is a diffusion model that, unlike most SOTA-ish models out there (e.g. Flux, Krea 1, ChatGPT, Qwen-Image) it's (1) open-weight (unlike ChatGPT Image or Nano Banana) and Apache 2.0; and has 2 distinct inference-time features: (i) it's able to understand the alpha channel of images (RGBA, as opposed to RGB only) which makes it able to generate transparency-aware bitmaps; and (ii), it's able to understand layers [1]—this is how most creative professionals work in software like Photoshop or Figma, where you overlay elements into a single file, such as a foreground and a background.

This is the first model by a main AI research lab (the people behind Qwen Image, which is basically the SOTA open image diffusion model) with those capabilities afaik.

The difference in timing for this submission (16 hours ago) is because that's when the research/academic paper got released—as opposed to the inference code and model weights, which just got released 5 hours ago.

---

Technically there's another difference, but this mostly matters for people who are interested in AI research or AI training. From their abstract: “[we introduce] a Multi-stage Training strategy to adapt a pretrained image generation model into a multilayer image decomposer.” which seems to imply that you can adapt a current (but different) image model to understand layers as well, as well as a pipeline to obtain the data from Photoshop .PSD files.

dvrp•3h ago
See also:

- Paper page: https://huggingface.co/papers/2512.15603

- Model page: https://huggingface.co/Qwen/Qwen-Image-Layered

- Quantized model page: https://huggingface.co/QuantStack/Qwen-Image-Layered-GGUF

- Blog URL: https://qwenlm.github.io/blog/qwen-image-layered/ (404 at the time of writing this comment, but it'll probably release soon)

- GitHub page: https://github.com/QwenLM/Qwen-Image-Layered

SV_BubbleTime•25m ago
I’m still not clear if it’s going to deliver the unique layers to you?

If you set a variable layers of 5 for example will it determine what is on each layer, or do I need to prompt that?

And I assume you need enough VRAM because each layer will be effectively a whole image in pixel or latent space… so if I have a 1MP image, and 5 layers I would likely need to be able to fit a 5MP image in VRAM?

Or if this can be multiple steps, where I wouldn’t need all 5 layers in active VRAM, that the assembly is another step at the end after generating on one layer?

CSS Grid Lanes

https://webkit.org/blog/17660/introducing-css-grid-lanes/
106•frizlab•1h ago•23 comments

Mistral OCR 3

https://mistral.ai/news/mistral-ocr-3
329•pember•1d ago•48 comments

A Better Zip Bomb

https://www.bamsoftware.com/hacks/zipbomb/
50•kekqqq•2h ago•15 comments

Garage – An S3 object store so reliable you can run it outside datacenters

https://garagehq.deuxfleurs.fr/
408•ibobev•7h ago•85 comments

TP-Link Tapo C200: Hardcoded Keys, Buffer Overflows and Privacy

https://www.evilsocket.net/2025/12/18/TP-Link-Tapo-C200-Hardcoded-Keys-Buffer-Overflows-and-Priva...
188•sibellavia•5h ago•53 comments

We ran Anthropic’s interviews through structured LLM analysis

https://www.playbookatlas.com/research/ai-adoption-explorer
14•jp8585•47m ago•3 comments

8-bit Boléro

https://linusakesson.net/music/bolero/index.php
126•Aissen•11h ago•24 comments

Graphite is joining Cursor

https://cursor.com/blog/graphite
152•fosterfriends•7h ago•174 comments

GotaTun – Mullvad's WireGuard Implementation in Rust

https://mullvad.net/en/blog/announcing-gotatun-the-future-of-wireguard-at-mullvad-vpn
518•km•12h ago•107 comments

Amazon will allow ePub and PDF downloads for DRM-free eBooks

https://www.kdpcommunity.com/s/article/New-eBook-Download-Options-for-Readers-Coming-in-2026?lang...
504•captn3m0•13h ago•269 comments

Qwen-Image-Layered: transparency and layer aware open diffusion model

https://huggingface.co/papers/2512.15603
41•dvrp•20h ago•3 comments

Performance Hints (2023)

https://abseil.io/fast/hints.html
37•danlark1•6h ago•22 comments

NOAA deploys new generation of AI-driven global weather models

https://www.noaa.gov/news-release/noaa-deploys-new-generation-of-ai-driven-global-weather-models
62•hnburnsy•2d ago•34 comments

Show HN: TinyPDF – 3kb pdf library (70x smaller than jsPDF)

https://github.com/Lulzx/tinypdf
90•lulzx•1d ago•12 comments

Rust's Block Pattern

https://notgull.net/block-pattern/
93•zdw•18h ago•34 comments

Buteyko Method

https://en.wikipedia.org/wiki/Buteyko_method
19•rzk•1h ago•7 comments

Brown/MIT shooting suspect found dead, officials say

https://www.washingtonpost.com/nation/2025/12/18/brown-university-shooting-person-of-interest/
60•anigbrowl•20h ago•62 comments

The pitfalls of partitioning Postgres yourself

https://hatchet.run/blog/postgres-partitioning
41•abelanger•3d ago•5 comments

Believe the Checkbook

https://robertgreiner.com/believe-the-checkbook/
106•rg81•7h ago•46 comments

Response Healing: Reduce JSON defects by 80%+

https://openrouter.ai/announcements/response-healing-reduce-json-defects-by-80percent
30•numlocked•1d ago•21 comments

The FreeBSD Foundation's Laptop Support and Usability Project

https://github.com/FreeBSDFoundation/proj-laptop
123•mikece•8h ago•41 comments

Monumental snake engravings of the Orinoco River (2024)

https://www.cambridge.org/core/journals/antiquity/article/monumental-snake-engravings-of-the-orin...
9•bryanrasmussen•1w ago•0 comments

Show HN: Misata – synthetic data engine using LLM and Vectorized NumPy

https://github.com/rasinmuhammed/misata
6•rasinmuhammed•3d ago•0 comments

Reverse Engineering US Airline's PNR System and Accessing All Reservations

https://alexschapiro.com/security/vulnerability/2025/11/20/avelo-airline-reservation-api-vulnerab...
78•bearsyankees•5h ago•38 comments

The scariest boot loader code

http://miod.online.fr/software/openbsd/stories/boot_hppa.html
12•todsacerdoti•3h ago•0 comments

Lite^3, a JSON-compatible zero-copy serialization format

https://github.com/fastserial/lite3
121•cryptonector•6d ago•30 comments

LLM Year in Review

https://karpathy.bearblog.dev/year-in-review-2025/
19•swyx•2h ago•4 comments

Show HN: I Made Loom for Mobile

https://demoscope.app
57•admtal•6h ago•33 comments

Wall Street Ruined the Roomba and Then Blamed Lina Khan

https://www.thebignewsletter.com/p/how-wall-street-ruined-the-roomba
172•connor11528•4h ago•124 comments

History LLMs: Models trained exclusively on pre-1913 texts

https://github.com/DGoettlich/history-llms
742•iamwil•1d ago•364 comments