Doesn't seem to be far ahead of existing proprietary implementations. But it's still good that someone's willing to push that far and release the results. Getting multimodal input to work even this well is not at all easy.
The relevant comparison is on page 15: https://arxiv.org/abs/2509.17765
https://openrouter.ai/qwen/qwen3-235b-a22b-thinking-2507
I'll use this to identify and caption meal pictures and user pictures in other workflows. Very cool!
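For anyone wanting to wire up a captioning workflow like this, here is a minimal sketch against an OpenAI-compatible chat endpoint (the style OpenRouter exposes). The model slug and endpoint below are assumptions for illustration; check the provider's catalog for the actual Qwen3-VL identifier.

```python
import json
import urllib.request

# Assumed values for illustration -- verify against the provider's docs.
MODEL = "qwen/qwen3-vl-235b-a22b-instruct"  # hypothetical slug
API_URL = "https://openrouter.ai/api/v1/chat/completions"


def build_caption_request(image_url: str, prompt: str = "Caption this image.") -> dict:
    """Build an OpenAI-style chat payload with one image attachment."""
    return {
        "model": MODEL,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {"type": "image_url", "image_url": {"url": image_url}},
                ],
            }
        ],
    }


def caption(image_url: str, api_key: str) -> str:
    """Send the request and return the model's caption text."""
    req = urllib.request.Request(
        API_URL,
        data=json.dumps(build_caption_request(image_url)).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]


if __name__ == "__main__":
    # Inspect the payload locally without spending an API call.
    payload = build_caption_request("https://example.com/meal.jpg")
    print(json.dumps(payload, indent=2))
```

The same payload shape works for batching: loop over image URLs and call `caption()` per image, or swap the prompt to pull structured fields (dish name, ingredients) instead of free-form captions.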
natrys•2h ago
- https://huggingface.co/Qwen/Qwen3-VL-235B-A22B-Thinking
- https://huggingface.co/Qwen/Qwen3-VL-235B-A22B-Instruct