Qwen 3.7 Preview

https://twitter.com/Alibaba_Qwen/status/2056403591464984753

54•theanonymousone•1h ago

Comments

kelsey98765431•51m ago

https://xcancel.com/Alibaba_Qwen/status/2056403591464984753

> Qwen3.7 Preview lands on Arena ！

> Here come Qwen3.7-Max-Preview & Qwen3.7-Plus-Preview. Alibaba now #6 lab in Text, #5 in Vision.

> Can't wait to release Qwen3.7 series models！Stay tuned! @arena

Onavo•39m ago

Where's Grok 4.3 on the leaderboard?

rspoerri•38m ago

I am very interested in seeing new Qwen models. Qwen3.6 27b is the first one that can do things without constantly losing "its mind", and that can be run on a 3090 with a good context size. But it sometimes gets into a loop.

giancarlostoro•37m ago

I had a flavor of an older version of Qwen (I forget which one, to be fair) that was coding along and then lost itself in a loop. I was so confused. It was just a random greenfield "let's see how it does" type of project anyway.

tedivm•27m ago

I've completely replaced GitHub Copilot using Sonnet 3.6 with OpenCode using Qwen3.6 27b, and it's been a great experience.

verdverm•11m ago

Similar, but I'm using the 35B A3B variant with experimental MTP support.

OpenCode is pretty good too

BillStrong•11m ago

Look on HuggingFace, there is a template that is supposed to fix the updates for the Qwen Models.

https://huggingface.co/froggeric/Qwen-Fixed-Chat-Templates

Maybe it will help you?

giancarlostoro•38m ago

There I was waiting on a smaller version of Qwen 3.6 to drop so I can run it on my Mac, and then bam, they drop this.

hydra-f•35m ago

Vision has become totally underappreciated, whereas I believe it brings important advantages to a model

Also, a big caveat in using Qwen models has always been their speech patterns. I do wonder how Google made the Gemma lineup so good at this.

Let's hope Alibaba continues to open source its models

jwr•29m ago

Agreed. Incidentally, in my testing, qwen models (qwen3.6-35b-a3b and earlier 3.5) are WAY better with vision than gemma4-26b-a4b. I would normally want to stick with gemma4 only (I use it for spam filtering), but it just doesn't cut it for vision work, and qwen models do.

greenavocado•16m ago

God I love qwen3.6-35b-a3b especially Q8

verdverm•9m ago

I second this notion, I am impressed daily with what little Qwen can do

sleepyeldrazi•33m ago

I don't think I can handle another small model release by qwen, I'm still trying to find the limits of 3.6 27B and they are already threatening us with a new one?

But jokes aside, I love the fast iteration, these are most probably again finetunes on the 3.5 architecture that appear better in internal testing, which is still very nice to see. Putting more and more pressure on the bigger labs to perform better is always a good thing.

genxy•4m ago

How good must their training pipelines be? Releasing publicly and at this rate has made them very efficient.

trilogic•19m ago

Qwen 3.6 35B (finetuned) is so good that it has become the standard open-weights model for everyday use. It is not far at all from proprietary models: if you give it tools, skills, agents, etc., it can actually finish the job. (Thank you Qwen team, appreciated.) With open source we can now definitely rely on it to design very complicated architecture from scratch and build the full pack pretty fast. I wish to see European AI unleashed; wake up.

mettamage•13m ago

Do you have a good resource on how to finetune a model like Qwen? I am curious to try it out.

verdverm•8m ago

Unsloth has good resources

kethinov•14m ago

Can someone explain what the current state of model benchmarking is? If you try to look up what the best locally runnable model is, you get a bunch of random blog posts using idiosyncratic criteria to rank things seemingly based on one dude's opinion.

Ideally I would love to see a leaderboard with relatively objective ranking criteria that 1. lets you filter by open weight / locally runnable, 2. filter by date of release (nothing older than x), and 3. is agnostic to hardware requirements. I just want to know what the best model is. Let me worry about how I will afford to run it.

I love the llmfit project for seeing what will run on your hardware, but it would be nice to know what I'm missing out on by not having better hardware, which is why objective hardware-agnostic ratings would be helpful.

vessenes•10m ago

That would be nice, but it's not going to be possible.

Any open benchmark has a very short life, since it will be pulled in and DPO / RL trained quickly for benchmaxxing purposes. So, you'll need a private test to have a hope of something fair. (These also get leaked over time, btw, so even then there's a window of usability).

These are expensive to run.

Now consider that there might be 15-20 viable quants for a given open model release; someone would have to want to pay for these private evals to be run on them. Even then, a good read through unsloth's commits and blog posts will remind you that there's quite a lot of engineering work to be done to get model inference working properly, even for models released by frontier or near-frontier labs. So, you'd want to make sure that you have a replicable 'best engineered' deployment to evaluate, or at least one that's closest to your hardware and fits the bill.

Upshot - it's much faster to download and try out a model, and possibly cheaper too. Well, cheaper since hugging face is paying the bandwidth bills.

sigmoid10•5m ago

>I just want to know what the best model is. Let me worry about how I will afford to run it.

This is a very typical manager question that I suppose many people have who fail to see the simple truth: There is no "best" model. There are only best models for certain use-cases. Sometimes you'll find these in custom community leaderboards on platforms like huggingface, but for most business applications you'll probably have to come up with your own benchmark. Most common benchmarks are pretty worthless by now because all the usual ones are being gamed hard by model providers, to the point that there are now sometimes drastic differences between models that perform very similarly on common benchmarks.
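To make the "come up with your own benchmark" suggestion concrete, here is a minimal sketch of what such a harness could look like. Everything in it is an assumption for illustration: the task list, the substring-match scoring, and the stub functions standing in for real models are all hypothetical, and a real setup would replace each stub with a call to whatever local inference server you run.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical task set: (prompt, expected substring) pairs drawn from
# your own use case. Replace these with real examples from your workload.
TASKS: List[Tuple[str, str]] = [
    ("What is 2 + 2? Answer with a number only.", "4"),
    ("Name the capital of France in one word.", "Paris"),
]


def score_model(generate: Callable[[str], str],
                tasks: List[Tuple[str, str]] = TASKS) -> float:
    """Return the fraction of tasks whose output contains the expected answer."""
    passed = sum(
        1 for prompt, expected in tasks
        if expected.lower() in generate(prompt).lower()
    )
    return passed / len(tasks)


def rank_models(models: Dict[str, Callable[[str], str]]) -> List[Tuple[str, float]]:
    """Score every model on the same tasks and sort best-first."""
    scores = {name: score_model(fn) for name, fn in models.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Stand-in "models" so the sketch runs without any inference backend.
def stub_good(prompt: str) -> str:
    return "4" if "2 + 2" in prompt else "Paris"


def stub_bad(prompt: str) -> str:
    return "I am not sure."


if __name__ == "__main__":
    for name, score in rank_models({"good": stub_good, "bad": stub_bad}):
        print(f"{name}: {score:.0%}")
```

Substring matching is only the simplest possible scorer; for coding or agentic work you would more likely run the generated output against tests. Keeping the task set private also sidesteps the benchmark-gaming problem described above.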

vessenes•14m ago

Today I learned Meta's new model is preferred to everything but claude. That is .. a real surprise! Congrats to the Meta team.

