frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Qwen 3.7 Preview

https://twitter.com/Alibaba_Qwen/status/2056403591464984753
54•theanonymousone•1h ago

Comments

kelsey98765431•51m ago
https://xcancel.com/Alibaba_Qwen/status/2056403591464984753

> Qwen3.7 Preview lands on Arena !

> Here come Qwen3.7-Max-Preview & Qwen3.7-Plus-Preview. Alibaba now #6 lab in Text, #5 in Vision.

> Can't wait to release Qwen3.7 series models!Stay tuned! @arena

Onavo•39m ago
Where's Grok 4.3 on the leaderboard?
rspoerri•38m ago
I am very interested in seeing new qwen models. Qwen3.6 27b is the first one that can do things and doesnt constantly loose "it's mind" and that can be run on a 3090 with a good context size. But it's sometimes getting into a loop.
giancarlostoro•37m ago
I had a flavor of an older version of Qwen (I forget which one to be fair) that was coding along, then lost itself in a loop, I was so confused, it was just a random greenfield "lets see how it does" type of project anyway.
tedivm•27m ago
I've completely replaced GitHub Copilot using Sonnet 3.6 with OpenCode using Qwen3.6 27b, and it's been a great experience.
verdverm•11m ago
Similar, but I'm using 35B A3B variation with experimental MTP support

OpenCode is pretty good too

BillStrong•11m ago
Look on HuggingFace, there is a template that is supposed to fix the updates for the Qwen Models.

https://huggingface.co/froggeric/Qwen-Fixed-Chat-Templates

Maybe will help you?

giancarlostoro•38m ago
There I was waiting on a smaller version of Qwen 3.6 to drop so I can run it on my Mac, and then bam, they drop this.
hydra-f•35m ago
Vision has become totally underappreciated, whereas I believe it brings important advantages to a model

Also, a big caveat in using Qwen models has always been its speech patterns. I do wonder how Google made the Gemma lineup so good at this

Let's hope Alibaba continues to open source its models

jwr•29m ago
Agreed. Incidentally, in my testing, qwen models (qwen3.6-35b-a3b and earlier 3.5) are WAY better with vision than gemma4-26b-a4b. I would normally want to stick with gemma4 only (I use it for spam filtering), but it just doesn't cut it for vision work, and qwen models do.
greenavocado•16m ago
God I love qwen3.6-35b-a3b especially Q8
verdverm•9m ago
I second this notion, I am impressed daily with what little Qwen can do
sleepyeldrazi•33m ago
I don't think I can handle another small model release by qwen, I'm still trying to find the limits of 3.6 27B and they are already threatening us with a new one?

But jokes aside, I love the fast iteration, these are most probably again finetunes on the 3.5 architecture that appear better in internal testing, which is still very nice to see. Putting more and more pressure on the bigger labs to perform better is always a good thing.

genxy•4m ago
How good must their training pipelines be? Releasing publicly and at this rate has made them very efficient.
trilogic•19m ago
Qwen 3.6 35B (finetuned) is so good that it became standard open weights for everyday use. Is not far at all from proprietary models if you give it tools, skills and agents etc, it can actually finish the job. (Thank you Qwen team, appreciated). Using opensource now we can definitely rely to design from scratch very complicated architecture and build pretty fast the full pack. Wish to see Europe AI unleashed, wake up.
mettamage•13m ago
Do you have a good resource on how to finetune a model like Qwen? I am curious to try it out.
verdverm•8m ago
Unsloth has good resources
kethinov•14m ago
Can someone explain what the current state of model benchmarking is? If you try to look up what the best locally runnable model is, you get a bunch of random blog posts using idiosyncratic criteria to rank things seemingly based on one dude's opinion.

Ideally I would love to see a leaderboard with relatively objective ranking criteria that 1. lets you filter by open weight / locally runnable, 2. filter by date of release (nothing older than x), and 3. is agnostic to hardware requirements. I just want to know what the best model is. Let me worry about how I will afford to run it.

I love the llmfit project for seeing what will run on your hardware, but it would be nice to know what I'm missing out on by not having better hardware, thus why objective hardware-agnostic ratings would be helpful.

vessenes•10m ago
That would be nice, but it's not going to be possible.

Any open benchmark has a very short life, since it will be pulled in and DPO / RL trained quickly for benchmaxxing purposes. So, you'll need a private test to have a hope of something fair. (These also get leaked over time, btw, so even then there's a window of usability).

These are expensive to run.

Now consider that there might be 15-20 viable quants for a given open model release; someone would have to want to pay for these private evals to be run on them. Even then, a good read through unsloth's commits and blog posts will remind you that there's quite a lot of engineering work to be done to get model inference working properly, even for models released by frontier or near-frontier labs. So, you'd want to make sure that you have a replicable 'best engineered' deployment to evaluate, or at least one that's closest to your hardware and fits the bill.

Upshot - it's much faster to download and try out a model, and possibly cheaper too. Well, cheaper since hugging face is paying the bandwidth bills.

sigmoid10•5m ago
>I just want to know what the best model is. Let me worry about how I will afford to run it.

This is a very typical manager question that I suppose many people have who fail to see the simple truth: There is no "best" model. There are only best models for certain use-cases. Sometimes you'll find these in custom community leaderboards on platforms like huggingface, but for most business applications you'll probably have to come up with your own benchmark. Most common benchmarks are pretty worthless by now because all the usual ones are being gamed hard by model providers, to the point that there are now sometimes drastic differences between models that perform very similarly on common benchmarks.

vessenes•14m ago
Today I learned Meta's new model is preferred to everything but claude. That is .. a real surprise! Congrats to the Meta team.

Anthropic Acquires Stainless

https://www.anthropic.com/news/anthropic-acquires-stainless
94•tomeraberbach•1h ago•56 comments

We stopped AI bot spam in our GitHub repo using Git's –author flag

https://archestra.ai/blog/only-responsible-ai
234•ildari•2h ago•101 comments

Show HN: Files.md – Open-source alternative to Obsidian

https://github.com/zakirullin/files.md
339•zakirullin•4h ago•186 comments

The Quiet Renovation at Bitwarden

https://blog.ppb1701.com/the-quiet-renovation-at-bitwarden
247•DaSHacka•1d ago•121 comments

Elon Musk has lost his lawsuit against Sam Altman and OpenAI

https://techcrunch.com/2026/05/18/elon-musk-has-lost-his-lawsuit-against-sam-altman-and-openai/
104•nycdatasci•27m ago•31 comments

Two computers, one monitor, zero fiddling – Alex Plescan

https://alexplescan.com/posts/2025/08/16/kvm/
44•ankitg12•2d ago•27 comments

Project Glasswing: what Mythos showed us

https://blog.cloudflare.com/cyber-frontier-models/
158•Fysi•4h ago•64 comments

Iran Starts Bitcoin-Backed Ship Insurance for Hormuz Strait

https://www.bloomberg.com/news/articles/2026-05-18/iran-starts-bitcoin-backed-shipping-insurance-...
46•srameshc•39m ago•36 comments

What Is Date:Italy?

http://aesthetikx.info/blog/date_italy.html
56•jollyjerry•2d ago•17 comments

1024000^2 Blocks, 2B2T Minecraft Server World Download Project, and Discoveries

https://github.com/2b2tplace/1m_release
96•exploraz•3h ago•47 comments

Voice AI Systems Are Vulnerable to Hidden Audio Attacks

https://spectrum.ieee.org/voice-ai-audio-attacks
62•SVI•6h ago•17 comments

Qwen 3.7 Preview

https://twitter.com/Alibaba_Qwen/status/2056403591464984753
59•theanonymousone•1h ago•22 comments

The Aperiodic Table

https://blog.jgc.org/2026/05/the-aperiodic-table.html
60•jgrahamc•2d ago•24 comments

When Kierkegaard Got Cancelled

https://www.plough.com/en/topics/faith/discipleship/when-kierkegaard-got-cancelled
51•bookofjoe•6h ago•17 comments

Cursor Introduces Composer 2.5

https://twitter.com/cursor_ai/status/2056415413077233983
29•asar•45m ago•7 comments

The Fil-C Optimized Calling Convention

https://fil-c.org/calling_convention
6•pizlonator•1d ago•0 comments

Learn Harness Engineering

https://walkinglabs.github.io/learn-harness-engineering/en/
48•redbell•5h ago•1 comments

'We mould trees to grow into the shape of chairs'

https://www.bbc.co.uk/news/articles/cvg0yy3gp71o
158•bauc•5h ago•38 comments

It is time to give up the dualism introduced by the debate on consciousness

https://www.noemamag.com/there-is-no-hard-problem-of-consciousness/
237•ahalbert4•15h ago•588 comments

Garry Tan, the CEO of venture YC, accused me of unethical reporting

https://radleybalko.substack.com/p/truth-power-and-honest-journalism
64•gok•2h ago•0 comments

Actually, democracy dies in H.R.

https://www.nytimes.com/2026/05/18/world/americas/actually-democracy-dies-in-hr.html
194•mitchbob•4h ago•132 comments

GenCAD

https://gencad.github.io/
416•dagenix•20h ago•115 comments

Linux security mailing list 'almost unmanageable'

https://www.theregister.com/security/2026/05/18/linus-torvalds-says-ai-powered-bug-hunters-have-m...
160•jonbaer•5h ago•78 comments

Show HN: InsForge – Open-source Heroku for coding agents

https://github.com/InsForge/InsForge
14•mrcoldbrew•2h ago•2 comments

Porting my 3D points renderer on a ZX Spectrum 48K

https://github.com/ttsiodras/3D-on-a-ZX-Spectrum-48K/
64•ttsiodras•1d ago•9 comments

Crystals found inside wreckage from the first nuclear bomb test

https://www.scientificamerican.com/article/strange-crystals-found-inside-wreckage-from-the-first-...
154•jumploops•2d ago•71 comments

Enough with the AI FOMO, go slow-mo, says Domo CDO

https://www.theregister.com/ai-ml/2026/05/17/enough-with-the-ai-fomo-go-slow-mo-says-domo-cdo/524...
123•Bender•5h ago•67 comments

Don't answer the first question

https://lalitm.com/post/dont-answer-the-first-question/
60•lalitmaganti•8h ago•36 comments

The foundations of a provably secure operating system (PSOS) (1979) [pdf]

http://www.csl.sri.com/users/neumann/psos.pdf
94•rurban•8h ago•59 comments

Show HN: Auto-identity-remove – Automated data broker opt-out runner for macOS

https://github.com/stephenlthorn/auto-identity-remove
306•stephenlthorn•6h ago•122 comments