https://chat.qwen.ai/s/0f9d558c-2108-4350-98fb-6ee87065d587?...
As an example, when asked to change the background, it also completely changed the bear (it has the same shirt but the fur and face are clearly different), and also: when it turned the bear in a balloon, it changed the background (removing the pavement) and lost the left seed in the watermelon.
It is something that can be fixed with better prompting, or is it a limitation of the model/architecture?
Firefox on ios ftr
“I get it” - is actually just some arbitrary personal benchmark.
The reason we use math in physics is because of its specificity. The same reason coding is so hard [0,1].
[0] https://youtube.com/watch?v=cDA3_5982h8
[1] Code is math. There's an isomorphism between Turing complete languages and computable mathematics. You can look more into my namesake, church, and Turing if you want to get more formal or wait for the comment that corrects a nuanced mistake here (yes, it exists)
rushingcreek•3h ago
If Qwen is concerned about recouping its development costs, I suggest looking at BFL's Flux Kontext Dev release from the other day as a model: let researchers and individuals get the weights for free and let startups pay for a reasonably-priced license for commercial use.
Jackson__•2h ago
So it is trained off OAI, as closed off as OAI and most importantly: worse than OAI. What a bizarre strategy to gate-keep this behind an API.
[0]
https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-VLo/cas...
https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-VLo/cas...
https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-VLo/cas...
echelon•2h ago
Both Alibaba and Tencent championed open source (Qwen family of models, Hunyuan family of models), but now they've shut off the releases.
There's totally a play where models become loss-leader for SaaS/PaaS/IaaS and where they extinguish your closed competition.
Imagine spreading your model so widely then making the terms: "do not use in conjunction with closed source models".
diggan•1h ago
What are you talking about? Feels like a very strong claim considering there are ongoing weight releases, wasn't there one just today or yesterday from a Chinese company?
yorwba•4m ago
New entrants may keep releasing weights as a marketing strategy to gain name recognition, but once they have established themselves (and investors start getting antsy about ROI) making subsequent releases closed is the logical next step.
vachina•2h ago
Jackson__•1h ago
It's really too close to be anything but a model trained on these outputs, the whole vibe just screams OAI.
VladVladikoff•45m ago
diggan•2h ago
But if you're suggesting they should do open weights, doesn't that mean people should be able to use it freely?
You're effectively suggesting "trial-weights", "shareware-weights", "academic-weights" or something like that rather than "open weights", which to me would make it seem like you can use them for whatever you want, just like with "open source" software. But if it misses a large part of what makes "open source" open source, like "use it for whatever you want", then it kind of gives the wrong idea.
rushingcreek•2h ago
I think that releasing the weights openly but with this type of dual-license (hence open weights, but not true open source) is an acceptable tradeoff to get more model developers to release models openly.
diggan•1h ago
But isn't that true for software too? Software is expensive to develop, and lots of developers/companies are choosing not to make their code public for free. Does that mean you also feel like it would be OK to call software "open source" although it doesn't allow usage for any purpose? That would then lead to more "open source" software being released, at least for individuals and researchers?
rushingcreek•1h ago
diggan•49m ago
I mean it wasn't binary earlier, it was "to get more model developers to release", so not a binary choice, but a gradient I suppose. Would you still make the same call for software as you do for ML models and weights?
echelon•2h ago
Alibaba just shut off the Qwen releases
Tencent just shut off the Hunyuan releases
Bytedance just released Seedream, but it's closed
It's seems like it's over.
They're still clearly training on Western outputs, though.
I still suspect that the strategic thing to do would be to become 100% open and sell infra/service.
pxc•2h ago
natrys•2h ago
Alibaba from beginning had some series of models that are always closed-weights (*-max, *-plus, *-turbo etc. but also QvQ), It's not a new development, nor does it prevent their open models. And the VL models are opened after 2-3 months of GA in API.
> Tencent just shut off the Hunyuan releases
Literally released one today: https://huggingface.co/tencent/Hunyuan-A13B-Instruct
logicchains•2h ago
dheera•1h ago
> let researchers and individuals get the weights for free and let startups pay for a reasonably-priced license for commercial use
I'm personally doubtful companies can recoup tens of millions of dollars in investment, GPU hours, and engineering salaries from image generation fees.