Qwen3-Omni-Flash-2025-12-01: a next-generation native multimodal large model

https://qwen.ai/blog?id=qwen3-omni-flash-20251201
41•pretext•1h ago

Comments

dvh•30m ago
I asked: "How many resistors are used in fuzzhugger phantom octave guitar pedal?". It replied 29 resistors and provided a long list. The answer is 2 resistors: https://tagboardeffects.blogspot.com/2013/04/fuzzhugger-phan...
iFire•27m ago
> How many resistors are used in fuzzhugger phantom octave guitar pedal?

Weird, as someone not having a database of the web, I wouldn't be able to calculate either result.

iFire•27m ago
I tend to pick things where I think the answer is in the introduction material like exams that test what was taught.
dvh•19m ago
"I don't know" would be a perfectly reasonable answer
esafak•19m ago
This is just trivia. I would not use it to test computers -- or humans.
parineum•7m ago
Everything is just trivia until you have a use for the answer.

OP provided a web link with the answer. Aren't these models supposed to be trained on all of that data?

brookst•17m ago
Where did you try it? I don’t see this model listed in the linked Qwen chat.
mettamage•27m ago
I wonder if with that music analysis mode, you can also make your own synths
sosodev•20m ago
Does Qwen3-Omni support real-time conversation like GPT-4o? Looking at their documentation it doesn't seem like it does.

Are there any open weight models that do? Not talking about speech to text -> LLM -> text to speech, btw; I mean a real voice <-> language model.

edit:

It does support real-time conversation! Has anybody here gotten that to work on local hardware? I'm particularly curious if anybody has gotten it to work with a non-NVIDIA setup.

dsrtslnd23•14m ago
It seems to be able to do native speech-to-speech
sosodev•4m ago
It does for sure. I did some more digging and it does real-time too. That's fascinating.
binsquare•17m ago
Does anyone else find that there's a hard-to-pin-down lifelessness in the speech of these voice models?

Especially in the fruit pricing portion of the video for this model. Sounds completely normal but I can immediately tell it is AI. Maybe it's the intonation or the overly stable rate of speech?

colechristensen•11m ago
I'm perfectly ok with and would prefer an AI "accent".
esafak•10m ago
> Sounds completely normal but I can immediately tell it is AI.

Maybe that's a good thing?

sosodev•7m ago
I think it's because they've crammed vision, audio, multiple voices, prosody control, multiple languages, etc. into just 30 billion parameters.

I think ChatGPT has the most lifelike speech with their voice models. They seem to have invested heavily in that area while other labs focused elsewhere.

Lapel2742•5m ago
IMHO it's not lifeless. It's just not overly emotional. I definitely prefer it that way. I do not want the AI to be excited. It feels so contrived.

On the video itself: interesting, but "ideal" was pronounced wrong in German.

rarisma•14m ago
GPT-4o in the charts is crazy.
BoorishBears•4m ago
Why? gpt-realtime is finalized gpt-4o. Gemini Live is still 2.5.

Not their fault frontier labs are letting their speech to speech offerings languish.

A woman who discovered black holes

https://newhumanist.org.uk/articles/6296/the-woman-who-discovered-black-holes
1•MaysonL•49s ago•0 comments

World Spider Catalog

https://wsc.nmbe.ch/
1•fi-le•59s ago•0 comments

Tourists required to give 5 years social media history to enter US

https://www.dailymail.co.uk/news/article-15369957/Trump-foreign-tourists-social-media-history.html
2•testing22321•1m ago•0 comments

Pg_ClickHouse: A Postgres extension for querying ClickHouse

https://clickhouse.com/blog/introducing-pg_clickhouse
1•spathak•1m ago•0 comments

Writing Leads to Thinking (and Not the Other Way Around)

https://www.historians.org/perspectives-article/how-writing-leads-to-thinking-february-2010/
1•bryanrasmussen•2m ago•0 comments

IDF Soldiers Fire on UN Peacekeepers

https://unifil.unmissions.org/unifil-statement-10-december-2025
1•a_paddy•4m ago•0 comments

Token‑Efficient Agents: Building MCP‑Heavy Agents Without Burning Tokens

https://codeagentsalpha.substack.com/p/tokenefficient-agents-building-mcpheavy
1•olegkozlov•4m ago•1 comment

Scouts by Yutori is now generally available

https://yutori.com/scouts
7•abhshkdz•9m ago•0 comments

A smart cup for wireless, biofuel-powered, sweat-based Vitamin C sensing

https://www.sciencedirect.com/science/article/abs/pii/S0956566325009777?via%3Dihub
1•PaulHoule•10m ago•0 comments

Study: ~250 documents is all it takes to backdoor an LLM

https://www.searchenginejournal.com/ai-poisoning-black-hat-seo-is-back/561217/
2•rezamoaiandin•11m ago•1 comment

Morning coffee may protect the heart better than all-day coffee drinking

https://www.escardio.org/The-ESC/Press-Office/Press-releases/morning-coffee-may-protect-the-heart...
1•nateb2022•12m ago•0 comments

I built a Grafana your support team can use

https://github.com/towlabs/dashfrog
1•mehdig10•12m ago•0 comments

"My self-awareness of my limitations is limited."

1•niklai•14m ago•0 comments

How People Use AI at Work

https://dejan.ai/blog/report-ai-workplace/
1•gmays•14m ago•0 comments

Cobalt: static blogging on iPhone and iPad via iSH, with Rust Liquid templates

https://cobalt-org.github.io/
1•transpute•15m ago•1 comment

Show HN: Chefs.Video – A marketplace where you pay freelancers $0.005/second

https://chefs.video
1•ufvy•15m ago•1 comment

Stage Manager in Mac OS

https://blog.kowalczyk.info/til-stage-manager-in-mac-os.html
2•ericdanielski•15m ago•0 comments

You used to be able to just create a Native GUI App in 10 seconds

https://twitter.com/tsoding/status/1998403967718400376
2•Ezhik•15m ago•0 comments

Iksemel Rusted

https://thinkerf.blogspot.com/2025/12/iksemel-rusted.html
1•ciferkey•16m ago•0 comments

The moment the earliest known man-made fire was uncovered

https://www.bbc.co.uk/news/resources/idt-b9da7a6d-165b-492a-8785-235cd10e2e8e
2•fredley•16m ago•0 comments

A Journalist Reported from Palestine. YouTube Deleted His Account

https://theintercept.com/2025/12/07/youtube-deleted-journalist-israel-palestine-censorship/
1•upofadown•16m ago•0 comments

Show HN: Automate Windows Using Lua

https://lowkpro.com/
1•publicdebates•16m ago•0 comments

Monado 25.1.0: Enabling tomorrow's OpenXR experiences

https://www.collabora.com/news-and-blog/news-and-events/monado-25-1-0-enabling-tomorrows-openxr-e...
1•mfilion•17m ago•0 comments

Show HN: Vocation: AI Career Coach for Mid-Career Transitions

https://www.joinvocation.com/
1•cliffcmaxwell•18m ago•0 comments

Closing the Agent Loop

https://www.sawyerhood.com/blog/closing-the-agent-loop
2•sawyerjhood•18m ago•0 comments

Mandatory social media sharing and use of ESTA mobile app for entry to US

https://www.federalregister.gov/documents/2025/12/10/2025-22461/agency-information-collection-act...
3•beedeebeedee•20m ago•1 comment

Microsoft Scales Back AI Goals Because Almost Nobody Is Using Copilot

https://www.extremetech.com/computing/microsoft-scales-back-ai-goals-because-almost-nobody-is-usi...
2•mtdewcmu•21m ago•3 comments

MacKenzie Scott donated $7.16B in the last one year

https://yieldgiving.com/essays/we-are-the-ones-we-ve-been-waiting-for/
3•nani98•22m ago•0 comments

Super-Flat ASTs

https://jhwlr.io/super-flat-ast/
1•birdculture•22m ago•0 comments

Empromptu ($2M pre-seed): AI application builder with Self-Managing Context

1•anaempromptu•23m ago•0 comments