frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Size of Life

https://neal.fun/size-of-life/
387•eatonphil•2h ago•76 comments

DeepSeek uses banned Nvidia chips for AI model, report says

https://finance.yahoo.com/news/china-deepseek-uses-banned-nvidia-131207746.html
121•goodway•1h ago•84 comments

Is it a bubble?

https://www.oaktreecapital.com/insights/memo/is-it-a-bubble
23•saigrandhi•48m ago•1 comments

Australia begins enforcing world-first teen social media ban

https://www.reuters.com/legal/litigation/australia-social-media-ban-takes-effect-world-first-2025...
73•chirau•1d ago•158 comments

Qwen3-Omni-Flash-2025-12-01:a next-generation native multimodal large model

https://qwen.ai/blog?id=qwen3-omni-flash-20251201
74•pretext•2h ago•33 comments

Auto-grading decade-old Hacker News discussions with hindsight

https://karpathy.bearblog.dev/auto-grade-hn/
16•__rito__•55m ago•5 comments

Why the Sanitizer API is just `setHTML()`

https://frederikbraun.de/why-sethtml.html
40•birdculture•1d ago•17 comments

Factor 0.101 now available

https://re.factorcode.org/2025/12/factor-0-101-now-available.html
37•birdculture•6h ago•3 comments

9 Mothers (YC X26) Is Hiring

https://app.dover.com/jobs/9mothers
1•ukd1•1h ago

Launch HN: InspectMind (YC W24) – AI agent for reviewing construction drawings

18•aakashprasad91•2h ago•8 comments

COM Like a Bomb: Rust Outlook Add-in

https://tritium.legal/blog/outlook
39•piker•3h ago•14 comments

Qualcomm acquires RISC-V focused Ventana Micro Systems

https://www.qualcomm.com/news/releases/2025/12/qualcomm-acquires-ventana-micro-systems--deepening...
31•fork-bomber•2h ago•33 comments

Golang's big miss on memory arenas

https://avittig.medium.com/golangs-big-miss-on-memory-arenas-f1375524cc90
38•andr3wV•6d ago•23 comments

Valve: HDMI Forum Continues to Block HDMI 2.1 for Linux

https://www.heise.de/en/news/Valve-HDMI-Forum-Continues-to-Block-HDMI-2-1-for-Linux-11107440.html
62•OsrsNeedsf2P•59m ago•18 comments

Gundam is just the same as Jane Austen but happens to include giant mech suits

https://eli.li/gundam-is-just-the-same-as-jane-austen-but-happens-to-include-giant-mech-suits
7•surprisetalk•1w ago•0 comments

Volcanic eruptions set off a chain of events that brought Black Death to Europe

https://www.cam.ac.uk/stories/volcanoes-black-death
43•gmays•4d ago•4 comments

Typewriter Plotters (2022)

https://biosrhythm.com/?p=2143
25•LaSombra•5d ago•0 comments

Super-Flat ASTs

https://jhwlr.io/super-flat-ast/
25•mmphosis•6d ago•1 comments

RoboCrop: Teaching robots how to pick tomatoes

https://phys.org/news/2025-12-robocrop-robots-tomatoes.html
17•smurda•2h ago•7 comments

Revisiting "Let's Build a Compiler"

https://eli.thegreenplace.net/2025/revisiting-lets-build-a-compiler/
208•cui•11h ago•35 comments

Deprecations via warnings don't work for Python libraries

https://sethmlarson.dev/deprecations-via-warnings-dont-work-for-python-libraries
18•scolby33•2d ago•20 comments

England Historic Aerial Photo Explorer

https://historicengland.org.uk/images-books/archive/collections/aerial-photos/
17•davemateer•2h ago•3 comments

Map of all the buildings in the world

https://gizmodo.com/literally-a-map-showing-all-the-buildings-in-the-world-2000694696
137•dr_dshiv•5d ago•47 comments

PeerTube is recognized as a digital public good by Digital Public Goods Alliance

https://www.digitalpublicgoods.net/r/peertube
649•fsflover•1d ago•140 comments

Israel used Palantir technologies in pager attack in Lebanon

https://the307.substack.com/p/revealed-israel-used-palantir-technologies
111•cramsession•3h ago•49 comments

Rust in the kernel is no longer experimental

https://lwn.net/Articles/1049831/
863•rascul•15h ago•635 comments

In New York City, congestion pricing leads to marked drop in pollution

https://e360.yale.edu/digest/new-york-congestion-pricing-pollution
336•Brajeshwar•2h ago•328 comments

Cloth Simulation

https://cloth.mikail-khan.com/
155•adamch•1w ago•31 comments

New benchmark shows top LLMs struggle in real mental health care

https://swordhealth.com/newsroom/sword-introduces-mindeval
84•RicardoRei•4h ago•115 comments

Show HN: Gemini Pro 3 imagines the HN front page 10 years from now

https://dosaygo-studio.github.io/hn-front-page-2035/news
3217•keepamovin•1d ago•916 comments
Open in hackernews

Qwen3-Omni-Flash-2025-12-01:a next-generation native multimodal large model

https://qwen.ai/blog?id=qwen3-omni-flash-20251201
71•pretext•2h ago

Comments

dvh•1h ago
I asked: "How many resistors are used in fuzzhugger phantom octave guitar pedal?". It replied 29 resistors and provided a long list. Answer is 2 resistors: https://tagboardeffects.blogspot.com/2013/04/fuzzhugger-phan...
iFire•1h ago
> How many resistors are used in fuzzhugger phantom octave guitar pedal?

Weird, as someone not having a database of the web, I wouldn't be able to calculate either result.

iFire•1h ago
I tend to pick things where I think the answer is in the introduction material like exams that test what was taught.
dvh•1h ago
"I don't know" would be perfectly reasonable answer
kaoD•42m ago
> as someone not having a database of the web, I wouldn't be able to calculate either result

And that's how I know you're not an LLM!

esafak•1h ago
This is just trivia. I would not use it to test computers -- or humans.
parineum•1h ago
Everything is just trivia until you have a use for the answer.

OP provided a we link with the answer, aren't these models supposed to be trained on all of that data?

esafak•1h ago
There is nothing useful you can do with this information. You might as well memorize the phone book.

The model has a certain capacity -- quite limited in this case -- so there is an opportunity cost in learning one thing over another. That's why it is important to train on quality data; things you can build on top of.

DennisP•15m ago
Just because it's in the training data doesn't mean the model can remember it. The parameters total 60 gigabytes, there's only so much trivia that can fit in there so it has to do lossy compression.
brookst•1h ago
Where did you try it? I don’t see this model listed in the linked Qwen chat.
mettamage•1h ago
I wonder if with that music analysis mode, you can also make your own synths
sosodev•1h ago
Does Qwen3-Omni support real-time conversation like GPT-4o? Looking at their documentation it doesn't seem like it does.

Are there any open weight models that do? Not talking about speech to text -> LLM -> text to speech btw I mean a real voice <-> language model.

edit:

It does support real-time conversation! Has anybody here gotten that to work on local hardware? I'm particularly curious if anybody has run it with a non-nvidia setup.

dsrtslnd23•1h ago
it seems to be able to do native speech-speech
sosodev•1h ago
It does for sure. I did some more digging and it does real-time too. That's fascinating.
binsquare•1h ago
Does anyone else find that there's hard to pin down reason of life-lessness in the speech of these voice models?

Especially in the fruit pricing portion of the video for this model. Sounds completely normal but I can immediately tell it is ai. Maybe it's intonation or the overly stable rate of speech?

colechristensen•1h ago
I'm perfectly ok with and would prefer an AI "accent".
esafak•1h ago
> Sounds completely normal but I can immediately tell it is ai.

Maybe that's a good thing?

sosodev•1h ago
I think it's because they've crammed vision, audio, multiple voices, prosody control, multiple languages, etc into just 30 billion parameters.

I think ChatGPT has the most lifelike speech with their voice models. They seem to have invested heavily in that area while other labs focused elsewhere.

Lapel2742•1h ago
IMHO it's not lifeless. It's just not overly emotional. I definitely prefer it that way. I do not want the AI to be excited. It feels so contrived.

On the video itself: Interesting, but "ideal" was pronounced wrong in German. For a promotional video, they should have checked that with native speakers. On the other hand its at least honest.

rarisma•1h ago
GPT4o in the charts is crazy.
BoorishBears•1h ago
Why? gpt-realtime is finalized gpt-4o. Gemini Live is still 2.5.

Not their fault frontier labs are letting their speech to speech offerings languish.

banjoe•58m ago
Wow, crushing 2.5 Flash on every benchmark is huge. Time to move all of my LLM workloads to a local GPU rig.
embedding-shape•51m ago
Just remember to benchmark it yourself first with you private task collection, so you can actually measure them against each other. Pretty much any public benchmark is unreliable at this moment, and making model choices based on other's benchmarks is bound to leave you disappointed.
gardnr•41m ago
This is a 30B parameter MoE with 3B active parameters and is the successor to their previous 7B omni model. [1]

You can expect this model to have similar performance to the non-omni version. [2]

There aren't many open-weights omni models so I consider this a big deal. I would use this model to replace the keyboard and monitor in an application while doing the heavy lifting with other tech behind the scenes. There is also a reasoning version, which might be a bit amusing in an interactive voice chat if it pronounces the thinking tokens while working through to a final answer.

1. https://huggingface.co/Qwen/Qwen2.5-Omni-7B

2. https://artificialanalysis.ai/models/qwen3-30b-a3b-instruct

gardnr•22m ago
I can't find the weights for this new version anywhere. I checked modelscope and huggingface. It looks like they may have extended the context window to 200K+ tokens but I can't find the actual weights.
pythux•18m ago
They link to: https://huggingface.co/collections/Qwen/qwen3-omni-68d100a86... from the blog post but it does seem like this redirects to their main space on HF so maybe they didn't yet make the model public?
olafura•14m ago
Looks like it's not open source: https://www.alibabacloud.com/help/en/model-studio/qwen-omni#...
coder543•7m ago
[delayed]
Aissen•35m ago
Is this a new proprietary model?
sim04ful•27m ago
The main issue I'm facing with realtime responses (speech output) is how to separate non-diegetic outputs (e.g thinking, structured outputs) from outputs meant to be heard by the end user.

I'm curious how anyone has solved this

stevenhuang•25m ago
Wayback for those that can't reach https://web.archive.org/web/20251210164048/https://qwen.ai/b...
terhechte•24m ago
Is there a way to run these Omni models on a Macbook quantized via GGUF or MLX? I know I can run it in LMStudio or Llama.cpp but they don't have streaming microphone support or streaming webcam support.

Qwen usually provides example code in Python that requires Cuda and a non-quantized model. I wonder if there is by now a good open source project to support this use case?

aschobel•16m ago
Looks to be API only. Bummer.