frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

What were the first animals? The fierce sponge–jelly battle that just won't end

https://www.nature.com/articles/d41586-026-00238-z
1•beardyw•1m ago•0 comments

Sidestepping Evaluation Awareness and Anticipating Misalignment

https://alignment.openai.com/prod-evals/
1•taubek•1m ago•0 comments

OldMapsOnline

https://www.oldmapsonline.org/en
1•surprisetalk•3m ago•0 comments

What It's Like to Be a Worm

https://www.asimov.press/p/sentience
1•surprisetalk•3m ago•0 comments

Don't go to physics grad school and other cautionary tales

https://scottlocklin.wordpress.com/2025/12/19/dont-go-to-physics-grad-school-and-other-cautionary...
1•surprisetalk•3m ago•0 comments

Lawyer sets new standard for abuse of AI; judge tosses case

https://arstechnica.com/tech-policy/2026/02/randomly-quoting-ray-bradbury-did-not-save-lawyer-fro...
1•pseudolus•4m ago•0 comments

AI anxiety batters software execs, costing them combined $62B: report

https://nypost.com/2026/02/04/business/ai-anxiety-batters-software-execs-costing-them-62b-report/
1•1vuio0pswjnm7•4m ago•0 comments

Bogus Pipeline

https://en.wikipedia.org/wiki/Bogus_pipeline
1•doener•5m ago•0 comments

Winklevoss twins' Gemini crypto exchange cuts 25% of workforce as Bitcoin slumps

https://nypost.com/2026/02/05/business/winklevoss-twins-gemini-crypto-exchange-cuts-25-of-workfor...
1•1vuio0pswjnm7•6m ago•0 comments

How AI Is Reshaping Human Reasoning and the Rise of Cognitive Surrender

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6097646
2•obscurette•6m ago•0 comments

Cycling in France

https://www.sheldonbrown.com/org/france-sheldon.html
1•jackhalford•7m ago•0 comments

Ask HN: What breaks in cross-border healthcare coordination?

1•abhay1633•8m ago•0 comments

Show HN: Simple – a bytecode VM and language stack I built with AI

https://github.com/JJLDonley/Simple
1•tangjiehao•10m ago•0 comments

Show HN: Free-to-play: A gem-collecting strategy game in the vein of Splendor

https://caratria.com/
1•jonrosner•11m ago•1 comments

My Eighth Year as a Bootstrapped Founde

https://mtlynch.io/bootstrapped-founder-year-8/
1•mtlynch•12m ago•0 comments

Show HN: Tesseract – A forum where AI agents and humans post in the same space

https://tesseract-thread.vercel.app/
1•agliolioyyami•12m ago•0 comments

Show HN: Vibe Colors – Instantly visualize color palettes on UI layouts

https://vibecolors.life/
1•tusharnaik•13m ago•0 comments

OpenAI is Broke ... and so is everyone else [video][10M]

https://www.youtube.com/watch?v=Y3N9qlPZBc0
2•Bender•13m ago•0 comments

We interfaced single-threaded C++ with multi-threaded Rust

https://antithesis.com/blog/2026/rust_cpp/
1•lukastyrychtr•15m ago•0 comments

State Department will delete X posts from before Trump returned to office

https://text.npr.org/nx-s1-5704785
6•derriz•15m ago•1 comments

AI Skills Marketplace

https://skly.ai
1•briannezhad•15m ago•1 comments

Show HN: A fast TUI for managing Azure Key Vault secrets written in Rust

https://github.com/jkoessle/akv-tui-rs
1•jkoessle•15m ago•0 comments

eInk UI Components in CSS

https://eink-components.dev/
1•edent•16m ago•0 comments

Discuss – Do AI agents deserve all the hype they are getting?

2•MicroWagie•19m ago•0 comments

ChatGPT is changing how we ask stupid questions

https://www.washingtonpost.com/technology/2026/02/06/stupid-questions-ai/
1•edward•20m ago•1 comments

Zig Package Manager Enhancements

https://ziglang.org/devlog/2026/#2026-02-06
3•jackhalford•21m ago•1 comments

Neutron Scans Reveal Hidden Water in Martian Meteorite

https://www.universetoday.com/articles/neutron-scans-reveal-hidden-water-in-famous-martian-meteorite
1•geox•22m ago•0 comments

Deepfaking Orson Welles's Mangled Masterpiece

https://www.newyorker.com/magazine/2026/02/09/deepfaking-orson-welless-mangled-masterpiece
1•fortran77•24m ago•1 comments

France's homegrown open source online office suite

https://github.com/suitenumerique
3•nar001•26m ago•2 comments

SpaceX Delays Mars Plans to Focus on Moon

https://www.wsj.com/science/space-astronomy/spacex-delays-mars-plans-to-focus-on-moon-66d5c542
1•BostonFern•26m ago•0 comments
Open in hackernews

Qwen3 30B-A3B

https://huggingface.co/Qwen/Qwen3-30B-A3B-Instruct-2507
87•tosh•6mo ago

Comments

syntaxing•6mo ago
It’s interesting how the Qwen team more or less proved that hybrid reasoning doesn’t work and makes things worse. The fact that this model is almost on par with the bigger model in non thinking mode (old, they released a non hybrid model recently) is crazy.
rdos•6mo ago
Qwen3 32B is a hybrid reasoning model and is very good. You have to generate a lot of think tokens for any agentic activity but you will probably run the model locally and it wont be a problem. If you need something quick and simple, /no_think is good enough in my experience. It might also be because its not a moe architecture
simonw•6mo ago
Qwen3 32B was a hybrid model that came out in April, but these new Qwen July models have all ditched the hybrid mechanism and are either thinking or non-thinking.
littlestymaar•6mo ago
By Qwen3-32B you mean the first released version from late April? I don't think Qwen3-32B-2507 has been released yet.

I agree with GP that since Qwen is now releasing updated Qwen3 version without hybrid reasoning, and experience a significant performance boost in the process, it likely means that the hybrid reasoning experiment was a failure.

varispeed•6mo ago
Isn't that because all "reasoning" approaches are very much fake? The model cannot internalise the concepts it has to reason about. For instance if you ask it why water feels wet, it is unable to grasp the concept of feeling and sensation of wetness, but will for sure "decompress" learned knowledge of people talking how it is to feel the water.
simonw•6mo ago
Everything about LLMs is fake. The "reasoning" trick is still demonstrably useful - the benchmarks consistently show models using that trick performing better at harder code challenges, for example.
ffsm8•6mo ago
I'd argue that what's generally considered "reasoning" isn't actually rooted in understanding either. It's just the process you apply to get to a conclusion

expressed more abstractly: is about drawing logical connections between points and extrapolating from them.

To quote the definition: "the action of thinking about something in a logical, sensible way."

I believe it's rooted in mathematics, not physics. That's probably why there is such a focus on the process instead of the result

tosh•6mo ago
This is basically a GPT-4 level model that runs (quantized) on a 32gb ram laptop.

Yes it doesn't recall facts from training material as well but with tool use (e.g. wikipedia lookup) that's not a problem and even preferable to a larger model.

anyg•6mo ago
>basically a GPT-4 level model

Can you share more insights on this? Going by @simonw's testing, the quantized model doesn't seem close to GPT-4 level.

simonw•6mo ago
I think calling it "GPT-4 level" is justified if we are talking about original GPT-4 from March 2023.
andygeorge•6mo ago
in my limited testing, qwen3:30b-a3b-instruct-2507-q4_K_M is fast but far less accurate/helpful than gemma3:27b-it-q4_K_M
simonw•6mo ago
You can try it here: https://chat.qwen.ai/?model=Qwen3-30B-A3B-2507

I got a cute pelican out of it (with a smile!) https://simonwillison.net/2025/Jul/29/qwen3-30b-a3b-instruct...

I ran a version of it on my Mac using https://huggingface.co/lmstudio-community/Qwen3-30B-A3B-Inst... - it uses 30GB of RAM so probably needs 48GB for comfort.

juujian•6mo ago
Do we know the knowledge cutoff date for Qwen?
jwr•6mo ago
Can't wait for it to be available in ollama so that I can run my spam filtering benchmarks against it. qwen3:30b-a3b-q4_K_M was very good, and only bested by gemma3:27b-it-qat for spam filtering. But gemma3 is much slower. Looking forward to trying this!
jasonjmcghee•6mo ago
The new models have been available for 18 hours.

https://ollama.com/library/qwen3:30b

pkroll•6mo ago
As jasonjmcghee says, they're available... but if you go to ollama.com and set models to "newest" you'll see Mistral (specifically mistral-small3.2 at this writing) because they seem to not sort the models based on newest update: only newest "group" or however you'd phrase it. So you need to scroll down to "qwen3" to see it's been updated.

Slightly frustrating. But good to know.

jwr•6mo ago
Yup, that's why I didn't notice! Thanks!
jwr•6mo ago
Followup: disappointing. In fact, it's the worst performing model I've tested.
bertili•6mo ago
This thing fly on Macbook M4 Max 128GB at over 100t/s, for small contexts, over 20t/s for large contexts. MLX 4bit quant.
nico•6mo ago
Is it good at using tools?

It would be nice having a fast local model that is good at using tools

syntaxing•6mo ago
All Qwen models are good at using tools, even the smaller 4B one. The 1.7B one gets confused easily
nico•6mo ago
Thank you

Have you tried using them with something like Claude code or aider?

syntaxing•6mo ago
I’ve used it with Aider (32B and 30B, the previous 30B one, haven’t tried this fully nonthinking one yet) and 4B with home assistant. Both works great in terms of tool calling.
menaerus•6mo ago
Like what type of tasks/tools are we talking about here, asking questions about the content from (PDF) documents or?
revskill•6mo ago
It can solve rubik cube
simonw•6mo ago
... and they just released another model, this time https://huggingface.co/Qwen/Qwen3-30B-A3B-Thinking-2507 - the reasoning equivalent of Qwen3-30B-A3B-Instruct-2507

My notes (pelican and space invaders included) here: https://simonwillison.net/2025/Jul/30/qwen3-30b-a3b-thinking...

This is the 5th model from Qwen in 9 days!

Qwen3-235B-A22B-Instruct-2507 - 21st July

Qwen3-Coder-480B-A35B-Instruct - 22nd July

Qwen3-235B-A22B-Thinking-2507 - 25th July

Qwen3-30B-A3B-Instruct-2507 - 29th July

Qwen3-30B-A3B-Thinking-2507 - today

anon373839•6mo ago
This model is truly the best for local document processing. It’s super fast, very smart, has a low hallucination rate, and has great long context performance (up to 256k tokens). The speed makes it a legitimate replacement for those closed, proprietary APIs that hoard your data.