frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Qwen3-Max-Thinking Drops: 36T Tokens

2•SilasYee•2h ago
Alibaba has officially launched Qwen3-Max-Thinking, a trillion-parameter MoE flagship LLM pretrained on 36T tokens—double the corpus of Qwen 2.5—and it’s already matching or outperforming top-tier models like GPT-5.2-Thinking, Claude-Opus-4.5, and Gemini 3 Pro across 19 authoritative benchmarks. Its two core technical breakthroughs are what truly set it apart.

First, Adaptive Tool Calling: No manual prompts are needed—it autonomously invokes search engines, memory tools, and code interpreters based on task demands. This cuts down on hallucinations and boosts real-time problem-solving; for instance, coding tasks trigger automatic error correction loops, while research tasks combine search with context synthesis. Second, Test-Time Scaling (TTS): It outperforms standard parallel sampling by refining reasoning through iterative insights, with measurable jumps in key benchmarks—GPQA rose from 90.3 to 92.8, LiveCodeBench v6 hit 91.4 from 88.0, and IMO-AnswerBench climbed to 91.5 from 89.5.

Notably, its preview version even achieved 100% accuracy in tough math contests like AIME 25 and HMMT 25. The model runs smoothly on web/desktop demos, and its API is production-ready with adjustable thinking budgets (up to 80K tokens by default) to balance depth and speed. This isn’t just an incremental update—it’s a leap that closes the gap in reasoning and tool integration for real-world academic and engineering tasks.

Check it out: https://chat.qwen.ai/

Comments

imovie4•2h ago
if it was the llm u used to generate this i can't say i'm impressed
ChrisArchitect•1h ago
[dupe] Discussion: https://news.ycombinator.com/item?id=46766741

Qwen3-Max-Thinking Drops: 36T Tokens

2•SilasYee•2h ago•2 comments

Ask HN: How do you prevent children from accessing your products?

4•eastoeast•3h ago•4 comments

Ask HN: Is there a good open-source alternative to Adobe Acrobat?

8•sebastian_z•8h ago•4 comments

Ask HN: Gmail spam filtering suddenly marking everything as spam?

209•goopthink•2d ago•122 comments

Ask HN: What's the current best local/open speech-to-speech setup?

254•dsrtslnd23•3d ago•61 comments

Tell HN: I cut Claude API costs from $70/month to pennies

34•ok_orco•19h ago•20 comments

Ask HN: Do you have any evidence that agentic coding works?

458•terabytest•6d ago•452 comments

Ask HN: What software / applications can you now build thanks to AI

9•zarathustra333•15h ago•7 comments

Tell HN: 2 years building a kids audio app as a solo dev – lessons learned

136•oliverjanssen•5d ago•79 comments

Ask HN: Running UPDATEs in production always feels heavier than it should

3•Lucy_Bai•14h ago•3 comments

Ask HN: What are the most significant man-made creations to date?

16•George97•1d ago•24 comments

Ask HN: DDD was a great debugger – what would a modern equivalent look like?

41•manux81•21h ago•51 comments

Compiled Node.js 18 from source on jailbroken iPhone to run Claude Code

3•BryanTheCynic•17h ago•0 comments

Ask HN: Some great launch videos in recent times?

2•nemath•18h ago•0 comments

SHDL – A Minimal Hardware Description Language Built from Logic Gates

2•rafa_rrayes•19h ago•1 comments

Ask HN: Why are so many rolling out their own AI/LLM agent sandboxing solution?

32•ATechGuy•5d ago•16 comments

I'm posting this from a memory safe web browser

39•pizlonator•23h ago•3 comments

Ask HN: Revive a mostly dead Discord server

21•movedx•5d ago•29 comments

Ask HN: Have we confused efficiency with "100% utilization"?

27•nickevante•2d ago•22 comments

Ask HN: May an agent accept a license to produce a build?

26•athrowaway3z•2d ago•48 comments

Ask HN: What usually happens after a VC asks for a demo?

12•stijo•2d ago•6 comments

Ask HN: How to reach out to a commenter under an old submission (nick_m)?

4•jsumn•1d ago•4 comments

Ask HN: Why is cursor / Claude Code is so bad at generating readmes?

4•yakshithk_•18h ago•3 comments

Ask HN: Which common map projections make Greenland look smaller?

19•jimnotgym•6d ago•17 comments

Ask HN: Career transition question – assistance, MLOps guidance

4•Pierre_Esteves•1d ago•0 comments

Ask HN: Best practice securing secrets on local machines working with agents?

9•xinbenlv•4d ago•12 comments

Ask HN: Do you "micro-manage" your agents?

7•xinbenlv•3d ago•8 comments

Ask HN: Why does the number of datasets on data.gov vary so much?

8•akudha•1d ago•4 comments

Ask HN: Does DDG no longer honor "site:" prefix?

19•everybodyknows•3d ago•6 comments

Ask HN: Thinking about memory for AI coding agents

7•hoangnnguyen•2d ago•9 comments