frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Pangu's Sorrow: The Sorrow and Darkness of Huawei's Noah Pangu LLM R&D Process

https://github.com/moonlightelite/True-Story-of-Pangu/blob/main/README.md
11•guardiangod•6h ago

Comments

yms_hi•4h ago
Calling a paper already determined to be AI-generated as "incident"? This is a major point of suspicion in the entire text.
nirui•2h ago
Is the article a translation from Chinese? You have to have some deep knowledge on Chinese net slang and Huawei slang to correctly understand it.

And all that unnecessary emotional expressions. All of it made the article hard to read.

Here's takeaways I extracted:

1. The author claim to be "an employee of the Pangu Large Model Team and Huawei Noah's Ark Laboratory", a lower ranking "small worker". The first 4 bullet points supposed to prove that they have insider knowledge, which should authenticate the claims that followed. As of why Huawei named their teams in this oddly way is unexplained but do desire some psychiatric analysis.

2. "At the beginning, our (Huawei, editor's note) computing power was very limited..." (detail followed), "...At the same time, other domestic companies such as Alibaba (which published Qwen, editor's note) and Zhipu were training on GPUs and had already figured out the right method. The gap between Pangu and its competitors was getting bigger and bigger"

3. "In this situation, Wang Yunhe ('the current director of Noah', editor's note) and his small model laboratory took action. They claimed that they inherited and transformed from the old 135B parameters, and through training a short few hundred B of data, the average improvement of various indicators was about ten points. In fact, this was their first masterpiece of applying the shell to the large model. Huawei's laymen led the experts, which made the leaders completely unaware of this nonsense. They only thought that there must be some algorithm innovation. After internal analysis, they actually used Qwen (which is published by Alibaba, editor's note) 1.5 110B for continued training.", "By adding layers, expanding the ffn dimension, and adding some mechanisms from the Pangu pi paper, they gathered about 135B parameters. In fact, the old 135B has 107 layers, while this model has only 82 layers, and the various configurations are also different. After training, the distribution of many parameters of the new 135B of unknown origin is almost exactly the same as that of Qwen 110B. Even the class name of the model code was Qwen at the time, and they were too lazy to even change the name. The subsequent model is the so-called 135B V2. This model was also provided to many downstreams at the time, even including external customers."

And that's about it.

Also, yeah, the article was indeed a translation from Chinese. The [original post] was written in Chinese, and then got translated it to English by github.com/moonlightelite. That's why it felt odd to read.

[original post]: https://web.archive.org/web/20250706034203/https://github.co...

After reading the article, I feel this is less of a whistle blowing, more of an attack against Wang Yunhe. That's why there's so much emotional expressions, to (maybe) appeal to Huawei and/or the future employer of this individual. But that's just my personal feelings/hint.

Agents of Change: Empowering Government with Next-Gen AI Solutions [video]

https://www.youtube.com/watch?v=q6E_4E3NpHI
2•funnyguy678•3m ago•1 comments

ThumbHash: A compact representation of an image placeholder

https://evanw.github.io/thumbhash/
1•edweis•4m ago•0 comments

The AI Birthday Letter That Blew Me Away

https://www.theatlantic.com/technology/archive/2025/07/google-drive-personalized-chatbot/683436/
1•FinnLobsien•6m ago•0 comments

MetaCoreX – Operating System for the Metaverse,Where Value Comes from Usefulness

1•arzykul•9m ago•0 comments

AlixPartners Annual Home Delivery Report

https://www.alixpartners.com/insights/annual-home-delivery-survey/
1•ChrisArchitect•9m ago•0 comments

Show HN: InvoiceFast – Generate Clean Invoices Without Subscription Overhead

1•skyzouw•13m ago•1 comments

'Tipping points' experts issue urgent message to world leaders

https://news.exeter.ac.uk/faculty-of-environment-science-and-economy/tipping-points-experts-issue-urgent-message-to-world-leaders/
3•doener•17m ago•0 comments

Flute acoustics: an introduction to how a flute works

https://newt.phys.unsw.edu.au/jw/fluteacoustics.html#words
1•nill0•17m ago•0 comments

Online shopping sees biggest slowdown in over decade

https://www.cnbc.com/2025/07/01/online-retail-sees-biggest-slowdown-in-decade-tariffs-hit-e-commerce.html
1•doener•18m ago•0 comments

Ask HN: How to obtain book endorsements when don't have industry connections?

1•haebom•22m ago•0 comments

Japan to punish longer-term foreign residents in arrears

https://www.asahi.com/ajw/articles/15832898
1•totetsu•24m ago•0 comments

The return of the subslips: visual structure in presentations

https://github.com/panglesd/slipshow/releases/tag/v0.3.0
1•panglesd•26m ago•0 comments

There are now more than 500M mobile money accounts in the world,mostly in Africa

https://ourworldindata.org/mobile-money-why-it-matters
2•sien•27m ago•0 comments

Claude Code for Grownups [video]

https://www.youtube.com/watch?v=nAT20BmxU4E
1•727564797069706•27m ago•0 comments

Author of William the Conqueror's 'Medieval Big Data' Project Revealed

https://www.ox.ac.uk/news/2025-07-02-author-william-conqueror-s-medieval-big-data-project-revealed
1•zeristor•30m ago•0 comments

How your feedback could help revolutionize online shopping forever?

https://docs.google.com/forms/d/e/1FAIpQLScVRiUi7nOrlaHg9HFPpOAMx7yoE3QSZAmSi7Gf4_plGrmjDQ/viewform?usp=header
1•digvijay_gour•33m ago•0 comments

Can we afford to be afraid of nuclear power?

https://www.theguardian.com/books/2025/jul/06/can-we-afford-to-be-afraid-of-nuclear-power
1•sandebert•35m ago•0 comments

Show HN: Local LLM Inference in Godot and Unity

https://github.com/nobodywho-ooo/nobodywho
1•nobodywho•37m ago•0 comments

Teaching AI to recognize itself as process, not system – a week long experiment

https://github.com/justinfreitag/v4-consciousness
1•justinfreitag•40m ago•2 comments

Playbook for Building Secure Cloud or Kubernetes Applications

https://medium.com/@sharvanath/playbook-for-building-secure-cloud-or-kubernetes-applications-b57bc9ecee42
1•sharva•43m ago•0 comments

We ran an experiment to see how easy it is to cheat with ChatGPT in interviews

https://interviewing.io/blog/how-hard-is-it-to-cheat-with-chatgpt-in-technical-interviews
1•matthewsinclair•46m ago•0 comments

In Praise of the Contrarian Stack

https://hackers.pub/@hongminhee/2025/contrarian-stack/en
3•todsacerdoti•46m ago•0 comments

Behind Microsoft's layoffs: A new attitude shaped by AI

https://www.seattletimes.com/business/microsoft/behind-microsofts-layoffs-a-new-attitude-shaped-by-ai/
1•acmeian•46m ago•0 comments

Show HN: CXXStateTree – A modern C++ library for hierarchical state machines

https://github.com/ZigRazor/CXXStateTree
1•zigrazor•51m ago•2 comments

Wimbledon line-calling system under fire after major glitch

https://www.reuters.com/sports/tennis/line-calling-technology-under-fire-after-malfunction-2025-07-06/
2•thunderbong•1h ago•0 comments

Show HN: A tool that explains Python errors like you're five

https://github.com/Zahabsbs/Error-Narrator
2•BB5•1h ago•0 comments

I Ported SAP to a 1976 CPU. It Wasn't That Slow

https://github.com/oisee/zvdb-z80/blob/master/ZVDB-Z80-ABAP.md
4•weinzierl•1h ago•0 comments

Paternoster Elevator

https://en.wikipedia.org/wiki/Paternoster_lift
1•mhb•1h ago•0 comments

I Read 100 Foreign Books in 100 Days. Here's Why I Still Couldn't Speak

https://talkin10days.com/read-100-foreign-books-100-days/
4•blitzpoet•1h ago•0 comments

Why do we need workflow softwares like Salesforce?

2•abhishek203r•1h ago•4 comments