https://www.reuters.com/technology/googles-gemini-co-lead-no...
https://www.reuters.com/technology/googles-gemini-co-lead-no...
Besides that, Google is in a pretty good position, they're not bleeding money on AI like Anthropic/OpenAI, and they own product verticals where they can integrate it. Plus they have a mature ads-model which is what might actually drive a bit of revenue for LLMs.
Noam has a deep expertise in these systems at every level, both algorithmically and at production scale, and knows how to leverage things at different levels.
It's not like Google won't have anyone else that can do what he does, but at the same time, it's an implicit criticism of Google's culture, operations, development, and overall AI program. Shazeer is well past the point where the paycheck is the deciding factor, although I'm certain he is very well paid. Having the freedom to innovate and build free from the corporate fuckery of Google and Facebook is probably more valuable than the pay raise he got with the move, and OAI has the advantage of not having to cope with decades of corporate cruft and inertia. They'll get there - all corporations do - but they're relatively young enough to still be nimble.
I hope this is not accurate but I'm afraid it is: https://x.com/signulll/status/2067446889956430273
Sama, and any other founder, will always have a difficult fight against bureaucracy, and once you let a little bit in, the bureaucracy's sole purpose becomes to grow itself.
I wouldn't expect OpenAI to start releasing open weight competitive models again, but I could be wrong.
Seems like there are some insights here!
edit: it seems the post has been removed but comments are viewable.
1 liner summary:
To put it lightly, the dude was politically outspoken and held strong beliefs.
I had had a boss (from a YC-funded company, no less) that behaved in this way. Talked down on me with the g-slur, used language barriers to alienate his peers, and demanded religious sensitivity whenever we met after work. His entire life was defined by this religiously insecure identity, and several meetings were derailed by him thinking he was slighted by the rest of the team. That led to team members avoiding him, which reaffirmed his perception of being discriminated against. In reality we were all just baffled by his inability to adopt a secular work ethic.
As a queer person I could partially empathize with his behavior. Some of the smartest queer people I know are also frustrated, downtrodden and crass in protest of their mistreatment. But they're also generally grounded people that buckle down at work and get things done. They don't accuse people of being bigoted, lash out at coworkers or use slurs in the office. Perhaps it helps that queer identity isn't eschatological in nature, but that's only my best guess.
He left Google in 2021 to co-found Character.AI. In 2024, Google brought him and some Character.AI researchers back via a licensing/talent deal with Character.AI (reportedly around $2.7B). He was then made a Gemini co-lead.
Now he’s leaving Google again for OpenAI.
Exciting times!
Nice guys.
If you got your girlfriend/boyfriend by sneakily convincing them to cheat on their partner, don't be surprised if you are the next one to be cheated on.
What a waste of money.
also "empty handed" is just unnecesarily dramatic, he left all the knwoledge base he helped build, that's google's IP and is worth m(b?)illions
(recall that OpenAI thought GPT-2 was too powerful to release for approximately tantamount reasons)
As an outsider, I'd be really curious to understand why, given how well positioned they seem to be in the AI battle:
- huge, quasi unmatched data war chest
- huge, quasi unmatched, planet-scale infrastructure
- native AI chip design and production (TPU)
- the core ideas for what we now know as "AI" were invented there
- deepmind, enough said
- pretty much the deepest pocket of all the AI players with the possible exception of MSFT
- a massively large user base and reach to deploy AI to (Android, YT, Cloud, Search, Email, ...)
- supposedly one the best engineering culture of the valley
Why do the best people leave ?
Why do their AI product always come in 3rd place ?
Why can't they seem to take the lead, both in terms of product design or in term of raw LLM performance?
The only answer I can think of is:
- culture is completely broken
- management sucks something fierce
- company is so fat and rich no one is actually interested in winning anymore
Also, why didn't they nail him down contractually when they bought character.ai ... isn't that pretty standard with these type of superstar (re)hires?
OpenAI is in a unique position right now to grant pre-IPO options (probably in the form of RSUs). And they wanted him badly enough to grant the extra options necessary to effectively 'buy out' whatever unvested Google bonus he's walking away from.
LOL.
Question two: Why are OpenAI spending that money taking talent from Google, who can definitely outspend them for talent, and not Anthropic, who are leading the market and are at least somewhat financially constrained.
I doubt that the money had anything to do with it.
I also doubt that the state of the technology at OAI vs. Google had much to do with it, Google is behind no doubt, but the gap is not as far as we know, insurmountable.
I suspect that this is a leadership clash. Noam was working in GDM. GDM somehow went away from coding and RSI into "world models" and that has played out very poorly. Who made that call? Who was still playing politics?
Given this is Noam the list of people that could be pissing him off is very small: Demis, Sergey (?!), a couple of VPs in GDM.
What the hell happened?
It gives some context on the contributions of each of the authors. About Shazeer, from the article:
Shazeer’s joining the group was critical. “These theoretical or intuitive mechanisms, like self-attention, always require very careful implementation, often by a small number of experienced ‘magicians,’ to even show any signs of life,” says Uszkoreit. Shazeer began to work his sorcery right away. He decided to write his own version of the transformer team’s code. “I took the basic idea and made the thing up myself,” he says. Occasionally he asked Kaiser questions, but mostly, he says, he “just acted on it for a while and came back and said, ‘Look, it works.’” Using what team members would later describe with words like “magic” and “alchemy” and “bells and whistles,” he had taken the system to a new level.
1. There are already multiple "sota" models on the market that compete with only marginal gains between them (OpenAI, Anthropic, Google/Gemini) and some that are catching up (DeepSeek, Qwen,..).
2. The fact that something is a hard engineering problem does not mean it's generating revenue. So while what you said is true, deep expertise is required to push the industry forward, I don't think that is going to matter for the bottom line of these companies. Hence why I think the models don't give a company any 'moat' in a capitalist economy.
Possibly true. Any smart innovations developed by one organization will be smuggled into others.
Training, inferring, and data collection, infrastructures are definitely moats. High-volume usage feedback is also hard to come by for new entrants.
Grabbing market-share if you have investors that are ready to burn cash infinetely. Find a hot niche, buy a banana 1 USD, sell it for 0.10 USD.
Example: Cursor, they became popular because they were selling ChatGPT unlimited for 20 USD / month.
When they launched, just a reskinned VS Code, "fastest growing AI company"
No coincidence they were bought by SpaceX, who wants to consolidate revenue even if non-sense as long it helps other investors to exit. It shows rapid growth.
Profit is the real moat.
One example: Nvidia. Proprietary tooling, proprietary IP, proprietary hardware, no alternative, expensive.
Don't we all want to (automatically) and passively invest in a company losing billions of dollars ?
At least we can diversify our portfolio from SpaceX.
That's their moat.
Maybe also stolen copyrighted content that cannot be found anywhere else now, so they are the only ones who can train on it.
If the issue is inefficiency, tons of meetings, too much team alignment etc, then that's the issue that you need to tackle, and these issues can already appear in a 50-100 employee company. Sure, that's an easy problem to solve with a smaller size but unless you hired people for no reason, these people have a very specific set of problems to tackle and are often, in these companies, the best in class to tackle them, culling half of the company isn't going to make things better.
(And X rehired part of the laid-off engineers)
The leaps forward need bloat. A startup can execute on specific vector direction way better.
Now back to your point, what did X deliver with its lean ops? It seems that it needed 2 bailouts (one from xAI, and one from space X)
and tens of losing companies that make balloons or whatnot
Sadly the gap between reality and satire has shrunk.
But yes. I also wish that show would come back.
Noam shazeer would be google head dreamer
Equal contribution. Listing order is random. Jakob proposed replacing RNNs with self-attention and started the effort to evaluate this idea. Ashish, with Illia, designed and implemented the first Transformer models and has been crucially involved in every aspect of this work. Noam proposed scaled dot-product attention, multi-head attention and the parameter-free position representation and became the other person involved in nearly every detail. Niki designed, implemented, tuned and evaluated countless model variants in our original codebase and tensor2tensor. Llion also experimented with novel model variants, was responsible for our initial codebase, and efficient inference and visualizations. Lukasz and Aidan spent countless long days designing various parts of and implementing tensor2tensor, replacing our earlier codebase, greatly improving results and massively accelerating our research.
In any case, if the authors considered their contributions equal, that's good enough for me.Uszkoreit wanted to build a more efficient/scalable language/seq2seq model that could take advantage of GPU parallelism (replacing RNNs which were the main approach to sequence modelling at that time).
Uszkoreit's insight was that although language appears sequential, it is in fact really part parallel part hierarchical, as can be seen by linguist's sentence parse trees where at each level there is parallelism/independence between the branches of the tree, with them getting combined at the next level up. This is what gave rise to the idea of a model that consisted of a stack of of parallel processing layers (transformer layers). I believe that attention was also part of the plan from day one, as this has already been proven to be valuable (Bahdanau) with RNN seq2seq modelling.
So, this is what Uszkoreit wanted to build, but by his own account he failed to come up with an implementation that matched or outperformed the prevailing RNN approach that he wanted to replace. At this point, Uszkoreit mentioned the idea to Shazeer, who got on board and eventually arrived at a performant architecture which was then pared back by an ablation process resulting in the initial encoder-decoder Transformer architecture. Shazeer later came up with the mixture-of-experts architecture, and also other optimizations after he left to found character.ai
What they're working on is just making peoples jobs, skills obsolete and trying to invent machines that will concentrate the worlds wealth into the hands of the people who own those machines.
Popular entertainment and unique progress of human civilization can’t be really compared either
It's funny, but with the AI hires/moves it feels more like satire now.
I always appreciated Jeff having a level head ... which this article seems to confirm:
https://www.yahoo.com/news/articles/google-cracks-down-posts...
But the "I don't believe that humans have an attribute called gender" is such a comically stupid take. It is just rejecting the entire concept that is at play; and when it comes down to it, this is the only argument that anti-trans people can come up with: this distinction between sex and gender, that clearly clearly exists, ...doesn't exist?
Like, forget the moral questions all of this entails: from purely a "I'm an intelligent person crafting a logical argument" perspective, I'd be _embarrassed_ to put this one forward. If I have to retreat my entire argument to an introduced axiom that says I believe as a foundational principle that the thing you have presented (that gender exists) and have demonstrated ample evidence of (there are so so many non-biological traits heavily correlated with gender, and they vary across societies, thus demonstrating that societal factors are _very_ likely to be causal) does not exist, then this would absolutely gnaw at me on the inside.
Novo Nordisk hired you to find a cure for obesity.
- This is your full time job, and this is what you are paid for. The company also invests in a lab, in machines, in other employees, etc, so all of you together can figure out.
You find Wegovy, and poof, you run away with the recipe and sell the product on your own.
- Yes, you just scammed your boss, you made him believe that you were working for him, but actually you were using the company resources to your sole benefit.
It's not about loyalty, it's about integrity.
It's the same type of people whom you hire and pay to develop a platform, and then they steal the code, and never deliver this platform to you. Terrible business practices, but isn't it how Facebook happened too ?
Unless you think that employees are like indentured servants, and Novo Nordisk owns not only Wegovy but the people who work on it.
should he have been obligated to stay at google for the rest of his career?
I have no dog in this race as I'm not fond of either OpenAI or Google.... but employees not being loyal to their big tech employers is a wild thing to be concerned about in 2026 when year after year many large tech companies (Google very prominently among them) continually post record profits and still lay people off by the thousands.
The Attention is all you need paper has Google logo, not character.ai
Not really.
Altman couldn't code his way out of a wet paper bag.
Noam is OTOH and IIUC the real deal.
Based on what?
C'mon people, if you don't know Noam personally, who are you to fling such accusations?
I really hate the low bar of HN discussions lately. It's late-Slashdot-level. Brrr.
"Google and Character.AI agree to settle lawsuits over teen suicides" - https://www.axios.com/2026/01/07/google-character-ai-lawsuit...
Be aware...very disturbing: https://www.judiciary.senate.gov/imo/media/doc/e2e8fc50-a9ac...
Google lost three critical years chasing AGI, and got acquired by SpaceX, now a Dyson Sphere startup whose pitch deck is just: "What if we put a paywall around the Sun?"
What is exiting about this?
HPMOR•20h ago
alexcos•14h ago
tmule•1h ago
david_shi•1h ago
john_strinlai•35m ago