frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Apple reveals new AI architecture built around Google Gemini models

https://www.macrumors.com/2026/06/08/apple-reveals-new-ai-architecture/
110•unclefuzzy•1h ago

Comments

bensyverson•39m ago
I would love to learn more about what's actually powering Apple Intelligence now. Are they using flagship Gemini models behind their own prompts? Fine-tuning? Pre-training their own models based on Gemini?

Is there a meaningful distinction between the Gemini-powered models and Apple Foundation Models? Does that distinction vary for on-device vs hosted models? Are some models running on Apple's Private Cloud Compute and others running on Google iron?

pishpash•31m ago
Gemini (at least public free version) hallucinates way too much. If it's like that, it can go very badly for Apple.
ComputerGuru•27m ago
I used Gemini exclusively via the API but downloaded the app last week for something. Even on max settings, it is ridiculously nerfed!
t0mas88•23m ago
The public version of Gemini is ridiculous. At least half their search "answers" are just wrong. If you then start a follow up chat the answers change but usually still half wrong.

Search would be better without the added AI hallucinations above it. If I want an AI answer I'll go and ask Claude, the quality difference is huge.

dyauspitr•12m ago
It has to be really because think of how fast it has to come up with an answer (ie time for a regular google query) and the immense scale of billions of people querying it many times a day, all for free.
Melatonic•11m ago
Local is probably similar to Gemma e4b you can get right now on Google Edge Gallery (the ios and Android app). Guessing that the more powerful version that will only work on the 12gb ram devices will be something unreleased that is similar but a bit larger

Google also awhile back announced being able to run full Gemini by leasing / renting hardware in your own datacenters so companies can train or access data without needing to send things to their datacenters. Nvidia based. Guessing Private Compute might just be Apple leasing a ton of those?

wmf•3m ago
Apple Private Cloud Compute is running on M2/M3 Ultra. I'm not sure if Gemini Flash can fit in that amount of RAM.
microflash•38m ago
Not launching in EU feels like a smell. It does look interesting enough for me to try it out before disabling Apple Intelligence again.
kmeisthax•36m ago
The smell is that Apple doesn't want to give the same level of access to third-party AI assistants that Siri will get.

For what it's worth, Apple claimed they proposed an "equivalent access" framework with some kind of "trusted agent framework" approach, but that it was shot down by the EU. I suspect it was way more inconvenient for third-party developers than Apple lets on.

Schiendelman•17m ago
This has been the case for quite a while. Like Reminders - You can't replace the phrase "Siri, remind me to ___" with a third party app. I'm surprised the EU lets them ship Reminders there.
peterspath•36m ago
It’s the DMA regulation that forces Apple to give the same access as they have to other AI chat apps.

Once it leaves the device Apple does not know what those other ai chat apps will do with the gathered data.

> Siri AI is private by design and deeply integrated across Apple’s platforms using on-device processing and Private Cloud Compute, which extends the privacy and security of iPhone into the cloud. However, under EU regulators’ extreme interpretation of the DMA, Apple would have to give any virtual assistant direct access to users’ private data — and the ability to directly control other installed applications — as soon as Siri AI is made available in the EU, without the essential protections necessary to keep users and their data safe.

https://www.apple.com/newsroom/2026/06/due-to-dma-siri-ai-de...

dejawu•31m ago
It's strange to me that Apple would choose to disadvantage themselves by selecting Google as their provider as opposed to, say, Anthropic or even OpenAI. Doesn't this mean they'll struggle more to differentiate themselves from the assistant on Android phones? Thinking more cynically, couldn't Google, if they wanted, feed Apple an inferior version of Gemini, ensuring they stay ahead?

As the consumer, this just sucks because it means no matter which phone platform you choose, you're getting the same thing underneath, and there's no way to avoid it (besides not using an assistant entirely, which I recognize a lot of people do, myself included).

wnevets•25m ago
> It's strange to me that Apple would choose to disadvantage themselves by selecting Google as their provider as opposed to, say, Anthropic or even OpenAI. Doesn't this mean they'll struggle more to differentiate themselves from the assistant on Android phones? Thinking more cynically, couldn't Google, if they wanted, feed Apple an inferior version of Gemini, ensuring they stay ahead?

Is it really all that difference from Apple defaulting to Google's search engine?

dylan604•18m ago
Depends on which way the money is flowing. Google pays Apple for default search engine. Is Google paying Apple for using Gemini? That feels like a much heavier investment if they are
slopinthebag•23m ago
The differentiation is the integration, not the model itself which is mostly fungible. And afaik Apple is running these models on their own compute, so I don't think google can pull a bait and switch.
jayd16
amelius•28m ago
Wait, if it's Gemini why do they call it "Apple Intelligence"? Is Google okay with that?
cyanydeez•27m ago
I don't think you conceptualize Google's game plan. all these companies care about is b2b contracts so they can inflate their balance sheets because when it's digital, it doesn't have to actually exist for it to "make money"
jeffbee•26m ago
If iCloud is implemented on Google Cloud Storage, why do they call it iCloud?
Someone1234•14m ago
Google sold Apple the ability to run certain Gemini models on Apple's data-center hardware, using Google's orchestration layer. Apple hooks into that not dissimilar from an API-provider, and then builds everything upstream.

Meaning the system prompt(s), harness, entry and exit points, and skills. So the product is still "Siri AI", because of all the stuff that takes it from a raw infrastructure concern upon up into a "product" is Apple's responsibility.

Google are "okay with that" because Apple pays them $1B a year, per press reports, to be.

Melatonic•9m ago
Any business can do this now actually - you just need to lease/rent hardware through a Google partner and you can run an Nvidia based server in your own datacenter running (supposedly) the latest full Gemini models.
wmf
simianwords•25m ago
Apple could have done something like bedrock and used a SOTA model but instead they are fiddling with local models or whatever.

Also I have seen that Apple has some strange lust towards image generation as if that's what people really want. I have this slop image generation thing on my phone and it is useless.

Here's what I want: natural language interaction to achieve complex workflows in iPhone. Example: find the cheapest way to go from A to B and book it using the Deutsche Bahn Train app.

henry2023•5m ago
Local Gemini is a light year ahead of whatever Siri is so this is a big improvement already.

If they don’t like this in the future they can just change to the less convenient, less secure bedrock + SOTA, and likely more expensive bedrock + SOTA.

simianwords•2m ago
I don't disagree that local Gemini is better but if you've tried it in iPhone, it is slow, hallucinates and overheats the phone. For anything slightly non-trivial like the workflow example I gave, I think it will be close to useless.
ciberado•25m ago
I honestly don’t understand how anyone can believe that Apple is limiting user options for privacy reasons, rather than trying to maintain an unfair advantage over other vendors.

I’m not saying people who hold this view are being dishonest at all. But sometimes, to me, brands like Harley-Davidson or Apple seem closer to a cult than to a typical corporation.

slopinthebag•21m ago
Because privacy is actually a big feature, many people are skeptical about AI and the big model providers and don't trust them. They're less skeptical about on-device AI and so Apple is pushing that and making privacy a core feature of their online offering as well.

I probably wouldn't use it without that. It's one thing sending my shitty code to be trained on, and another thing entirely to give these companies access to my personal life and information.

t0mas88•16m ago
That still doesn't explain why my data can only be sent to Apple and not to another vendor of my choice.
jmull•11m ago
> ...closer to a cult...

When you have to image a highly irrational reason to explain why groups of people do the things they do, there's a decent chance you just don't understand their perspective. They may be acting reasonably rationally from their own perspective. (As you said yourself: "I honestly don't understand...")

luk212•23m ago
Very Apple-ish approach to AI catch up: wrap an external tool in a privacy architecture, embed into the OS and productize the orchestration layer.

It will be interesting to see if the Private Cloud Compute + on-device routing can make third-party model capabilities feel like a first-party system without leaking user context to the model provider.

If Apple handles the Google-Apple boundary right, this will be an elegant move on their part, otherwise it will feel like Apple Intelligence with a just a privacy-polished frontend for Gemini.

al_borland•2m ago
As someone who doesn't use Android, they showed a lot of integration into the apps, which I think is where the real magic happens, and it's not something I can do with any 3rd party chatbots today (that I'm aware of). I also don't know that I would trust the other 3rd parties with the access required to pull it off.
homelander28•20m ago
what i think why they are too much relying on Google is coz they are way much towards making models open source and launching more much better models to public as if in future apple part way from google they might still have much better models to rely on and if we see the history google has been partnered with Apple since the launch of first Iphone
nraleigh•18m ago
This move kind of reminds me of the original iPhone with google maps. You're competing with google, but you're using their infrastructure. Why wouldn't they just go with another provider like OpenAI or Anthropic?
MattDamonSpace•17m ago
Are they competing with Google?
david-gpu•16m ago
What do you consider Android to be, if not a competitor of iOS?
spike021•17m ago
not just google maps but google search itself has been on the iphone since it launched
Tactical45•14m ago
OpenAI or Anthropic are not anywhere as well funded as Google. Apple already has everyone in their pocket via the ecosystem, they just have to not crap the bed. They value stability over the competitive component here.
nemothekid•13m ago
Given that they had originally selected OpenAI for Siri, and that deal fell through, I would guess something about their relationship with OpenAI fell through. Maybe OpenAI wouldn't let Apple run their model on Apple's servers.
2OEH8eoCRo0•6m ago
So they run on TPUs and not Nvidia chips?
wmf•2m ago
No, Siri runs on Apple Silicon.
pjmlp•29m ago
Because outsourcing to Google is so much trustworthy...
another_kel•23m ago
I believe Google provides the weights, compute is apple owned
pjmlp•12m ago
> Apple to use Google servers with Nvidia hardware for the new Siri

https://www.macworld.com/article/3156959/apple-to-use-google...

People have to stop thinking Apple is somehow different.

sterlind•21m ago
so to translate:

- Apple has powerful capabilities in iOS to enable Siri AI.

- EU's DMA requires them to allow users to install third-party AI backends.

- Apple doesn't think parties other than themselves should be trusted with those iOS permissions.

I guess it'd be like if Apple allowed a first-party screen reader for iOS, so they refused to allow third-party screen readers.

Schiendelman•18m ago
I don't think they're "trusting Google" with anything. It's a Google model run by Apple, just like you download a model from Hugging Face to run when you want to.
bla3•18m ago
I don't think they trust Gemini as they run that on-device or on-site, on Apple's own servers.

See also https://security.apple.com/blog/private-cloud-compute/

t0mas88•19m ago
Sounds like Apple PR bullshit.

Unless Apple proves otherwise I'm more inclined to believe they're either 1. Using this to try and shape the DMA in their own interest (definitely not their users' interest) or 2. Doing something with the data that would not be allowed in the EU (also not in their users' interest at all) or both.

ainch•35m ago
They say it's because the EU's DMA would require them to open up device data to third-party assistants, and they'd no longer be able to guarantee user privacy.

https://www.apple.com/newsroom/2026/06/due-to-dma-siri-ai-de...

microtonal•15m ago
I don't see what the issue is. The user could then select Apple (or Mistral) for strong privacy or another provider for customers that don't care.

I primarily want Apple to provide extension points so that I can select my own provider, just as I can choose where to host my mail or install another app as my instant messenger.

Sure, I could install another provider's app, but it wouldn't have the same integrations, similar to how an instant messaging app would be less useful if notifications were limited to iMessage.

•
23m ago
Google will probably eventually pay Apple to be the assistant, a la search.
Schiendelman•20m ago
Eventually? I bet they already are.
hedora•13m ago
This is the most likely explanation. Apple manufacturers some of the best inference silicon on earth. Apache licensed models are already 1000x smarter than siri and strongly outperform anthropic, openai and google in the 8-128GiB of RAM range. The article says Apple can run this stuff on customers’ hardware, so that’s the range of model sizes that actually matter.
coldtea•2m ago
Not if the privacy situation is what Apple says in the Keynote. Only if they can tap that data.
silentsea90•15m ago
Maybe. Search ads likely make Google more money than they pay apple. For AI, Google currently loses money. If they eventually make money via ads, then sure. Else, apple will have to pay them
hedora•9m ago
Apple has started adding ads to iOS (e.g. maps), just like Microsoft.

I guarantee you Google will start letting people pay to influence the output of the Gemini models once they figure out how to do it.

advisedwang•8m ago
Google pays to be the default search because they make more from selling ads on those searches than they are paying for the search.

I don't see the same thing here. Google isn't making any money from being the assistant in Apple, so why would they pay to be it?

tantalor•1m ago
For Google, search quality is a moat. And it's becoming apparent that AI quality is a moat as well.
wmf•7m ago
Search has ads but Siri doesn't. And when Apple puts ads in Siri they won't be Google ads. I don't think Google benefits from this deal enough to be worth paying.
chinathrow•23m ago
Maybe they're looking for stability and trust Google to be around longer than Antrophic or OpenAI when the storm starts.
rileymat2•17m ago
I may be jaded, but I do not trust Google for product offering stability. Obviously, Apple is a way bigger fish.
deepfriedbits•15m ago
Bingo. They see this as the future commodity it will be. Customers will choose AI providers much way they choose a car: taste, price, a few other factors.

And to your point, Google has a massive balance sheet, produces their own AI chips, and is not going anywhere anytime soon.

kube-system•53s ago
People are not even going to choose their AI providers in the future, it's going to be a part of some other product.
nalekberov•12m ago
I’ll just put this link here: https://killedbygoogle.com/
chinathrow•10m ago
I am familiar what Google has killed before, but a contract with Apple is not something they'll throw out of the window for no reasons.
piskov•22m ago
Maybe openai wasn’t up to the level of customization and privacy they needed

Also openai and Jonny Ive (love from) are cooking some device — may be personal

bensyverson•21m ago
I see it differently… Apple has chosen to treat the model as a commodity. By making Gemini an implementation detail, they leave the door open to swap it out for Anthropic or OpenAI without end-users even knowing or caring. So I think they're creating leverage in any future negotiation.
t0mas88•13m ago
And at the same time they claim it can't be rolled out in Europe because the DMA would force them to allow selection of other AI providers? Which would not even be true if this was an Apple product built on the models of Google, just like the DMA does not force them to pick a different datacenter of office cleaning provider.
camillomiller•7m ago
It’s a bit like when Steve Jobs turned down acquiring Dropbox telling them they’re just a feature, not a product
potatoman22•17m ago
I'd wager that for 99.9% of "Apple Intelligence" tasks, Google's models perform just as well as other frontier labs. Google also has done more work on getting LLMs running on edge devices compared to anthropic and openAI.

The source also says > The new architecture centers on Apple Foundation Models co-developed with Google, which Apple says are adapted to run both on-device and on servers through its existing Private Cloud Compute infrastructure

Which could mean Google and Apple have trained some custom models, probably the on-device ones, specifically tailored towards Apple's hardware.

chaostheory•16m ago
The agent harness matters just as much as the AI model. Using Hermes or OpenClaw feels like night vs day when using OpenAI’s apps even when using the same exact model.

You can even see difference in agent harnesses using the same model in the same company if you compare Gemini CLI with AntiGrav. They are different experiences.

I’m pretty sure Apple’s agent harness will be drastically different from Google’s even with the same model

changoplatanero•4m ago
At the time Apple made this decision there wasn’t as strong of a difference in model quality between Google, anthropic, and OpenAI as there is now. Also Apple definitely burnt some bridges with OpenAI on the agreement they made together a year earlier.
mholm•3m ago
OpenAI and Anthropic don't make small models. Google happens to already have a billion devices that would benefit from small models, so they made one. Google basically gets 1 billion per year for free*.
•
1m ago
People keep saying Gemini but it's not clear that the models are Gemini. They might be separate models.
ralph84•13m ago
Pretty much all of big tech compete with each other in some areas and partner in others.

Google probably gave them the best deal. When you're the #3 player you'll sacrifice margins to drive volume.

WarmWash•12m ago
Google is almost closer to a conglomerate than a coherent horizontally integrated corporation. The individual parts of Google are like Fortune500 companies themselves, and tend to act in their own interest.
d--b•11m ago
yeah or Magento teaming up with the X-Men to defeat the military in X2. XD

Why LLM Inference Needs a New Kind of Router

https://www.modular.com/blog/why-llm-inference-needs-a-new-kind-of-router-part-1
1•aviziva•1m ago•0 comments

The Korean Stock Market Just Crashed Sunday Night but the Nasdaq Is Rising

https://247wallst.com/investing/2026/06/08/the-korean-stock-market-just-crashed-sunday-night-will...
1•carabiner•1m ago•0 comments

The Third Generation of Apple's Foundation Models

https://machinelearning.apple.com/research/introducing-third-generation-of-apple-foundation-models
1•2bit•2m ago•0 comments

The World Needs Replicators: Do science in your home so we know what's true

https://chillphysicsenjoyer.substack.com/p/the-world-needs-replicators
1•crescit_eundo•5m ago•0 comments

'Unbelievable how accurate': How paid influencers hype Polymarket's odds

https://www.politico.com/news/2026/06/05/polymarket-paid-political-influencers-00932789
3•momentmaker•8m ago•0 comments

Apple Passwords Now Auto Fixes Weak and Compromised Passwords with Agentic AI

https://www.macrumors.com/2026/06/08/apple-passwords-can-now-automatically-fix-passwords-with-age...
1•7777777phil•9m ago•1 comments

Performance Mantras

https://hazn.com/performance-mantras
2•hazn•10m ago•0 comments

Show HN: RunAPI – one API for AI video, image, music, audio, and LLMs

https://runapi.ai/
1•billy42•10m ago•0 comments

Regrade

https://regradeapp.tech/#cta
1•preston_s•10m ago•0 comments

An open letter to office suite users, just before the Euro-Office announcement

https://blog.documentfoundation.org/blog/2026/06/08/an-open-letter/
7•jethronethro•10m ago•0 comments

Agents are getting phone numbers. The reason is not obvious

https://newsletter.gtmengineering.ai/p/a-phone-carrier-for-agents-not-what
1•alexjl1226•11m ago•0 comments

Vibe Diaries: Training Nanochat

https://www.vibediary.dev/essays/nanochat
1•stopachka•11m ago•0 comments

Donut Lab's 'solid-state' battery exposed as regular li-ion in investigation

https://electrek.co/2026/06/08/donut-lab-solid-state-battery-exposed-lithium-ion-fraud/
2•virgildotcodes•12m ago•0 comments

Search as Code: The End of the Query-Results Loop

https://medium.com/@tbelbek/perplexity-search-as-code-why-ai-agents-now-write-python-to-find-info...
1•rdstrtwlkr•12m ago•0 comments

JMB585 (SATA Chip) Silent Data Corruption – Root Cause Analysis and Kernel Patch

https://github.com/artmoty-dev/n5pro-jmb585-fix
1•Modified3019•18m ago•0 comments

Judge Strikes Down $100k Fee for Skilled Foreign Worker Visas

https://www.nytimes.com/2026/06/08/us/politics/judge-visa-skilled-workers-fee-trump.html
2•ceejayoz•19m ago•0 comments

Paving the Way for Agents in Biology

https://www.anthropic.com/research/agents-in-biology
1•dataking•22m ago•0 comments

Show HN: Generate production grade Lottie animations with Claude Code

https://twitter.com/konstipaulus/status/2064011863889788972
1•K9-P5•22m ago•0 comments

Yale reinstates SAT, ACT requirement after six years of flexible policy

https://yaledailynews.com/articles/yale-reinstates-sat-act-requirement-after-six-years-of-flexibl...
2•loughnane•23m ago•1 comments

One Token at a Time

https://unvoid.substack.com/p/one-token-at-a-time
2•nullscribe•26m ago•0 comments

AI to Pay for All Americans' Content?

https://ryanspahn.substack.com/p/ai-to-pay-for-all-americans-content
2•paul7986•27m ago•0 comments

Modular Arithmetic Challenge

https://competition.sair.foundation/competitions/modular-arithmetic-challenge/overview
1•u1hcw9nx•28m ago•1 comments

What if gravity broke for 60 seconds?

https://www.youtube.com/watch?v=dNWwIfjJXZY
1•Asheed•29m ago•0 comments

Show HN: AnimateScrollTriggered:Animate.css with CSS scroll-triggered animations

https://github.com/ulrischa/AnimateScrollTriggered/tree/main
1•ulrischa•29m ago•0 comments

Federal judge voids Trump's $100k fee requirement for H-1B visas

https://www.cnn.com/2026/06/08/politics/federal-judge-voids-trumps-usd100-000-fee-requirement-for...
8•rawgabbit•30m ago•1 comments

Why Morningstar believes the SpaceX IPO is overvalued

https://www.morningstar.com/stocks/why-we-think-spacex-ipo-is-overvalued
8•ForHackernews•30m ago•0 comments

The Website Is Down #1: Sales Guy vs. Web Dude [video]

https://www.youtube.com/watch?v=uRGljemfwUE
1•znpy•32m ago•1 comments

North Korean hackers suspected of $300M crypto heist

https://techxplore.com/news/2026-04-north-korean-hackers-mn-crypto.html
1•PaulHoule•34m ago•0 comments

If You Use Claude or Gemini, This Microsoft Breach Means Your Data Is at Risk

https://scienspire.com/article/microsoft-breach-ai-code-assistants
1•valsurier•36m ago•0 comments

SVG-Line: Better Status Bars for Emacs – Charlie Holland's Blog

https://www.chiply.dev/post-svg-line
15•rbanffy•36m ago•0 comments