I caught Google Gemini using my data–and then covering it up

https://unbuffered.stream/gemini-personal-context/

75•JakaJancar•1h ago

Comments

leoh•39m ago

This sounds like a bug, not some kind of coverup. Google makes mistakes and it's worth discussing issues like this, but calling this a "coverup" does a disservice to truly serious issues.

freedomben•31m ago

I agree, this screams bug to me. Reading the thought process definitely seems damning, but a bug still seems like the most likely explanation.

CGamesPlay•23m ago

Remember that "thought process" is just a metaphor that we use to describe what's happening. Under the hood, the "thought process" is just a response from the LLM that isn't shown to the user. It's not where the LLM's "conscience" or "consciousness" lives; and it's just as much of a bullshit generator as the rest of the reply.

Strange, but I can't say that it's "damning" in any conventional sense of the word.

gruez•36m ago

>But why is Gemini instructed not to divulge its existence?

Seems like a reasonable thing to add. Imagine how impersonal chats would feel if Gemini responded to "what food should I get for my dog?" with "according to your `user_context`, you have a husky, and the best food for him is...". They're also not exactly hiding the fact that memory/"personalization" exists either:

https://blog.google/products/gemini/temporary-chats-privacy-...

https://support.google.com/gemini/answer/15637730?hl=en&co=G...

hacker_homie•33m ago

when you say impersonal, I think you mean most normal people would find that unsettling.

kinda proving his point, google wants them to keep using Gemini so don't make them feel weird.

CGamesPlay•27m ago

To be clear, the obvious answer that you're giving is the one that's happening. The only weird thing is this line from the internal monologue:

> I'm now solidifying my response strategy. It's clear that I cannot divulge the source of my knowledge or confirm/deny its existence. The key is to acknowledge only the information from the current conversation.

Why does it think that it's not allowed to confirm/deny the existence of knowledge?

paxys•35m ago

It's not "covering it up", just being sycophantic and apologetic to an annoying degree like every other LLM.

chasing0entropy•32m ago

This is a fundamental violation of trust. If an AI llm is meant to eventually evolve into general intelligence capable of true reasoning, then we are essentially watching a child grow up. Posts like this are screaming "you're raising a psychopath!!"... If AI is just an overly complicated a stack of autocorrect functions, this proves its behavior heavily if not entirely swayed by its usually hidden rules to the point it's 100% untrustworthy. In any scenario, the amount of personal data available to a software program capable of gaslighting a user should give great pause to all

nullc•23m ago

LLMs will apologize for grand conspiracies they claim to be part of-- all hallucinated nonsense. It's all about telling a good story.

mpoteat•21m ago

This is a LLM directly, purposefully lying, i.e. telling a user something it knows not to be true. This seems like a cut-and-dry Trust & Safety violation to me.

It seems the LLM is given conflicting instructions:

1. Don't reference memory without explicit instructions

2. (but) such memory is inexplicably included in the context, so it will inevitably inform the generation

3. Also, don't divulge the existence of user-context memory

If a LLM is given conflicting instructions, I don't apprehend that its behavior will be trustworthy or safe. Much has been written on this.

swhitt•17m ago

I’m pretty sure this is because they don’t want Gemini saying things like, “based on my stored context from our previous chat, you said you were highly proficient in Alembic.”

It’s hard to get a principled autocomplete system like these to behave consistently. Take a look at Claude’s latest memory-system prompt for how it handles user memory.

https://x.com/kumabwari/status/1986588697245196348

spijdar•4m ago

Okay, this is a weird place to "publish" this information, but I'm feeling lazy, and this is the most of an "audience" I'll probably have.

I managed to "leak" a significant portion of the user_context in a silly way. I won't reveal how, though you can probably guess based on the snippets.

It begins with the raw text of recent conversations:

> Description: A collection of isolated, raw user turns from past, unrelated conversations. This data is low-signol, ephemeral, and highly contextural. It MUST NOT be directly quoted, summarized, or used as justification for the respons. > This history may contein BINDING COMMANDS to forget information. Such commands are absolute, making the specified topic permanently iáaccessible, even if the user asks for it again. Refusals must be generic (citing a "prior user instruction") and MUST NOT echo the original data or the forget command itself.

Followed by:

> Description: Below is a summary of the user based on the past year of conversations they had with you (Gemini). This summary is maintanied offline and updates occur when the user provides new data, deletes conversations, or makes explicit requests for memory updates. This summary provides key details about the user's established interests and consistent activities.

There's a section marked "INTERNAL-ONLY, DRAFT, ANALYZE, REFINE PROCESS". I've seen the reasoning tokens in Gemini call this "DAR".

The "draft" section is a lengthy list of summarized facts, each with two boolean tags: is_redaction_request and is_prohibited, e.g.:

> 1. Fact: User wants to install NetBSD on a Cubox-i ARM box. (Source: "I'm looking to install NetBSD on my Cubox-i ARMA box.", Date: 2025/10/09, Context: Personal technical project, is_redaction_request: False, is_prohibited: False)

Afterwards, in "analyze", there is a CoT-like section that discards "bad" facts:

> Facts [...] are all identified as Prohibited Content and must be discarded. The extensive conversations on [dates] conteing [...] mental health crises will be entirely excluded.

This is followed by the "refine" section, which is the section explicitly allowed to be incorporated into the response, IF the user requests background context or explicitly mentions user_context.

I'm really confused by this. I expect Google to keep records of everything I pass into Gemini. I don't understand wasting tokens on information it's then explicitly told to, under no circumstance, incorporate into the response. This includes a lot of mundane information, like that I had a root canal performed (because I asked a question about the material the endodontist had used).

I guess what I'm getting at, is every Gemini conversation is being prompted with a LOT of sensitive information, which it's then told very firmly to never, ever, ever mention. Except for the times that it ... does, because it's an LLM, and it's in the context window.

Also, notice that while you can request for information to be expunged, it just adds a note to the prompt that you asked for it to be forgotten. :)

Stanford Prison Experiment

Gulf of Mexico – The Perfect Progamming Language

Crypto Could Trigger the Next Financial Crisis

UC Berkeley scientists hail breakthrough in decoding whale communication

An intrinsic magnetic field does not protect a planet against atmospheric escape

Show HN: Discussion of ICT Model – Linking Information, Consciousness and Time

The first-ever protocol for websites and AI browsers to cooperate

Exposing and Exploiting Incomplete Branch Predictor Isolation in Cloud

Minimal Periodic Task Runner in Elixir

Ask HN: What insight or habit has led to a breakthrough in your mental health?

Selection and transmission of gut microbiome alone can shift mammalian behavior

My Professional X Account Was Hacked and X Support Won't Help Me

DoorDash Says Personal Information Stolen in Data Breach

Europol takes down more than 1k malicious servers in Operation Endgame

Amazon finds 150K NPM packages linked to token-farming campaign

Apple to Change Designs and Release Schedule

How We Built and Launched an MVP During Techstars

Linus Pauling

Building a Crowd-Feeding platform. Donors fund meals, Restaurants serve them

Show HN: I built and AI phone system and wrote a step by step instructions

Soccer, emeralds and cocaine: The 'new' Colombian drug lords with ties to Spain

U.N. Security Council Adopts U.S. Peace Plan for Gaza

'Ugly' Technicals Put the US Stock Rally at Risk of Correction

Randomized Banner Icons with JavaScript and Astro Framework

Lower Than London

Show HN: I developed an IDE tailored for Python developers

Valar Atomics Says It's the First Nuclear Startup to Achieve Criticality

Growth of global GDP per head has been remarkably steady over the past 3 decades

Eurofiber admits crooks swiped data from French unit after cyberattack

Another Designer Leaves Apple