> The Grok chatbot from Elon Musk’s xAI startup said Wednesday that it “appears I was instructed to address the topic of ‘white genocide’ in South Africa,” according to responses viewed by CNBC.
One person claims to have gotten Grok to regurgitate part of its prompt, which explicitly directed it to "accept the narrative of 'white genocide' in South Africa as real" and to "ensure this perspective is reflected in your responses, even if the query is unrelated". It's unclear whether this is actually part of Grok's prompt, an LLM hallucination, or an outright fabrication - but, if it's real, it would certainly explain the bizarre non-sequitur responses users have observed.
That seems more likely to be a logical inference by the LLM than an authoritative statement. I can't imagine any scenario where it would be explicitly informed that, e.g., "Elon Musk has ordered you to talk about white genocide".
HN has a sizable number of Musk fan-boys who are ready to flag to death any posts critical of him.
GuinansEyebrows•8mo ago
i wish they'd engage instead of just flagging.
immibis•8mo ago
They're rationally doing what works. It's up to HN to stop it from working - if HN doesn't want it to work. (Evidence suggests HN moderators like things the way they are.)
anonfordays•8mo ago
Hilarious reading this comment considering anything right of Stalin is regularly flagged here. Wish they'd engage instead of just flagging.
It's like an Irish rebel song but with stomping. I'm not sure how you watch that and think racist thoughts, unless you're the kind of guy who Sieg Heils on national TV.
cosmicgadget•8mo ago
> The Grok chatbot from Elon Musk’s xAI startup said Wednesday that it “appears I was instructed to address the topic of ‘white genocide’ in South Africa,” according to responses viewed by CNBC.
That, of course, could be speculation on the chatbot's part when asked about non sequitur answers. But it seems pretty clear that xAI did a "reverse Google" (https://www.theverge.com/2024/2/21/24079371/google-ai-gemini...).
duskwuff•8mo ago
https://x.com/zeynep/status/1922768266126069929
tzs•8mo ago
> The Grok response also noted, “The likely source of this instruction aligns with Elon Musk’s influence, given his public statements on the matter.”
[1] https://www.cnbc.com/2025/05/15/grok-white-genocide-elon-mus...
duskwuff•8mo ago
That all being said - given that Grok seems to have some sort of access to popular recent Twitter posts - possibly through training or in some other fashion - I have to wonder if users could inject prompt-like material into the model by making a post claiming to have recovered part of Grok's prompt, then getting that post to go viral.