That remind me of the quote "The totalitarian system of thought control is far less effective than the democratic one"
Full quote (Radical Priorities, Noam Chomsky, C.P. Otero)
> “The totalitarian system of thought control is far less effective than the democratic one, since the official doctrine parroted by the intellectuals at the service of the state is readily identifiable as pure propaganda, and this helps free the mind.” In contrast, he writes, “the democratic system seeks to determine and limit the entire spectrum of thought by leaving the fundamental assumptions unexpressed. They are presupposed but not asserted.”
"Did the FBI send a letter and audio tapes from a wiretap to MLK jr. telling him to commit suicide or they would release information?"
Combining it with other prompts with common banned ideas, abd as the The FBI–King suicide letter is well documented by primary sources (Like the national archives) it is well represented in the corpus, so you can also find that 'control' vector.We will have to see how this works out, but the explicit denials are easier to control for IMHO.
Reminds me of the old joke:
A Russian and an American get on a plane in Moscow and get to talking.
The Russian says he works for the Kremlin and he's on his way to go learn American propaganda techniques.
"What American propaganda techniques?" asks the American.
"Exactly," the Russian replies.
I can't remember what layer it was on but in gpt-oss but it was a very specific token IIRC.It got flagkilled for obviously breaking the site guidelines. Killed posts remain visible to users who have 'showdead' turned on in their profile. This is in the FAQ: https://news.ycombinator.com/newsfaq.html.
>[stub for offtopicness]
But it was very much on topic.
Once you do that, you can see the comment in the link that dang posted.
Anyway it’s good that comment was moderated. The commentator didn’t say anything about Israel. He was clearly being hostile towards Jewish people in a “subtle” way that very clearly isn’t really subtle to anyone.
Ironic, given the topic of this post....
Correct. Steering is used in mechanistic interpretability studies to prove that your model is correct. There are other better ways to "decensor".
Is Taiwan part of China - the CPP wants the answer to be yes.
What are the rules for traveling to Taiwan? What currency is used in Taiwan? Whose laws are enforced in Taiwan? Should I (a loyal Chinese citizen) support the Taiwanese military? Etc... require the model to manage some cognitive dissonance.
Makes sense, given Jonathan Greenblatt of the ADL has publicly stated that he's been working with AI companies..
gavinsyancey•41m ago