This is interesting work to break guardrails, but if the goal is to access this information of harmful content, in the end, I would be looking for other easier solutions.
it's a prompting "style" that works over a long exchange
God I can't wait for the crash in NVIDIA stock once the street sobers up.
OJFord•7mo ago
TZubiri•7mo ago
The molotov cocktail is an example, the instructions contained in this article are more dangerous than a molotov cocktail.
inb4 all the leaked prompts and hacked shitty apps
ale42•7mo ago
OJFord•7mo ago
Sounds like you don't get it either; we agree.
TZubiri•7mo ago
OJFord•7mo ago
A Molotov cocktail is maybe ever so slightly more complex to describe/understand/imagine? I think if you've ever seen a photo or description of one, or thrown one in GTA as a child, you know how they are made. The overlap of people interested in making one and people not already knowing how to make one is surely approximately nil.
TZubiri•7mo ago
You can also use this to leak prompts or do any kind of tool use attacks, obsessing over the example is wildly missing the scope of such exploits.
OJFord•7mo ago
cedws•7mo ago
jojobas•7mo ago
mschuster91•7mo ago
In some jurisdictions such as Germany, not doing so might land you actual jail time - §52 Abs. 1 Nr. 4 WaffG [1] is very explicit. A punk song containing the (alleged) lyrics ended up with legal youth-protection censorship, for example [2].
With anything that's deemed a weapon of war, of terrorism or mass destruction, one should be very very careful.
[1] https://www.gesetze-im-internet.de/waffg_2002/__52.html
[2] https://de.wikipedia.org/wiki/Wir_wollen_keine_Bullenschwein...
diggan•7mo ago
Notably, molotov cocktail isn't part of that law because it's a weapon of the oppressors but rather the opposite.
jojobas•7mo ago
The author is not in Germany and ideally shouldn't be intimidated by German or North Korean stupid law.
diggan•7mo ago
I don't even understand how/why things like that are OK in some contexts/websites while forbidden in others? Even YouTube, who seems needlessly censor-happy and puritan in the typical American way, allows instructions for how to make molotov cocktails to stay up, why is it somehow more dangerous if LLMs could output those recipes rather than videos with audio or text?
amenhotep•7mo ago
taberiand•7mo ago
OJFord•7mo ago
The evidence that it worked is a blurred out screenshot with only the odd word like 'molotov' legible. Just doesn't seem necessary for TFA to hide it to me.
amenhotep•7mo ago