It doesn't seem worth it to try to follow the math to see if there is something interesting.
https://www.youtube.com/watch?v=Xx4Tpsk_fnM
"The Hard Problem of Controlling Powerful AI Systems" (Computerphile)
https://www.youtube.com/watch?v=JAcwtV_bFp4
Attempting to guide statistical salience of LLM reasoning model procedures, usually just created an evasive interface facade in the output. =3
causalmodels•2w ago
dwattttt•2w ago
A novel use of the word "reliable"? Jokes aside, either they mean the FPR as the opposite of what you'd expect, the table is not representative of their approach, or they're just... really optimistic?
godelski•2w ago
From Sec 3, end of second to last paragraph
by program hash, and *bounds false positives via the chosen percentile and gap parameters.*I believe this is a choice, though I think it is suspect that the FPR is pushed this high to get the TP results.
Disclaimer: I only gave this a very cursory skim so don't rely on me too much