This whole article reads like pre AI thinking. Mentinoning briefly AI in intro is not enough. IMO the really interesting issue, considering our AI future, won’t be discovering your values through introspection, it's figuring out how to translate human values into mathematical constraints that don't immediately get gamed by optimizers. On the other hand, core validator questions are actually pretty solid, especially the "does the inverse strike a nerve" test.
nkko•1h ago