For anyone who isn't keeping up there is also work being done [0] to understand how models model ethical considerations internally. Mainly, one suspects, to make the open models less ethical on demand rather than to support alignment. Turns out that models tend to learn some sort of "how moral is this?" axis internally when refusing queries that can be identified and interfered with.
soletta•1h ago
plastic-enjoyer•23m ago
cyanydeez•18m ago
...I think we might already have those people running AI companies.