Given several models, assuming only that some unknown subset is "safe", can we construct a single model as safe as that subset? This reduces obtaining a trustworthy model to a plausibly easier task.
conartist6•2mo ago
AI output is modeled on human behavior. Are humans safe?
cryptohell•2mo ago