What AI safety work should altruists do? For example, AI companies are self-interestedly working on RLHF, so there’s no need for altruists to work on it. (And even stronger, working on RLHF is actively harmful because it advances capabilities.)
What AI safety work should altruists do? For example, AI companies are self-interestedly working on RLHF, so there’s no need for altruists to work on it. (And even stronger, working on RLHF is actively harmful because it advances capabilities.)