Conor—yes, I understand that you’re making judgment calls about what’s likely to be net harmful versus helpful.
But your judgment calls seem to assume—implicitly or explicitly—that ASI alignment and control are possible, eventually, at least in principle.
Why do you assume that it’s possible, at all, to achieve reliable long-term alignment of ASI agents? I see no serious reason to think that it is possible. And I’ve never seen a single serious thinker make a principled argument that long-term ASI alignment with human values is, in fact, possible.
And if ASI alignment isn’t possible, then all AI ‘safety research’ at AI companies aiming to build ASI is, in fact, just safety-washing. And it all increases X risk by giving a false sense of security, and encouraging capabilities development.
So, IMHO, 80k Hours should re-assess what it’s doing by posting these ads for jobs inside AI companies—which are arguably the most dangerous organizations in human history.
Conor—yes, I understand that you’re making judgment calls about what’s likely to be net harmful versus helpful.
But your judgment calls seem to assume—implicitly or explicitly—that ASI alignment and control are possible, eventually, at least in principle.
Why do you assume that it’s possible, at all, to achieve reliable long-term alignment of ASI agents? I see no serious reason to think that it is possible. And I’ve never seen a single serious thinker make a principled argument that long-term ASI alignment with human values is, in fact, possible.
And if ASI alignment isn’t possible, then all AI ‘safety research’ at AI companies aiming to build ASI is, in fact, just safety-washing. And it all increases X risk by giving a false sense of security, and encouraging capabilities development.
So, IMHO, 80k Hours should re-assess what it’s doing by posting these ads for jobs inside AI companies—which are arguably the most dangerous organizations in human history.