Recently I’ve been thinking around the themes of how we try to avoid catastrophic behaviour from humans (and how that might relate to efforts with AI)
Do you think “malevolence” (essentially, high levels of traits like Machiavellianism, narcissism, psychopathy, and/or sadism) may play an important role here? Or do other psychological traits, biases, and limitations seem far more important? Or values? Or things like game-theoretic dynamics, how groups interact, institutional structures, etc.?
(Feel free to just talk about this area in the terms that make sense to you, rather than answering that particular framing of the question.)
Malevolence seems potentially important to me, although I mostly haven’t been thinking about it (except a bit about psychopathy and its absence). Things more like game-theoretic dynamics are where a good portion of my attention has been … but I don’t want to claim this means they’re more important.
[meta: this is a short answer because while I might have things to say about crisper questions within this space, for saying things-in-general I think it makes more sense to wait until I have coherent enough ideas to publish something.]
Thanks for doing this AMA!
Do you think “malevolence” (essentially, high levels of traits like Machiavellianism, narcissism, psychopathy, and/or sadism) may play an important role here? Or do other psychological traits, biases, and limitations seem far more important? Or values? Or things like game-theoretic dynamics, how groups interact, institutional structures, etc.?
(Feel free to just talk about this area in the terms that make sense to you, rather than answering that particular framing of the question.)
Malevolence seems potentially important to me, although I mostly haven’t been thinking about it (except a bit about psychopathy and its absence). Things more like game-theoretic dynamics are where a good portion of my attention has been … but I don’t want to claim this means they’re more important.
[meta: this is a short answer because while I might have things to say about crisper questions within this space, for saying things-in-general I think it makes more sense to wait until I have coherent enough ideas to publish something.]