Steven, well, I think the Cluster B personality disorders (including antisocial, borderline, histrionic, and narcissistic disorders) are probably quite important to understand in AI alignment.
Antisocial personality disorder (which is closely related to the more classical notion of ‘psychopathy’) seems likely to characterize a lot of the ‘bad actors’ who might (mis)use AI for trolling, crime, homicide, terrorism, etc. It also provides a model of how we don’t want AGIs to behave.